Discover the SciOpen Platform and Achieve Your Research Goals with Ease.
Search articles, authors, keywords, DOl and etc.
Author name disambiguation (AND) is a central task in academic search, which has received more attention recently accompanied by the increase of authors and academic publications. To tackle the AND problem, existing studies have proposed various approaches based on different types of information, such as raw document features (e.g., co-authors, titles, and keywords), the fusion feature (e.g., a hybrid publication embedding based on multiple raw document features), the local structural information (e.g., a publication's neighborhood information on a graph), and the global structural information (e.g., interactive information between a node and others on a graph). However, there has been no work taking all the above-mentioned information into account and taking full advantage of the contributions of each raw document feature for the AND problem so far. To fill the gap, we propose a novel framework named EAND (Towards Effective Author Name Disambiguation by Hybrid Attention). Specifically, we design a novel feature extraction model, which consists of three hybrid attention mechanism layers, to extract key information from the global structural information and the local structural information that are generated from six similarity graphs constructed based on different similarity coefficients, raw document features, and the fusion feature. Each hybrid attention mechanism layer contains three key modules: a local structural perception, a global structural perception, and a feature extractor. Additionally, the mean absolute error function in the joint loss function is used to introduce the structural information loss of the vector space. Experimental results on two real-world datasets demonstrate that EAND achieves superior performance, outperforming state-of-the-art methods by at least +2.74% in terms of the micro-F1 score and +3.31% in terms of the macro-F1 score.
Gupta S, Duhan N, Bansal P. An approach for focused crawler to harvest digital academic documents in online digital libraries. International Journal of Information Retrieval Research, 2019, 9(3): 23–47. DOI: 10.4018/IJIRR.2019070103.
Chikazawa Y, Katsurai M, Ohmukai I. Multilingual author matching across different academic databases: A case study on KAKEN, DBLP, and PubMed. Scientometrics, 2021, 126(3): 2311–2327. DOI: 10.1007/s11192-020-03861-3.
Martín-Martín A, Thelwall M, Orduna-Malea E, López-Cózar E D. Google Scholar, Microsoft Academic, Scopus, Dimensions, Web of Science, and OpenCitations' COCI: A multidisciplinary comparison of coverage via citations. Scientometrics, 2021, 126(1): 871–906. DOI: 10.1007/s11192-020-03690-4.
Pooja K M, Mondal S, Chandra J. A graph combination with edge pruning-based approach for author name disambiguation. Journal of the Association for Information Science and Technology, 2020, 71(1): 69–83. DOI: 10.1002/ asi.24212.
Ma Y Y, Wu Y L, Lu C Q. A graph-based author name disambiguation method and analysis via information theory. Entropy, 2020, 22(4): 416. DOI: 10.3390/e22040416.
Santana A F, Gonçalves M A, Laender A H F, Ferreira A A. On the combination of domain-specific heuristics for author name disambiguation: The nearest cluster method. International Journal on Digital Libraries, 2015, 16(3): 229–246. DOI: 10.1007/s00799-015-0158-y.
Kim J, Owen-Smith J. ORCID-linked labeled data for evaluating author name disambiguation at scale. Scientometrics, 2021, 126(3): 2057–2083. DOI: 10.1007/s11192-020-03826-6.
Chen B, Zhang J, Tang J, Cai L F, Wang Z Y, Zhao S, Chen H, Li C P. CONNA: Addressing name disambiguation on the fly. IEEE Trans. Knowledge and Data Engineering, 2022, 34(7): 3139–3152. DOI: 10.1109/TKDE.2020.3021256.
Cota R G, Ferreira A A, Nascimento C, Gonçalves M A, Laender A H F. An unsupervised heuristic-based hierarchical method for name disambiguation in bibliographic citations. Journal of the American Society for Information Science and Technology, 2010, 61(9): 1853–1870. DOI: 10.1002/asi.21363.
Fan X M, Wang J Y, Pu X, Zhou L Z, Lv B. On graph-based name disambiguation. ACM Journal of Data and Information Quality, 2011, 2(2): Article No. 10. DOI: 10.1145/1891879.1891883.
Tang J, Fong A C M, Wang B, Zhang J. A unified probabilistic framework for name disambiguation in digital library. IEEE Trans. Knowledge and Data Engineering, 2012, 24(6): 975–987. DOI: 10.1109/TKDE.2011.13.