Automatic Disambiguation of Author Names: Foundations, Methods and Open Issues

  • Anderson A. Ferreira Universidade Federal de Ouro Preto (UFOP)
  • Alberto H. F. Laender Universidade Federal de Minas Gerais (UFMG)

Resumo


This tutorial is based on our book “Automatic Disambiguation of Author Names in Bibliographic Repositories” and aims to spread the problem and its challenges among the SBBD community. Author name ambiguity problem occurs when an author publishes works under distinct names or distinct authors publish works under similar names. This problem may be caused by a number of reasons, including the lack of standards and common practices, and the decentralized generation of bibliographic content. In this tutorial, we intend to present an ample view on the automatic disambiguation of author names. We start by discussing its motivational issues, defining the author name disambiguation task and presenting its foundations. Next, we describe some methods proposed by our research group, as well as some recent approaches to author name disambiguation. Finally, we discuss open issues.
Palavras-chave: Author Name Ambiguity Problem, Disambiguation Methods, Bibliographic Data Repository

Referências

Boukhers, Z., & Asundi, N. B. (2022). Whois? Deep Author Name Disambiguation Using Bibliographic Data. In Linking Theory and Practice of Digital Libraries: 26th International Conference on Theory and Practice of Digital Libraries, TPDL 2022, Padua, Italy, September 20–23, 2022, Proceedings (pp. 201-215). Cham: Springer International Publishing.

Cota, R. G., Ferreira, A. A., Nascimento, C., Gonçalves, M. A., & Laender, A. H. (2010). An unsupervised heuristic‐based hierarchical method for name disambiguation in bibliographic citations. Journal of the American Society for Information Science and Technology, 61(9), 1853-1870.

Espiridião, L. V., Dias, L. L., & Ferreira, A. A. (2021). Applying Data Augmentation for Disambiguating Author Names. In Anais do XXXVI Simpósio Brasileiro de Bancos de Dados (pp. 109-120). SBC.

Ferreira, A. A., Gonçalves, M. A., & Laender, A. H. (2012). A brief survey of automatic methods for author name disambiguation. ACM Sigmod Record, 41(2), 15-26.

Ferreira, A. A., Gonçalves, M. A., & Laender, A. H. (2020). Automatic disambiguation of author names in bibliographic repositories. Synthesis Lectures on Information Concepts, Retrieval, and Services, 12(1), 1-146.

Ferreira, A. A., Veloso, A., Gonçalves, M. A., & Laender, A. H. (2014). Self‐training author name disambiguation for information scarce scenarios. Journal of the Association for Information Science and Technology, 65(6), 1257-1278.

Hussain, I., & Asghar, S. (2018) DISC: Disambiguating homonyms using graph structural clustering. Journal of Information Science, 44(6):830-847.

Liu,Y., Li, W., Huang, Z., & Fang, Q. (2015) A fast method based on multiple clustering for name disambiguation in bibliographic citations. Journal of the Association for Information Science and Technology, 66(3):634-644.

Santana, A. F., Gonçalves, M. A., Laender, A. H., & Ferreira, A. A. (2017). Incremental author name disambiguation by exploiting domain‐specific heuristics. Journal of the Association for Information Science and Technology, 68(4), 931-945.

Shen, Q., Wu, T., Yang, H., Wu, Y., Qu, H., & Cui, W. (2017). NameClarifier: A Visual Analytics System for Author Name Disambiguation. IEEE Transactions on Visualization and Computer Graphics, 23(1):141-150.
Publicado
25/09/2023
FERREIRA, Anderson A.; LAENDER, Alberto H. F.. Automatic Disambiguation of Author Names: Foundations, Methods and Open Issues. In: TUTORIAIS - SIMPÓSIO BRASILEIRO DE BANCO DE DADOS (SBBD), 38. , 2023, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023 . p. 179-182. DOI: https://doi.org/10.5753/sbbd_estendido.2023.25633.