Identification of Profiles on Multiple Social Networks

  • Mariana Barreto Pontifical Catholic University of Rio de Janeiro (PUC-Rio)
  • Sérgio Lifschitz Pontifical Catholic University of Rio de Janeiro (PUC-Rio)

Abstract


This work presents the development of a web tool designed to identify user profiles on various social media platforms, such as Twitter, Facebook and Instagram, exclusively employing text similarity measures. The study focuses on using the Levenshtein distance, a metric known for measuring the difference between sequences of characters, to evaluate the similarity between usernames and profile names, seeking to determine the effectiveness of this approach in producing accurate results in identifying matching profiles between different platforms. The research conducts an analysis of the performance and accuracy of this metric, exploring and implementing strategies to improve these aspects. Furthermore, it describes the process of creating the web tool, highlighting how it facilitates the construction of a database that associates digital profiles with real individuals. The application of Levenshtein distance allows for a more efficient and rapid identification of connections between profiles from different social networks, enhancing the recognition and analysis of users' online presence in multiple digital environments.
Keywords: databases, levenshtein, social networks

References

Gagnon, T. (2013). The disinhibition of reddit users. Adele Richardson’s Spring.

Li, Y., Peng, Y., Zhang, Z., Yin, H., and Xu, Q. (2018). Matching user accounts across social networks based on username and display name. World Wide Web, 22(3):1075–1097.

Li, Z., Lin, D., and Li, P. (2023). Across online social network user identification based on usernames. In Jiang, X., editor, Machine Learning and Intelligent Communication, pages 117–127, Cham. Springer Nature Switzerland.

Sackers, M., de Vries, A. P., and de Boer, M. H. (2017). A comparison of string distance metrics on usernames for cross-platform identification.

Shu, K., Wang, S., Tang, J., Zafarani, R., and Liu, H. (2017). User identity linkage across online social networks: A review. Acm Sigkdd Explorations Newsletter, 18(2):5–17.

Vosecky, J., Hong, D., and Shen, V. Y. (2009). User identification across multiple social networks. In 2009 First International Conference on Networked Digital Technologies. IEEE.

Zafarani, R. and Liu, H. (2013). Connecting users across social media sites: a behavioral-modeling approach. In Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 41–49.

Zhang, H., Kan, M.-Y., Liu, Y., and Ma, S. (2014). Online social network profile linkage. In Information Retrieval Technology: 10th Asia Information Retrieval Societies Conference, AIRS 2014, Kuching, Malaysia, December 3-5, 2014. Proceedings 10, pages 197–208. Springer.
Published
2024-10-14
BARRETO, Mariana; LIFSCHITZ, Sérgio. Identification of Profiles on Multiple Social Networks. In: WORKSHOP ON UNDERGRADUATE STUDENT WORK (WTAG) - BRAZILIAN SYMPOSIUM ON DATABASES (SBBD), 39. , 2024, Florianópolis/SC. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2024 . p. 51-57. DOI: https://doi.org/10.5753/sbbd_estendido.2024.243784.