Análise e Implementação de Modelos Contextuais para Desambiguação de Entidades Nomeadas em Fluxos de Mensagens
Abstract
Named entity disambiguation in message streams is a recent challenge in natural language processing. The low information content and lack of syntactic structure in this kind of text can reduce the accuracy of traditional disambiguation approaches. In this paper, we propose contextual models of Twitter messages that reduce reliance on the message text itself. Our models are based on the behavior of social network users and on the time at which each message was posted. Our results show that these models perform better than approaches that consider only textual attributes.
References
Bizer, C., Lehmann, J., Kobilarov, G., Auer, S., Becker, C., Cyganiak, R., and Hellmann, S. (2009). DBpedia - A crystallization point for the Web of Data. Web Semantics: Science, Services and Agents on the World Wide Web, 7(3).
Cucerzan, S. (2007). Large-Scale Named Entity Disambiguation Based on Wikipedia Data. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, pages 708–716, Prague, Czech Republic. Association for Computational Linguistics.
Davis, A., Veloso, A., da Silva, A. S., Laender, A. H. F., and Meira Jr., W. (2012). Named Entity Disambiguation in Streaming Data. In The 50th Annual Meeting of the Association for Computational Linguistics, Proceedings of the Conference, pages 815–824.
Guerra, P. H. C., Cerf, L., Porto, T. C., Veloso, A., Meira Jr., W., and Almeida, V. A. F. (2011a). Exploiting Temporal Locality to Determine User Bias in Microblogging Platforms. JIDM, 2(3):273–288.
Guerra, P. H. C., Veloso, A., Meira Jr., W., and Almeida, V. (2011b). From bias to opinion: a transfer-learning approach to real-time sentiment analysis. In KDD ’11, San Diego, CA. ACM.
Hoffart, J., Yosef, M. A., Bordino, I., Fürstenau, H., Pinkal, M., Spaniol, M., Taneva, B., Thater, S., and Weikum, G. (2011). Robust disambiguation of named entities in text. In EMNLP ’11: Proceedings of the Conference on Empirical Methods in Natural Language Processing. Association for Computational Linguistics.
Nguyen, K., Pham, C., Tran, D. A., and Zhang, F. (2011). Preserving Social Locality in Data Replication for Online Social Networks. In 31st IEEE International Conference on Distributed Computing Systems Workshops (ICDCS 2011 Workshops), 20-24 June 2011, Minneapolis, Minnesota, USA, pages 129–133.
Sperber, D. and Wilson, D. (1986). Relevance: communication and cognition. Harvard University Press, Cambridge, MA, USA.
Spitzer, F. (2001). Principles of random walk, volume 34. Springer Verlag.
Suchanek, F. M., Kasneci, G., and Weikum, G. (2007). Yago: a core of semantic knowledge. In WWW ’07: Proceedings of the 16th international conference on World Wide Web. ACM.
Veloso, A., Meira Jr., W., and Zaki, M. J. (2006). Lazy Associative Classification. In ICDM ’06: Proceedings of the Sixth International Conference on Data Mining. IEEE Computer Society.
Wang, C., Chakrabarti, K., Cheng, T., and Chaudhuri, S. (2012). Targeted disambiguation of ad-hoc, homogeneous sets of named entities. In WWW ’12: Proceedings of the 21st international conference on World Wide Web. ACM.
Yosef, M. A., Hoffart, J., Bordino, I., Spaniol, M., and Weikum, G. (2011). AIDA: An Online Tool for Accurate Disambiguation of Named Entities in Text and Tables. PVLDB, 4(12):1450–1453.
Yus, F. (2011). Cyberpragmatics: Internet-mediated communication in context. Pragmatics & Beyond New Series.
Published
2013-07-23
How to Cite
DAVIS, Alexandre; PEREIRA, Adriano C. M. Análise e Implementação de Modelos Contextuais para Desambiguação de Entidades Nomeadas em Fluxos de Mensagens. In: SBC UNDERGRADUATE RESEARCH CONTEST (CTIC-SBC), 32., 2013, Maceió. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2013. p. 122-131.