Representing and Classifying User Reviews

  • Denis D. Mauá USP
  • Fabio G. Cozman USP

Resumo


A large number of user reviews in the internet contains valuable information on services and products; for this reason, there is interest in automatically understanding such reviews. Sentiment Classification labels documents according to the feelings they express; instead of classifying a document into topics (sports, economics, etc), one attempts to tag the document according to overall feelings. Compared to the accuracy of traditional text categorization methods, sentiment classifiers have shown poor performance. We argue that such bad results are due to an improper representation of reviews. We describe a weakly supervised method that converts raw text into an appropriate representation, and show how techniques from information retrieval can acquire labeled data and process data using Markov logic. We report results on sentence classification and rating prediction that support our claims.

Referências

Hu, M. and Liu, B. (2004). “Mining opinion features in customer reviews”. In McGuinness, D. L. and Ferguson, G., editors, AAAI, Proceedings of the Nineteenth National Conference on Artificial Intelligence, Sixteenth Conference on Innovative Applications of Artificial Intelligence, July 25-29, 2004, San Jose, California, USA, pages 755–760. AAAI Press / The MIT Press.

Manning, C. D. and Schütze, H. (1999). Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, MA.

Pang, B. (2005). “Seeing stars”. In Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics - ACL 05 ACL 05, page 115.

Pang, B., Lee, L., and Vaithyanathan, S. (2002). “Thumbs up?: sentiment classification using machine learning techniques”. In EMNLP ’02: Proceedings of the ACL-02 conference on Empirical methods in natural language processing, pages 79–86, Morristown, NJ, USA. Association for Computational Linguistics.

Popescu, A.-M. and Etzioni, O. (2005). “Extracting product features and opinions from reviews”. In Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing - HLT 05 HLT 05, pages 339–346, Morristown, NJ, USA. Association for Computational Linguistics.

Richardson, M. and Domingos, P. (2006). “Markov logic networks”. Machine Learning, 62(1-2):107–136.

Snyder, B. and Barzilay, R. (2007). “Multiple aspect ranking using the good grief algorithm”. In Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference, pages 300–307, Rochester, New York. Association for Computational Linguistics.

Titov, I. and McDonald, R. (2008). “Modeling online reviews with multi-grain topic models”. In WWW ’08: Proceeding of the 17th international conference on World Wide Web, pages 111–120, New York, NY, USA. ACM.

Turney, P. D. (2001). “Thumbs up or thumbs down?: semantic orientation applied to unsupervised classification of reviews”. In ACL ’02: Proceedings of the 40th Annual Meeting on Association for Computational Linguistics, pages 417–424, Morristown, NJ, USA. Association for Computational Linguistics.
Publicado
20/07/2009
MAUÁ, Denis D.; COZMAN, Fabio G.. Representing and Classifying User Reviews. In: ENCONTRO NACIONAL DE INTELIGÊNCIA ARTIFICIAL E COMPUTACIONAL (ENIAC), 7. , 2009, Bento Gonçalves/RS. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2009 . p. 322-331. ISSN 2763-9061.

Artigos mais lidos do(s) mesmo(s) autor(es)