Analysis of techniques for automatic summarization of hotel opinions

  • Paulo César M. Sousa UFCAT
  • Márcio de Souza Dias UFCAT
  • Sérgio Francisco da Silva UFCAT

Abstract


This paper compares techniques for the automatic summarization of hotel reviews. It analyzes extractive techniques that generate aspect-based summaries as well as techniques that generate general summaries. The reviews were drawn from a novel corpus of data collected from the TripAdvisor platform, covering hotels in different regions of Brazil. All automatic summaries were evaluated with the ROUGE set of metrics against summaries written by human annotators. The results revealed key limitations of ROUGE when applied to shorter, informal documents, as well as variation in how effectively the different techniques address specific aspects of summarization.

Published
07/12/2023
SOUSA, Paulo César M.; DIAS, Márcio de Souza; DA SILVA, Sérgio Francisco. Analysis of techniques for automatic summarization of hotel opinions. In: ESCOLA REGIONAL DE INFORMÁTICA DE GOIÁS (ERI-GO), 11., 2023, Goiânia/GO. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023. DOI: https://doi.org/10.5753/erigo.2023.237285.