Evaluating Sentiment Quantification Methods in Brazilian Portuguese Corpora

  • Lucas Nildaimon dos Santos Silva UFSCar
  • Diego Furtado Silva USP
  • Helena de Medeiros Caseli UFSCar

Resumo


This paper evaluates sentiment quantification methods applied to Brazilian Portuguese corpora. Sentiment quantification, distinct from sentiment classification, estimates the distribution of sentiment classes (positive and negative) within a dataset. We investigate several quantification techniques, including the family Classify and Count (CC) and more sophisticated methods, such as Kernel Density Estimation (KDE) and Distribution y-Similarity (DyS). Our analysis uses five datasets, each containing different distributions of sentiment classes. Our experimental results indicate that KDE and DyS methods consistently outperform others, achieving the best average ranks in terms of quantification accuracy. Statistical tests, including the Friedman and Nemenyi tests, confirm significant performance differences among the methods, with KDE and DyS showing statistically significant improvements over the baseline CC method. These findings highlight the importance of choosing robust quantification techniques for accurate sentiment quantification in corpora across different domains.
Publicado
17/11/2024
SILVA, Lucas Nildaimon dos Santos; SILVA, Diego Furtado; CASELI, Helena de Medeiros. Evaluating Sentiment Quantification Methods in Brazilian Portuguese Corpora. In: BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 13. , 2024, Belém/PA. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2024 . p. 230-244. ISSN 2643-6264.