Um Método Linguístico que combina Polaridade, Emoção e Aspectos Gramaticais para Detecção de Fake News em Inglês

Gustavo A. Testoni; Marcelo P. Souza; Paulo Márcio S. Freire; Ronaldo R. Goldschimidt

doi:10.5753/brasnam.2021.16133

Gustavo A. Testoni Instituto Militar de Engenharia - IME
Marcelo P. Souza Instituto Militar de Engenharia - IME
Paulo Márcio S. Freire Instituto Militar de Engenharia - IME
Ronaldo R. Goldschimidt Instituto Militar de Engenharia - IME

DOI: https://doi.org/10.5753/brasnam.2021.16133

Resumo

O uso crescente dos meios digitais aumentou a proliferação de Fake News. Métodos baseados na abordagem linguística que consideram o sentimento (i.e. polaridade ou emoções) presente nos textos das notícias têm mostrado resultados promissores para detectar, automaticamente, esse tipo de notícia nos meios digitais. Embora as emoções nos textos possam fornecer informações significativas para identificar Fake News, esses métodos eram restritos à extração de polaridade quando aplicados a notícias escritas em inglês. Assim, o presente trabalho propõe um protótipo baseado na abordagem linguística que considera a polaridade e as emoções para detectar Fake News escritas em inglês. O protótipo proposto foi adaptado de um método bem-sucedido, baseado no sentimento, que foi concebido e aplicado a notícias escritas em português. O protótipo proposto apresentou resultados promissores nos experimentos, superando os baselines em até 3,59 pontos percentuais.

Palavras-chave: Fake News, Poalaridade, Emoção

Referências

Ajao, O., Bhowmik, D., and Zargari, S. (2019). Sentiment Aware Fake News Detectionon Online Social Networks. InICASSP, IEEE International Conference on Acoustics,Speech and Signal Processing - Proceedings, volume 2019-May, pages 2507–2511.Institute of Electrical and Electronics Engineers Inc.

Bhavika Bhutani, Neha Rastogi, P. S. and Purwar, A. (2019). Fake news detection usingsentiment analysis. InInstitute of Electrical and Electronics Engineers (IEEE).

Bondielli, A. and Marcelloni, F. (2019). A survey on fake news and rumour detectiontechniques.Information Sciences, 497:38–55.

de Morais, J. I., Abonizio, H. Q., Tavares, G. M., da Fonseca, A. A., and Barbon, S.(2019). Deciding among Fake, Satirical, Objective and Legitimate news. pages 1–8.Association for Computing Machinery (ACM).

de Souza, M. P., da Silva, F. R. M., Freire, P. M. S., and Goldschmidt, R. R. (2020). Alinguistic-based method that combines polarity, emotion and grammatical characteris-tics to detect fake news in portuguese. InProceedings of the Brazilian Symposium onMultimedia and the Web, WebMedia ’20, page 217–224, New York, NY, USA. Asso-ciation for Computing Machinery.

Durier, F. and Garcia, A. C. (2019). Fake news and sarcasm, what is the limit of a criticand what is intentionally fake? InAnais do Simpósio Brasileiro de Sistemas Colabo-rativos, pages 58–61. Sociedade Brasileira de Computação - SBC.

Faustini, P. and Cov ̃oes, T. (2019). Fake news detection using one-class classification.InProceedings - 2019 Brazilian Conference on Intelligent Systems, pages 592–597.BRACIS 2019.

Freire, P. M. S. and Goldschmidt, R. R. (2019). Uma introdução ao combate autom ́atico`as fake news em redes sociais virtuais. InTópicos de Gerenciamento de Dados eInformação, 34th SBBD, pages 38–67, Fortaleza, CE, Brazil. SBC.

Kohavi, R. et al. (1995). A study of cross-validation and bootstrap for accuracy estima-tion and model selection. InIjcai, volume 14, pages 1137–1145, Montreal, Canada.Montreal, Canada, researchgate.

Mejova, Y. and Kalimeri, K. (2020). Advertisers jump on coronavirus bandwagon: Poli-tics, news, and business.ArXiv, abs/2003.00923.

Moraes, M. P., Sampaio, J. d. O., and Charles, A. C. (2019). Data mining applied infake news classification through textual patterns. InProceedings of the 25th BrazillianSymposium on Multimedia and the Web - WebMedia ’19, pages 321–324. ACM Press,New York, New York, USA.

Sharma, K., Qian, F., Jiang, H., Ruchansky, N., Zhang, M., and Liu, Y. (2019). Combatingfake news: A survey on identification and mitigation techniques.ACM TIST.

Shu, K., Mahudeswaran, D., and Liu, H. (2019). Fakenewstracker: A tool for fake newscollection, detection, and visualization.Comput. Math. Organ. Theory, 25(1):60–71.

Shu, K., Mahudeswaran, D., Wang, S., Lee, D., and Liu, H. (2020). Fakenewsnet: Adata repository with news content, social context, and spatiotemporal information forstudying fake news on social media.Big Data, 8(3):171–188.

Shu, K., Sliva, A., WNAG, S., Tang, J., and Liu, H. (2017). Fake news detection on socialmedia: A data mining perspective. InSIGKDD Explor. Newsl. 19, 1, pages 22-36.

Tausczik, Y. R. and Pennebaker, J. W. (2010). The Psychological Meaning of Words:LIWC and Computerized Text Analysis Methods.Journal of Language and SocialPsychology, 29(1):24–54.

Vosoughi, S., Mohsenvand, M. N., and Roy, D. (2017). Rumor gauge: Predicting theveracity of rumors on twitter.ACM Trans. Knowl. Discov. Data, 11(4):50:1–50:36.