Detecção de Fake News via Sinais Implícitos de Crowds: Uma Abordagem para Mitigar o Cold-Start no Cálculo da Reputação
Resumo
The evolution of Digital Media for News Distribution has changed how people share content. Anyone can freely share information, including fake news (i.e., false information shared intentionally). In this scenario, fake news detection approaches have been proposed, with notable attention given to the one that uses crowd signals. Such approaches explore the collective sense by combining opinions (i.e., signals) of a high number of users (i.e., crowd), considering the reputations of these users regarding their capacity to identify fake news. Although promising, the crowd signals approaches have a significant limitation when many users in the crowds have not interacted with prior news. This lack of information leads to a cold-start problem when calculating those users’ reputations. The present work raises the following hypothesis: the performance of the crowd signals-based detection models can be improved if they mitigate the cold-start problem by inferring the users’ reputation based on the behavior those users would present in the face of prior news. This hypothesis is grounded on the fact that people tend to share news that seems familiar to their beliefs. In a search to validate the raised hypothesis, SCS, a crowd signals-based fake news detection method, is proposed. SCS considers similarities among texts (e.g., title and content) of past news to infer the reputation of users with the cold-start problem. BERT, a known Large Language Model (LLM) was used to provide embeddings that represent the texts. Preliminary experimental results demonstrate the effectiveness of the proposed method when compared to the crowd signals-based SOTA in detecting fake news.
Referências
David Camacho, M Victoria Luzón, and Erik Cambria. 2021. New research methods algorithms in social network analysis. Future Generation Computer Systems 114 (2021), 290–293. DOI: 10.1016/j.future.2020.08.006
Argus Antonio Barbosa Cavalcante, Paulo Márcio Souza Freire, Ronaldo Ribeiro Goldschmidt, and Claudia Marcela Justel. 2024. Early detection of fake news on virtual social networks: A time-aware approach based on crowd signals. Expert Systems with Applications 247 (2024), 123350. DOI: 10.1016/j.eswa.2024.123350
Flávio Roberto Matias da Silva, Paulo Márcio Souza Freire, Marcelo Pereira de Souza, Gustavo de A. B. Plenamente, and Ronaldo Ribeiro Goldschmidt. 2020. FakeNewsSetGen: a Process to Build Datasets that Support Comparison Among Fake News Detection Methods. In Proceedings of the Brazilian Symposium on Multimedia and the Web (São Luís, Brazil) (WebMedia ’20). Association for Computing Machinery, New York, NY, USA, 241–248. DOI: 10.1145/3428658.3430965
Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv:1810.04805 [cs.CL]
Bin Guo, Yasan Ding, Lina Yao, Yunji Liang, and Zhiwen Yu. 2020. The Future of False Information Detection on Social Media: New Perspectives and Trends. ACM Comput. Surv. 53, 4, Article 68 (jul 2020), 36 pages. DOI: 10.1145/3393880
Jürgen Habermas. 1982. Teoría de la Acción Comunicativa: Complementos y Estudios Previos. Madri: Cátedra. Teorema: International Journal of Philosophy (1982).
Rohit Kumar Kaliyar and Navya Singh. 2019. Misinformation Detection on Online Social Media-A Survey. In 2019 10th International Conference on Computing, Communication and Networking Technologies (ICCCNT). 1–6. DOI: 10.1109/ICCCNT45670.2019.8944587
Norbert Schwarz and Madeline Jalbert. 2019. When (Fake) News Feels True: Intuitions of Truth and the Acceptance and Correction of Misinformation.
Kai Shu, Limeng Cui, Suhang Wang, Dongwon Lee, and Huan Liu. 2019. DEFEND: Explainable Fake News Detection. (2019), 395–405. DOI: 10.1145/3292500.3330935
Kai Shu, Amy Sliva, Suhang Wang, Jiliang Tang, and Huan Liu. 2017. Fake news detection on social media: A data mining perspective. ACM SIGKDD explorations newsletter 19, 1 (2017), 22–36.
Fábio Souza, Rodrigo Nogueira, and Roberto Lotufo. 2020. BERTimbau: pretrained BERT models for Brazilian Portuguese. In 9th Brazilian Conference on Intelligent Systems, BRACIS, Rio Grande do Sul, Brazil, October 20-23 (to appear).
Paulo Márcio Souza Freire, Flávio Roberto Matias da Silva, and Ronaldo Ribeiro Goldschmidt. 2021. Fake news detection based on explicit and implicit signals of a hybrid crowd: An approach inspired in meta-learning. Expert Systems with Applications 183 (2021), 115414. DOI: 10.1016/j.eswa.2021.115414
Sebastian Tschiatschek, Adish Singla, Manuel Gomez Rodriguez, Arpit Merchant, and Andreas Krause. 2018. Fake News Detection in Social Networks via Crowd Signals. (2018), 517–524. DOI: 10.1145/3184558.3188722
Soroush Vosoughi, Mostafa ‘Neo’ Mohsenvand, and Deb Roy. 2017. Rumor Gauge: Predicting the Veracity of Rumors on Twitter. ACM Trans. Knowl. Discov. Data 11, 4, Article 50 (jul 2017), 36 pages. DOI: 10.1145/3070644