Linguistic Analysis of Human and Language Model Comments on Posts in Brazilian Reddit Communities

Fernanda Luiza Tobias; Ana Paula Couto da Silva

doi:10.5753/webmedia.2025.15908

Fernanda Luiza Tobias (UFMG)
Ana Paula Couto da Silva (UFMG)

Abstract

In this work, we analyze the linguistic differences between comments written by humans and those generated by large language models in social support communities on Reddit. Overall, the models tend to better align with the semantic content and linguistic style of the posts and are generally more informative, but their responses are often repetitive and harder to read. In contrast, human comments are typically more concise, diverse, and include personal experiences. Additionally, differences across communities suggest that the type of support being sought influences writing patterns.

Keywords: Online social platforms, Emotional Support, Social Judgment, Generative Artificial Intelligence Models

