Authorship Attribution Using Data from Reddit Forum
Resumo
As the online social networks become more and more part of people’s daily lives, analyzes of the content posted in these media to avoid the circulation of fake news or doubtful authorship become necessary. This paper analyzes comments from a Reddit forum community in order to evaluate different representations of the text and using artificial intelligence classification techniques for the authorship attribution in the context of social networks in the forum style. The results showed that, for each scenario, a given combination of classifier and selected characteristics (different representations) is more recommended and presents good efficiency in the distinction among authors.
Palavras-chave:
Authorship Attribution, Reddit, Data Mining
Publicado
03/11/2020
Como Citar
CASIMIRO, Guilherme Ramos; DIGIAMPIETRI, Luciano Antonio.
Authorship Attribution Using Data from Reddit Forum. In: SIMPÓSIO BRASILEIRO DE SISTEMAS DE INFORMAÇÃO (SBSI), 16. , 2020, Evento Online.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2020
.
DOI: https://doi.org/10.5753/sbsi.2020.13760.