Towards a process to support solving the content selection problem from online community forums

  • Dárlinton B. F. Carvalho Universidade Federal do Rio de Janeiro
  • Ricardo M. Marcacini Universidade de São Paulo
  • Carlos J. P. de Lucena Universidade Federal do Rio de Janeiro
  • Solange O. Rezende Universidade de São Paulo

Resumo


There are plenty of public content available on the Internet, especiallyin online communities, enabling researchers to study society in new ways.Since the qualitative content analysis of online forums is very time consuming,the following problem arises: how to select the content to be analyzed? Thispaper introduces a new process to support solving this problem. This process isbased on unsupervised machine learning techniques and provides consolidatedand structured results. This includes measurements and a content explorationmethod. A tool that helps to apply the proposed process was created and ispresented as well.

Palavras-chave: Seleção de Conteúdo, Fóruns Online, Aprendizado de Máquina

Referências

Carvalho, D., Madeira, W., Okamura, M., Lucena, C., and Zanetta, S. (2012). A practical approach to exploit public data available on the internet to study healthcare issues. In Proceeding of XXXII Congresso da Sociedade Brasileira de Computação (CSBC) – XII Workshop de Informática Médica, page to appear.

Kozinets, R. (2009). Netnography: Doing Ethnographic Research Online. Sage Publications Ltd, London.

Lefevre, F. and Lefevre, A. M. C. (2006). The collective subject that speaks. Interface - Comunicação, Saúde, Educação, 10(20):517–524.

Marcacini, R. M. and Rezende, S. O. (2010). Torch: a tool for building topic hierarchies from growing text collection. In WTA’2010: IX Workshop on Tools and Applications. In 8th Brazilian Symposium on Multimedia and the Web (Webmedia), pages 133–135.

Nogueira, B. M., Moura, M. F., Conrado, M. S., Rossi, R. G., Marcacini, R. M., and Rezende, S. O. (2008). Winning some of the document preprocessing challenges in a text mining process. In IV Workshop on Algorithms and Data Mining Applications, XXIV Brazilian Symposium on Database, pages 10–18.

Preece, J. and Maloney-Krichmar, D. (2005). Online communities: Design, theory, and practice. Journal of Computer-Mediated Communication, 10(4):article 1.

Zhao, Y., Karypis, G., and Fayyad, U. (2005). Hierarchical clustering algorithms for document datasets. Data Mining and Knowledge Discovery, 10(2):141–168.
Publicado
17/07/2012
CARVALHO, Dárlinton B. F.; MARCACINI, Ricardo M.; LUCENA, Carlos J. P. de; REZENDE, Solange O.. Towards a process to support solving the content selection problem from online community forums. In: BRAZILIAN WORKSHOP ON SOCIAL NETWORK ANALYSIS AND MINING (BRASNAM), 1. , 2012, Curitiba. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2012 . p. 264-267. ISSN 2595-6094.