Towards a process to support solving the content selection problem from online community forums

Dárlinton B. F. Carvalho; Ricardo M. Marcacini; Carlos J. P. de Lucena; Solange O. Rezende

Dárlinton B. F. Carvalho Universidade Federal do Rio de Janeiro
Ricardo M. Marcacini Universidade de São Paulo
Carlos J. P. de Lucena Universidade Federal do Rio de Janeiro
Solange O. Rezende Universidade de São Paulo

Resumo

There are plenty of public content available on the Internet, especiallyin online communities, enabling researchers to study society in new ways.Since the qualitative content analysis of online forums is very time consuming,the following problem arises: how to select the content to be analyzed? Thispaper introduces a new process to support solving this problem. This process isbased on unsupervised machine learning techniques and provides consolidatedand structured results. This includes measurements and a content explorationmethod. A tool that helps to apply the proposed process was created and ispresented as well.

Palavras-chave: Seleção de Conteúdo, Fóruns Online, Aprendizado de Máquina

Referências

Carvalho, D., Madeira, W., Okamura, M., Lucena, C., and Zanetta, S. (2012). A practical approach to exploit public data available on the internet to study healthcare issues. In Proceeding of XXXII Congresso da Sociedade Brasileira de Computação (CSBC) – XII Workshop de Informática Médica, page to appear.

Kozinets, R. (2009). Netnography: Doing Ethnographic Research Online. Sage Publications Ltd, London.

Lefevre, F. and Lefevre, A. M. C. (2006). The collective subject that speaks. Interface - Comunicação, Saúde, Educação, 10(20):517–524.

Marcacini, R. M. and Rezende, S. O. (2010). Torch: a tool for building topic hierarchies from growing text collection. In WTA’2010: IX Workshop on Tools and Applications. In 8th Brazilian Symposium on Multimedia and the Web (Webmedia), pages 133–135.

Nogueira, B. M., Moura, M. F., Conrado, M. S., Rossi, R. G., Marcacini, R. M., and Rezende, S. O. (2008). Winning some of the document preprocessing challenges in a text mining process. In IV Workshop on Algorithms and Data Mining Applications, XXIV Brazilian Symposium on Database, pages 10–18.

Preece, J. and Maloney-Krichmar, D. (2005). Online communities: Design, theory, and practice. Journal of Computer-Mediated Communication, 10(4):article 1.

Zhao, Y., Karypis, G., and Fayyad, U. (2005). Hierarchical clustering algorithms for document datasets. Data Mining and Knowledge Discovery, 10(2):141–168.

Towards a process to support solving the content selection problem from online community forums

Resumo

Referências

Artigos mais lidos do(s) mesmo(s) autor(es)