Towards a process to support solving the content selection problem from online community forums
Abstract
Plenty of public content is available on the Internet, especially in online communities, enabling researchers to study society in new ways. Because the qualitative content analysis of online forums is very time-consuming, the following problem arises: how to select the content to be analyzed? This paper introduces a new process to support solving this problem. The process is based on unsupervised machine learning techniques and provides consolidated and structured results, including measurements and a content exploration method. A tool that helps apply the proposed process was created and is also presented.