Um Método para Detecção de Bots Sociais Baseado em Redes Neurais Convolucionais Aplicadas em Mensagens Textuais

  • Paulo A. Braz IME
  • Ronaldo R. Goldschmidt IME

Abstract


Currently, social networks are subject to social bots that perform malicious activities such as the dissemination of fake news. Some research to detect this type of malware is based on statistics extracted from the content of posted messages. Once the extraction of statistics may lead to information loss, this work aims to present experimental evidences that the use of original textual messages can improve detection accuracy. To this end, we propose a method that applies a convolutional neural network to identify suspicious messages. Good preliminary results with Twitter data indicate adequacy of the proposed method.

References

Badri Satya, P. R., Lee, K., Lee, D., Tran, T., and Zhang, J. J. (2016). Uncovering fake likers in online social networks. In Proceedings of the 25th ACM International on Conference on Information and Knowledge Management, CIKM ’16, pages 2365–2370, New York, NY, USA. ACM.

Beaudry, N. J. and Renner, R. (2012). An intuitive proof of the data processing inequality. Quantum Info. Comput., 12(5-6):432–441.

Bezerra, E. (2016). Introdução à aprendizagem profunda. http://sbbd2016.fpc.ufba.br/sbbd2016/minicursos/minicurso3.pdf.

Chollet, F. et al. (2015). Keras. https://github.com/fchollet/keras.

Davis, C. A., Varol, O., Ferrara, E., Flammini, A., and Menczer, F. (2016). Botornot: A system to evaluate social bots. In Proceedings of the 25th International Conference Companion on World Wide Web, pages 273–274. International World Wide Web Conferences Steering Committee.

Ferrara, E., Varol, O., Davis, C., Menczer, F., and Flammini, A. (2016). The rise of social bots. Commun. ACM, 59(7):96–104.

Ho, T. K. (1995). Random decision forests. In Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on, volume 1, pages 278–282. IEEE.

Hochreiter, S. and Schmidhuber, J. (1997). Long short-term memory. Neural computation, 9(8):1735–1780.

Kononenko, I. (1994). Estimating attributes: analysis and extensions of relief. In European conference on machine learning, pages 171–182. Springer.

LeCun, Y., Huang, F. J., and Bottou, L. (2004). Learning methods for generic object recognition with invariance to pose and lighting. In Computer Vision and Pattern Recognition, 2004. CVPR 2004. Proceedings of the 2004 IEEE Computer Society Conference on, volume 2, pages II–104. IEEE.

Lee, K., Eoff, B. D., and Caverlee, J. (2011). Seven months with the devils: A long-term study of content polluters on twitter.

Ratkiewicz, J., Conover, M., Meiss, M. R., Gonçalves, B., Flammini, A., and Menczer, F. (2011). Detecting and tracking political abuse in social media.

Rosenblatt, F. (1961). Principles of neurodynamics. perceptrons and the theory of brain mechanisms. Technical report, CORNELL AERONAUTICAL LAB INC BUFFALO NY.

Wang, G., Zhang, X., Tang, S., Zheng, H., and Zhao, B. Y. (2016). Unsupervised clickstream clustering for user behavior analysis. In SIGCHI Conference on Human Factors in Computing Systems.

Witten, I. H., Frank, E., Hall, M. A., and Pal, C. J. (2016). Data Mining: Practical machine learning tools and techniques. Morgan Kaufmann.

Yang, Z., Wilson, C., Wang, X., Gao, T., Zhao, B. Y., and Dai, Y. (2014). Uncovering social network sybils in the wild. ACM Transactions on Knowledge Discovery from Data (TKDD), 8(1):2.

Zhang, X., Zhao, J., and LeCun, Y. (2015). Character-level convolutional networks for text classification. In Advances in neural information processing systems, pages 649–657.
Published
2017-11-06
BRAZ, Paulo A.; GOLDSCHMIDT, Ronaldo R.. Um Método para Detecção de Bots Sociais Baseado em Redes Neurais Convolucionais Aplicadas em Mensagens Textuais. In: BRAZILIAN SYMPOSIUM ON CYBERSECURITY (SBSEG), 17. , 2017, Brasília. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2017 . p. 501-508. DOI: https://doi.org/10.5753/sbseg.2017.19524.

Most read articles by the same author(s)