A Characterization of Portuguese Tweets Regarding the Covid-19 Pandemic

  • Pedro V. Brum Universidade Federal de Minas Gerais
  • Matheus C. Teixeira Universidade Federal de Minas Gerais
  • Renato Miranda Universidade Federal de Minas Gerais
  • Renato Vimieiro Universidade Federal de Minas Gerais
  • Wagner Meira Jr Universidade Federal de Minas Gerais
  • Gisele L. Pappa Universidade Federal de Minas Gerais


Twitter has been one of the main sources of information and discussion during the COVID-19 pandemics. This paper characterizes a set of more than 56 million tweets written in Portuguese and collected over a period of 70 days. Our analysis includes the volume of messages, text of tweets, location of tweets, the main elements of tweets (e.g. hashtags and URLs) and the user profiles, including gender. The analyses showed the most discussed topics in the period were quarantine, hydroxychloroquine, agglomeration and social distance, and that the discussions were centered in political issues (e.g., most common hashtags include “fechadocombolsonaro" and “forabolsonaro").

Palavras-chave: coronavirus, Twitter, social media, epidemics, public health, pandemic, covid-19


Ahmed, W., Bath, P. A., Sbaffi, L., and Demartini, G. Novel insights into views towards h1n1 during the 2009 pandemic: a thematic analysis of twitter data. Health Information & Libraries Journal 36 (1): 60–72, 2019.

Bail, C. A., Argyle, L. P., Brown, T. W., Bumpus, J. P., Chen, H., Hunzaker, M. F., Lee, J., Mann, M., Merhout, F., and Volfovsky, A. Exposure to opposing views on social media can increase political polarization.

Proceedings of the National Academy of Sciences 115 (37): 9216–9221, 2018.

Chen, E., Lerman, K., and Ferrara, E. Tracking social media discourse about the covid-19 pandemic: Development of a public coronavirus twitter data set. JMIR Public Health and Surveillance 6 (2): e19273, 2020.

Dowd, J. B., Andriano, L., Brazel, D. M., Rotondi, V., Block, P., Ding, X., Liu, Y., and Mills, M. C. Demographic science aids in understanding the spread and fatality rates of covid-19. Proceedings of the National Academy of Sciences 117 (18): 9696–9698, 2020.

Ferrara, E. What types of covid-19 conspiracies are populated by twitter bots? First Monday 25 (6): 1–25, 2020.

Guerra, P. H. C., Meira Jr, W., Cardie, C., and Kleinberg, R. A measure of polarization on social media networks based on community boundaries. In Proceedings of the International Conference on Weblogs and Social Media (ICWSM), 2013.

Jiménez-Zafra, S. M., Martín-Valdivia, M. T., Molina-González, M. D., and Ureña-López, L. A. How do we talk about doctors and drugs? sentiment analysis in forums expressing opinions for medical domain. Artificial intelligence in medicine vol. 93, pp. 50–57, 2019.

Kouzy, R., Abi Jaoude, J., Kraitem, A., El Alam, M. B., Karam, B., Adib, E., Zarka, J., Traboulsi, C., Akl, E. W., and Baddour, K. Coronavirus goes viral: quantifying the covid-19 misinformation epidemic on twitter. Cureus 12 (3): e7275, 2020.

Menni, C., Valdes, A. M., Freidin, M. B., Sudre, C. H., Nguyen, L. H., Drew, D. A., Ganesh, S., Varsavsky, T., Cardoso, M. J., Moustafa, J. S. E.-S., et al. Real-time tracking of self-reported symptoms to predict potential covid-19. Nature medicine vol. 26, pp. 1037–1040, 2020.

Miranda Filho, R., Carvalho, A. I., and Pappa, G. L. Inferência de sexo e idade de usuários no twitter. In Anais do III Brazilian Workshop on Social Network Analysis and Mining. SBC, pp. 200–211, 2014.

Nepomuceno, M. R., Acosta, E., Alburez-Gutierrez, D., Aburto, J. M., Gagnon, A., and Turra, C. M. Besides population age structure, health and other demographic factors can contribute to understanding the covid-19 burden. Proceedings of the National Academy of Sciences 117 (25): 13881–13883, 2020.
BRUM, Pedro V.; TEIXEIRA, Matheus C.; MIRANDA, Renato; VIMIEIRO, Renato; MEIRA JR, Wagner; PAPPA, Gisele L.. A Characterization of Portuguese Tweets Regarding the Covid-19 Pandemic. In: SYMPOSIUM ON KNOWLEDGE DISCOVERY, MINING AND LEARNING (KDMILE), 8. , 2020, Evento Online. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2020 . p. 177-184. ISSN 2763-8944. DOI: https://doi.org/10.5753/kdmile.2020.11974.