Building and Analyzing an Open Repository of Programming Exercises in Portuguese

  • Vinicius Kuster Lodi (UFV)
  • João Pedro M. Sena (USP)
  • Julio C. S. Reis (UFV)

Abstract


Practical exercises are an important part of learning programming. For Brazilian Portuguese, however, few datasets of such exercises are publicly available. This work collects and models a data repository of programming exercises containing instructions, test cases, and exercise metadata. First, platforms are selected as data sources, and web scraping techniques are applied to extract the desired content. The collected data is then modeled and made available in a public repository called ProgramExBR. A characterization of the data is also presented, revealing patterns in the text of the exercises. We hope the repository can be useful in various contexts, such as supporting teachers and students, training inference models and generating new exercises, or applying personalized recommendation methods in education.
Keywords: repository, programming exercise, Portuguese, Web scraping

Published
2025-11-10
LODI, Vinicius Kuster; SENA, João Pedro M.; REIS, Julio C. S. Building and Analyzing an Open Repository of Programming Exercises in Portuguese. In: UNDERGRADUATE RESEARCH CONTEST - BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB (WEBMEDIA), 31., 2025, Rio de Janeiro/RJ. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2025. p. 45-48. ISSN 2596-1683. DOI: https://doi.org/10.5753/webmedia_estendido.2025.16359.