Comparing Open Data Repositories

  • Pedro H. M. Costa UEM
  • André F. R. Cordeiro UEM
  • Edson OliveiraJr UEM

Abstract


Open Data is one of the core concepts of Open Science, which aims to make the artifacts of scientific research accessible to everyone. Open Data provides recommendations and practices for accessing and using scientific research data in a way that is free, permanent, citable, auditable, and interchangeable. To make such data easier to manage, it is important to store it in a repository. In this context, this paper compares five well-known open data repositories. The comparison considers a set of criteria, such as data format restrictions, digital identifier, versioning of published datasets, curators of data collections, metadata schema, versioning and export, storage limit, paid services, redundancy and preservation, access controls, and APIs. Results and discussion are presented in terms of these criteria.

Keywords: Open Science, Repositories, Open Data

Published
01/12/2021
COSTA, Pedro H. M.; CORDEIRO, André F. R.; OLIVEIRAJR, Edson. Comparando Repositórios de Dados Abertos. In: ESCOLA REGIONAL DE ENGENHARIA DE SOFTWARE (ERES), 5., 2021, Evento Online. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2021. p. 60-69. DOI: https://doi.org/10.5753/eres.2021.18451.