Addressing search in scientific open data repositories: A semantic metasearch platform
Resumo
Scientific research in all fields has advanced in complexity and in the amount of data generated. The heterogeneity of data repositories, data meaning and their metadata standards makes this problem even more significant. In spite of several proposals to find and retrieve research data from public repositories, there is still need for more comprehensive retrieval solutions. In this article, we specify and develop a mechanism to search for scientific data that takes advantage of metadata records and semantic methods. We present the conception of our architecture and how we have implemented it in a use case in agriculture.
Palavras-chave:
semantic annotation, scientific data, agricultural, open repositories, FAIR
Referências
Ávila, R., Santos, S., Araújo, D., Vidal, V. M. P., and de Macêdo, J. A. F. (2017). Ligações Semânticas Utilizando Predicados SKOS. InSBBD, pages 88–99
Breeding, M. (2005). Plotting a new course for metasearch.Computers in Libraries,25(2):27–29.
Costa, M. and Braga, T. (2016). Repositórios de dados de pesquisa no mundo.Cadernos BAD, 0(2):80–95.
do Espírito Santo, J., de Paula, E. V., and Medeiros, C. B. (2019). Exploring Semantics in Clinical Data Interoperability. In Advances in Conceptual Modeling, pages 201–210. Springer International Publishing.
Gavankar, C., Bhosale, T., Gunda, D., Chavan, A., and Hassan, S. (2020). A comparative study of semantic search systems. 2020 International Conference on Computer Communication and Informatics, ICCCI 2020, pages 1–7.
Gottardi, T., Medeiros, C. B., and Reis, J. D. (2020). Understanding semantic search on scientific repositories: Steps towards meaningful findability. In 1st virtual workshop on Research data management for Linked Open Science-DaMaLOS.
Izquierdo, Y. T., Garcia, G. M., Lemos, M., Novello, A., Novelli, B., Damasceno, C., Leme, L. A., and Casanova, M. A. (2020). Keyword Search over the COVID-19 Data. In Anais XXXV SBBD, pages 205–210, Porto Alegre, RS, Brasil. SBC.
Jonquet, C., Shah, N. H., and Musen, M. A. (2009). The open biomedical annotator. Summit on translational bioinformatics, 2009:56–60.
Kaiser, K. A., Chodacki, J., Habermann, T., Kemp, J., Paglione, L., Urberg, M., and Scott Plutchak, T. (2020). Metadata: The accelerant we need. Information Services & Use, (Preprint):1–11.
Pierre, M. S. and LaPlant, W. P. J. (1998). Issues in Crosswalking Content Metadata Standards. National Information Standards Organization - White Papers.
Riley, J. (2017). Understanding metadata. Washington DC, United States: National Information Standards Organization (http://www.niso.org/publications/press/UnderstandingMetadata.pdf), 23.
Sanchez, F. A., Da Silva, N. B. P., and Vechiato, F. L. (2019). Padrões de metadados para representação e organização da informação em repositórios de dados de pesquisa. Informação & Tecnologia, 5(1):37–51.
Simionato, A. C. (2017). Mapeamento dos metadados para dados científicos. In XVIII ENCONTRO NACIONAL DE PESQUISA EM CIÊNCIA DA INFORMAÇÃO (XVIII ENANCIB).
Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., and et al (2016). The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, 3(1):160018.
Yan, Q., McMahon, M. J., Dascalu, S., Harris, F. C., and Ravi, L. (2013). Community metadata ISO 19115 adaptor. 28th International Conference on Computers and Their Applications 2013, CATA 2013, pages 213–218.
Breeding, M. (2005). Plotting a new course for metasearch.Computers in Libraries,25(2):27–29.
Costa, M. and Braga, T. (2016). Repositórios de dados de pesquisa no mundo.Cadernos BAD, 0(2):80–95.
do Espírito Santo, J., de Paula, E. V., and Medeiros, C. B. (2019). Exploring Semantics in Clinical Data Interoperability. In Advances in Conceptual Modeling, pages 201–210. Springer International Publishing.
Gavankar, C., Bhosale, T., Gunda, D., Chavan, A., and Hassan, S. (2020). A comparative study of semantic search systems. 2020 International Conference on Computer Communication and Informatics, ICCCI 2020, pages 1–7.
Gottardi, T., Medeiros, C. B., and Reis, J. D. (2020). Understanding semantic search on scientific repositories: Steps towards meaningful findability. In 1st virtual workshop on Research data management for Linked Open Science-DaMaLOS.
Izquierdo, Y. T., Garcia, G. M., Lemos, M., Novello, A., Novelli, B., Damasceno, C., Leme, L. A., and Casanova, M. A. (2020). Keyword Search over the COVID-19 Data. In Anais XXXV SBBD, pages 205–210, Porto Alegre, RS, Brasil. SBC.
Jonquet, C., Shah, N. H., and Musen, M. A. (2009). The open biomedical annotator. Summit on translational bioinformatics, 2009:56–60.
Kaiser, K. A., Chodacki, J., Habermann, T., Kemp, J., Paglione, L., Urberg, M., and Scott Plutchak, T. (2020). Metadata: The accelerant we need. Information Services & Use, (Preprint):1–11.
Pierre, M. S. and LaPlant, W. P. J. (1998). Issues in Crosswalking Content Metadata Standards. National Information Standards Organization - White Papers.
Riley, J. (2017). Understanding metadata. Washington DC, United States: National Information Standards Organization (http://www.niso.org/publications/press/UnderstandingMetadata.pdf), 23.
Sanchez, F. A., Da Silva, N. B. P., and Vechiato, F. L. (2019). Padrões de metadados para representação e organização da informação em repositórios de dados de pesquisa. Informação & Tecnologia, 5(1):37–51.
Simionato, A. C. (2017). Mapeamento dos metadados para dados científicos. In XVIII ENCONTRO NACIONAL DE PESQUISA EM CIÊNCIA DA INFORMAÇÃO (XVIII ENANCIB).
Wilkinson, M. D., Dumontier, M., Aalbersberg, I. J., Appleton, G., Axton, M., Baak, A., Blomberg, N., and et al (2016). The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data, 3(1):160018.
Yan, Q., McMahon, M. J., Dascalu, S., Harris, F. C., and Ravi, L. (2013). Community metadata ISO 19115 adaptor. 28th International Conference on Computers and Their Applications 2013, CATA 2013, pages 213–218.
Publicado
18/07/2021
Como Citar
BORGES, Gustavo Caetano; REIS, Julio César dos; MEDEIROS, Claudia Bauzer.
Addressing search in scientific open data repositories: A semantic metasearch platform. In: BRAZILIAN E-SCIENCE WORKSHOP (BRESCI), 15. , 2021, Evento Online.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2021
.
p. 81-88.
ISSN 2763-8774.
DOI: https://doi.org/10.5753/bresci.2021.15792.