BioDSL: A Domain-Specific Language for mapping and dissemination of Biodiversity Data in the LOD

  • Kleberson J. do A. Serique USP
  • José L. Campos dos Santos INPA
  • Dilvan A. Moreira USP

Resumo


Currently, Linked Open Data (LOD) have enabled integrated data sharing across disciplines over the Web. However, for LOD users, in areas such as biodiversity (which massively use the Web to disseminate data), the task of transforming data file contents in CSV (Comma Separated Value) to RDF (Resource Description Framework) is not trivial. We have developed a new approach to map data files in CSV to RDF format based on a domain-specific language (DSL) called BioDSL. Using it, biodiversity data users can write compact programs to map their data to RDF and link them to the LOD. Biodiversity vocabularies and ontologies, such as Darwin Core and OntoBio, can be used with BioDSL to enrich user data. Existing tools are exclusively focused on mapping (CSV to RDF), offering little or no support for linking data to the LOD (interconnecting user entities to LOD entities). They also are more complex to use than BioDSL.


 

Referências

Berners-Lee, T. (2006). Design issues: Linked data.

Cardoso, S. D., Amanqui, F. K., Serique, K. J. A., dos Santos, J. L. C., and Moreira, D. a. (2015). SWI: A Semantic Web Interactive Gazetteer to support Linked Open Data. Future Generation Computer Systems, pages –.

Dimou, A., Sande, M. V., Colpaert, P., Verborgh, R., Mannens, E., and Van De Walle, R. (2014). RML: A generic language for integrated RDF mappings of heterogeneous data. CEUR Workshop Proceedings, 1184.

Lebo, T. andWilliams, G. T. (2010). Converting governmental datasets into linked data. In Proceedings of the 6th International Conference on Semantic Systems - I-SEMANTICS ’10, page 1, New York, New York, USA. ACM Press.

Maali, F., Cyganiak, R., and Peristeras, V. (2011). Re-using Cool URIs: Entity Reconciliation Against LOD Hubs. LDOW.

Matsubara, W., Kusano, K., Bannai, H., and Shinohara, A. (2009). Language and Automata Theory and Applications. Lata, 5457:578–587.

Moura, A. M. D. C., Porto, F., Poltosi, M., Palazzi, D. C., Magalh˜aes, P., and Vidal, V. (2012). Integrating Ecological Data Using Linked Data Principles. In Joint V Seminar on Ontology Research in Brazil, pages 156–167.

Scharffe, F., Atemezing, G., and Troncy, R. (2012). Enabling linkeddata publication with the datalift platform. Proc. AAAI workshop on . . . .

Seyed, P., Chastain, K., Ashby, B., Liu, Y., Lebo, T., Patton, E., and McGuinness, D. (2013). Semanteco annotator. CEUR Workshop Proceedings, 1035:161–164.

Stadler, C., Unbehauen, J., Westphal, P., Sherif, M. A., and Lehmann, J. (2015). Simplified RDB2RDF Mapping. Proceedings of the 8th Workshop on Linked Data on the Web (LDOW2015), Florence, Italy.

Unbehauen, J., Hellmann, S., Auer, S., and Stadler, C. (2012). Knowledge extraction from structured sources. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 7538:34–52.

Van Der Waal, S., Wecel Cel, K., Ermilov, I., Janev, V., Milosevic, U., and Wainwright, M. (2014). Lifting open data portals to the data web. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 8661:175–195.

Walls, R. L., Deck, J., Guralnick, R., Baskauf, S., Beaman, R., Blum, S., Bowers, S., Buttigieg, P. L., Davies, N., Endresen, D., Gandolfo, M. A., Hanner, R., Janning, A., Krishtalka, L., Matsunaga, A., Midford, P., Morrison, N., Tuama, É. Ó., Schildhauer, M., Smith, B., Stucky, B. J., Thomer, A., Wieczorek, J., Whitacre, J., and Wooley, J. (2014). Semantics in Support of Biodiversity Knowledge Discovery: An Introduction to the Biological Collections Ontology and Related Ontologies. PLoS ONE, 9(3).
Publicado
04/07/2016
SERIQUE, Kleberson J. do A.; DOS SANTOS, José L. Campos; MOREIRA, Dilvan A.. BioDSL: A Domain-Specific Language for mapping and dissemination of Biodiversity Data in the LOD. In: BRAZILIAN E-SCIENCE WORKSHOP (BRESCI), 10. , 2016, Porto Alegre. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2016 . p. 285-293. ISSN 2763-8774. DOI: https://doi.org/10.5753/bresci.2016.10011.