OntoLP: Ontology Engineering in Portuguese Language
Abstract
The continuous growth of digital information resources of many kinds (texts, images, videos, services) points to the need of consistent knowledge representation structures for modeling, storing, accessing and communicating about these resources. In this context, ontologies are being claimed as an important technology. Ontologies wide adoption is, however, difficult due to the large cost of building them from scratch. This problem instigate a new area of research related to ontology learning from texts. For this kind of problem, many of the tools and methods needed are language dependent, since they rely on linguistic knowledge. Efforts are needed for each language. This paper presents a tool (OntoLP) developed as a plug-in to the ontology editor Protégé which analyzes a given domain corpus of Portuguese texts and suggests concept candidates and their hierarchy to the ontology engineer, based on the knowledge presented in the texts.References
Aluísio, S. (2005). Desenvolvimento de uma estrutura conceitual (ontologia) para a área de nanociência e nanotecnologia. Technical Report 2004.1.34165.1.6, Universidade de São Paulo.
Baségio, T. (2006). Uma abordagem semi-automática para identificação de estruturas ontológicas a partir de textos na língua portuguesa do brasil. Master’s thesis, Pontifícia Universidade Católica do Rio Grande do Sul - PUCRS.
Bick, E. (2000). The Parsing System “Palavras”. Automatic Grammatical Analysis of Portuguese in a Constraint Grammar Framework. PhD thesis, Arhus University.
Buitelaar, P., Cimiano, P., and Magnini, B. (2005). Ontology learning from text: An overview. In P-Buitelaar, Cimiano, P., and Magnini, B., editors, Ontology Learning from Text: Methods, Evaluation and Applications, volume 123 of Frontiers in Artificial Intelligence and Applications. IOS Press.
Coulthard, R. J. (2005). The application of corpus methodology to translation: the jped parallel corpus and the pediatrics comparable corpus. Master’s thesis, Programa de Pós-Graduação em Estudos da Tradução, Universidade Federal de Santa Catarina.
Frantzi, K. T., Ananiadou, S., and ichi Tsujii, J. (1998). The c-value/nc-value method of automatic recognition for multi-word terms. In ECDL ’98: Proceedings of the Second European Conference on Research and Advanced Technology for Digital Libraries, pages 585–604, London, UK. Springer-Verlag.
Manning, C. D. and Schütze, H. (1999). Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge, Massachusetts.
Ryu, P.-M. and Choi, K.-S. (2006). Taxonomy learning using term specificity and similarity. In Proceedings of the 2nd Workshop on Ontology Learning and Population: Bridging the Gap between Text and Knowledge, pages 41–48, Sydney, Australia. Association for Computational Linguistics.
Suchanek, F. M., Ifrim, G., and Weikum, G. (2006). Leila: Learning to extract information by linguistic analysis. In Proceedings of the 2nd Workshop on Ontology Learning and Population: Bridging the Gap between Text and Knowledge, pages 18–25, Sydney, Australia. Association for Computational Linguistics.
Zavaglia, C., Aluísio, S., das Graças Volpe Nunes, M., and de Oliveira, L. M. (2007). Estrutura ontológica e unidades lexicais: uma aplicação computacional no domínio da ecologia. In Anais do 5o Workshop em Tecnologia da Informação e da Linguagem Humana, TIL’2007, pages 1575–1584, Rio de Janeiro, Brasil.
Baségio, T. (2006). Uma abordagem semi-automática para identificação de estruturas ontológicas a partir de textos na língua portuguesa do brasil. Master’s thesis, Pontifícia Universidade Católica do Rio Grande do Sul - PUCRS.
Bick, E. (2000). The Parsing System “Palavras”. Automatic Grammatical Analysis of Portuguese in a Constraint Grammar Framework. PhD thesis, Arhus University.
Buitelaar, P., Cimiano, P., and Magnini, B. (2005). Ontology learning from text: An overview. In P-Buitelaar, Cimiano, P., and Magnini, B., editors, Ontology Learning from Text: Methods, Evaluation and Applications, volume 123 of Frontiers in Artificial Intelligence and Applications. IOS Press.
Coulthard, R. J. (2005). The application of corpus methodology to translation: the jped parallel corpus and the pediatrics comparable corpus. Master’s thesis, Programa de Pós-Graduação em Estudos da Tradução, Universidade Federal de Santa Catarina.
Frantzi, K. T., Ananiadou, S., and ichi Tsujii, J. (1998). The c-value/nc-value method of automatic recognition for multi-word terms. In ECDL ’98: Proceedings of the Second European Conference on Research and Advanced Technology for Digital Libraries, pages 585–604, London, UK. Springer-Verlag.
Manning, C. D. and Schütze, H. (1999). Foundations of Statistical Natural Language Processing. The MIT Press, Cambridge, Massachusetts.
Ryu, P.-M. and Choi, K.-S. (2006). Taxonomy learning using term specificity and similarity. In Proceedings of the 2nd Workshop on Ontology Learning and Population: Bridging the Gap between Text and Knowledge, pages 41–48, Sydney, Australia. Association for Computational Linguistics.
Suchanek, F. M., Ifrim, G., and Weikum, G. (2006). Leila: Learning to extract information by linguistic analysis. In Proceedings of the 2nd Workshop on Ontology Learning and Population: Bridging the Gap between Text and Knowledge, pages 18–25, Sydney, Australia. Association for Computational Linguistics.
Zavaglia, C., Aluísio, S., das Graças Volpe Nunes, M., and de Oliveira, L. M. (2007). Estrutura ontológica e unidades lexicais: uma aplicação computacional no domínio da ecologia. In Anais do 5o Workshop em Tecnologia da Informação e da Linguagem Humana, TIL’2007, pages 1575–1584, Rio de Janeiro, Brasil.
Published
2008-07-12
How to Cite
RIBEIRO JUNIOR, Luiz Carlos; VIEIRA, Renata.
OntoLP: Ontology Engineering in Portuguese Language. In: INTEGRATED SOFTWARE AND HARDWARE SEMINAR (SEMISH), 35. , 2008, Belém/PA.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2008
.
p. 181-194.
ISSN 2595-6205.
