Toward a Scoring Schema to Rank Candidate Instances of Ontological Classes - Extracting Brazilian Portuguese Texts from the Web
Resumo
With the emergence of Information Extraction Systems driven by ontologies, boosted by the Semantic Web, there is a need for the development of scoring schemas that enable the automatic classification of information. These schemas, even so little explored in the Portuguese language, provide measures used in the stage of classification of relevant instances to ontological classes. In this way, this paper presents: (i) a brief discussion about existing scoring measures based on PMI (Pointwise Mutual Information); (ii) new scoring measures based on PMI and Standard Deviation Calculation; and (iii) an evaluation of all discussed measures in the context of Brazilian Portuguese texts from the web.
Publicado
27/10/2015
Como Citar
LIMA, Fabio dos Santos; SALVADOR, Laís do Nascimento.
Toward a Scoring Schema to Rank Candidate Instances of Ontological Classes - Extracting Brazilian Portuguese Texts from the Web. In: BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB (WEBMEDIA), 21. , 2015, Manaus.
Anais [...].
Porto Alegre: Sociedade Brasileira de Computação,
2015
.
p. 81-84.