Entity Relation Extraction from News Articles in Portuguese for Competitive Intelligence Based on BERT

Resumo


Competitive intelligence (CI) is a relevant area of a corporation and can support the strategic business area by showing those responsible, helping decision making on how to position an organization in the market. This work uses the Bidirectional Transformer Encoding Representations (BERT) to process a sentence and its named entities and extract the parts of the sentences that represent or describe the semantic relationship between these named entities. The approach was developed for the Portuguese language, considering the financial domain and exploring deep linguistic representations without using other lexical-semantic resources. The results of the experiments show a precision of 73.5% using the Jaccard metric that measures the similarity between sentences. A second contribution of this work is the manually constructed dataset with more than 4.500 tuples (phrase, entity, entity) annotated.
Palavras-chave: Competitive intelligence, Entity relation classification, Relation extraction
Publicado
29/11/2021
Como Citar

Selecione um Formato
REYES, Daniel De Los; TRAJANO, Douglas; MANSSOUR, Isabel Harb; VIEIRA, Renata; BORDINI, Rafael H.. Entity Relation Extraction from News Articles in Portuguese for Competitive Intelligence Based on BERT. In: BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), 10. , 2021, Online. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2021 . ISSN 2643-6264.