Intelligent Information System for Extracting Knowledge from Pharmaceutical Package Inserts

  • Cristiano da Silveira Colombo (Universidade Federal do Espírito Santo / Instituto Federal do Espírito Santo)
  • Elias Silva de Oliveira (Universidade Federal do Espírito Santo)


Pharmaceutical package inserts are a rich source of information about medicines. To guide patients, health professionals need information about the appropriate medication for an illness, and this information can be found in package inserts. Extracting it manually is challenging, especially when it is needed quickly, and doubts about adverse reactions or interactions with other drugs are common. Automatically extracting information from package inserts can help health professionals make decisions about therapies and drug prescriptions. This article describes the creation of an Artificial Intelligence model, based on a hybrid approach called CRF+LG, that recognizes named entities of medicines, diseases and people in package inserts. The model was tested on two sets of package inserts: one for stomach pain treatment and one for diabetes treatment. The work was developed under the aegis of Soft Systems Theory. The research is prescriptive in character, was evaluated through experiments, and its results were analyzed quantitatively. In the experiments, the model achieved an F-measure of 82.08% for disease entities, 59.14% for medicine entities and 94.26% for person entities. The main contribution of the article is a model that automatically recognizes named entities in pharmaceutical package inserts. This model can be integrated into an Intelligent Information System to assist health professionals in making decisions about therapies and drug prescriptions.
Keywords: CRF+LG, Information Extraction, Named Entity Recognition, Pharmaceutical Package Inserts
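
The abstract reports results as an F-measure per entity type. As a minimal sketch of how such a score can be computed, the Python snippet below derives precision, recall and F-measure per label from gold and predicted entity spans. The exact-match criterion, the label names (`DISEASE`, `MEDICINE`), the function name `f_measure_by_label` and the toy spans are all assumptions made for illustration; they are not taken from the article and the numbers are not the article's results.

```python
from typing import Dict, Set, Tuple

# An entity span: (first token index, last token index, entity label).
Span = Tuple[int, int, str]


def f_measure_by_label(gold: Set[Span], pred: Set[Span]) -> Dict[str, Dict[str, float]]:
    """Compute precision, recall and F-measure for each entity label.

    Assumption: a predicted entity counts as correct only if its
    boundaries and its label both match a gold entity exactly.
    """
    scores: Dict[str, Dict[str, float]] = {}
    labels = {span[2] for span in gold | pred}
    for label in sorted(labels):
        gold_l = {s for s in gold if s[2] == label}
        pred_l = {s for s in pred if s[2] == label}
        true_pos = len(gold_l & pred_l)
        precision = true_pos / len(pred_l) if pred_l else 0.0
        recall = true_pos / len(gold_l) if gold_l else 0.0
        denom = precision + recall
        # F-measure is the harmonic mean of precision and recall.
        f1 = 2 * precision * recall / denom if denom else 0.0
        scores[label] = {"precision": precision, "recall": recall, "f1": f1}
    return scores


# Toy data (hypothetical): two gold diseases, one of which is found,
# plus one spurious disease prediction; the single medicine is found.
gold = {(0, 1, "MEDICINE"), (5, 6, "DISEASE"), (9, 10, "DISEASE")}
pred = {(0, 1, "MEDICINE"), (5, 6, "DISEASE"), (12, 13, "DISEASE")}

for label, s in f_measure_by_label(gold, pred).items():
    print(f"{label}: P={s['precision']:.2f} R={s['recall']:.2f} F={s['f1']:.2f}")
```

Scoring each label separately, as the abstract does, keeps a weak class (here, medicines) from being hidden by a strong one (people) in a single aggregate figure.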


COLOMBO, Cristiano da Silveira; OLIVEIRA, Elias Silva de. Intelligent Information System for Extracting Knowledge from Pharmaceutical Package Inserts. In: SIMPÓSIO BRASILEIRO DE SISTEMAS DE INFORMAÇÃO (SBSI), 18., 2022, Curitiba. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2022.