OntoDrug: Enhancing Brazilian Health System Interoperability with a National Medication Ontology

  • Nelson Miranda USP
  • Matheus Matos Machado USP
  • Dilvan A. Moreira USP


This paper presents OntoDrug, an ontology designed to improve medication management in Brazil by integrating regulatory frameworks and standardizing terminologies. OntoDrug improves patient safety and treatment efficacy by accurately identifying and classifying medications and by supporting interoperability with health information systems. A proof-of-concept application integrated into the EHR system of the Hospital das Clínicas de Marília demonstrated OntoDrug's utility, achieving high precision and recall. In an experimental study using large language models (LLMs) grounded on the ontology, GPT-4 Turbo achieved a precision of 0.97, a recall of 1.0, and an F1-score of 0.99. We also evaluated the open-source models llama3-8b, llama3-70b, and gemma-7b-it, whose performance was close to GPT-4's. This effectiveness is primarily due to the use of LLMs. Although the LLMs enhanced performance, we identified challenges related to cost, privacy, and service availability. OntoDrug represents a significant advance in standardizing and optimizing medication information in Brazil.

Keywords: Medication Ontologies, Drug Management, Semantic Interoperability, Health Informatics


MIRANDA, Nelson; MACHADO, Matheus Matos; MOREIRA, Dilvan A. OntoDrug: Enhancing Brazilian Health System Interoperability with a National Medication Ontology. In: BRAZILIAN SYMPOSIUM ON MULTIMEDIA AND THE WEB (WEBMEDIA), 30., 2024, Juiz de Fora/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2024. p. 240-248. DOI: https://doi.org/10.5753/webmedia.2024.242062.