Evaluating Machine Learning Models for Essential Protein Identification

Abstract


Drug development is often complex and time-consuming; in the initial phase alone, selecting a target can take many years. Essential genes and proteins are those required for the survival and reproduction of organisms. Studies indicate that essential genes tend to be more highly expressed and to encode proteins that take part in more protein-protein interactions, which makes essential proteins potential drug targets. This work therefore proposes using features based on protein-protein interactions to train and evaluate machine learning algorithms that identify essential proteins. Experiments with the organism Saccharomyces cerevisiae indicate that the Random Forest algorithm combined with balancing techniques obtained better recall values.
Keywords: Machine learning, Protein-protein interaction, Essential protein
Published
21/09/2022
DA SILVA COSTA, Jessica; RODRIGUES, Jorge Gabriel; BELLOZE, Kele. Evaluating Machine Learning Models for Essential Protein Identification. In: SIMPÓSIO BRASILEIRO DE BIOINFORMÁTICA (BSB), 15., 2022, Búzios/RJ. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2022. p. 38-43. ISSN 2316-1248.
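
The abstract names three ingredients: features derived from protein-protein interactions, class balancing, and recall as the evaluation metric. The sketch below illustrates each with the Python standard library only. It is not the authors' pipeline: the choice of node degree as the feature, random oversampling as the balancing technique, the toy network, the protein names, and the labels are all assumptions made for illustration. The classifier itself is left out; in practice the balanced rows would be passed to a Random Forest implementation such as scikit-learn's RandomForestClassifier.

```python
import random
from collections import defaultdict


def degree_features(edges):
    """Map each protein to its number of distinct interaction partners."""
    neighbors = defaultdict(set)
    for a, b in edges:
        if a != b:  # ignore self-interactions
            neighbors[a].add(b)
            neighbors[b].add(a)
    return {protein: len(partners) for protein, partners in neighbors.items()}


def oversample_minority(X, y, seed=0):
    """Randomly duplicate rows of smaller classes until all classes match the largest."""
    rng = random.Random(seed)
    by_class = defaultdict(list)
    for row, label in zip(X, y):
        by_class[label].append(row)
    target = max(len(rows) for rows in by_class.values())
    X_bal, y_bal = [], []
    for label, rows in by_class.items():
        extra = [rng.choice(rows) for _ in range(target - len(rows))]
        for row in rows + extra:
            X_bal.append(row)
            y_bal.append(label)
    return X_bal, y_bal


def recall(y_true, y_pred, positive=1):
    """Fraction of truly positive (essential) items that were predicted positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    return tp / (tp + fn) if (tp + fn) else 0.0


# Toy, made-up interaction network; names and labels are placeholders.
edges = [("P1", "P2"), ("P1", "P3"), ("P1", "P4"),
         ("P2", "P3"), ("P4", "P5"), ("P5", "P6")]
essential = {"P1"}  # hypothetical essentiality labels

deg = degree_features(edges)
proteins = sorted(deg)
X = [[deg[p]] for p in proteins]                    # one feature per protein: degree
y = [1 if p in essential else 0 for p in proteins]  # 1 = essential, 0 = non-essential

# Balance the training data before handing it to a classifier.
X_bal, y_bal = oversample_minority(X, y)
```

Recall is the natural metric here because essential proteins are the minority class: a model that labels everything non-essential can reach high accuracy while finding no drug-target candidates at all, and recall on the essential class exposes that failure.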