Evaluating Machine Learning Models for Essential Protein Identification

Resumo

Drug development is often a complex and time-consuming process. Especially in the initial phase, selecting a target for drug development can take many years. Essential genes and proteins are biological entities responsible for the biological processes of survival and reproduction of organisms. Studies indicate that essential genes tend to have higher expression and encode proteins that engage in more protein-protein interactions. All these characteristics make essential proteins potential drug targets. Thus, this work proposes using protein-protein interaction-based features to train and evaluate machine learning algorithms to identify essential proteins. Experiments with the organism Saccharomyces cerevisiae indicate that the application of the Random Forest algorithm and balancing techniques obtained better recall values.
Publicado
2022-09-21
Como Citar
DA SILVA COSTA, Jessica; RODRIGUES, Jorge Gabriel; BELLOZE, Kele. Evaluating Machine Learning Models for Essential Protein Identification. Anais do Simpósio Brasileiro de Bioinformática (BSB), [S.l.], p. 38-43, set. 2022. ISSN 2316-1248. Disponível em: <https://sol.sbc.org.br/index.php/bsb/article/view/22867>. Acesso em: 17 maio 2024.