Accuracy of RNA Structure Prediction Depends on the Pseudoknot Grammar

Resumo

Most models for pseudoknotted RNA structures can be described by multi-context free grammars (MCFGs) and thus are amenable to dynamic programming algorithms. They differ strongly in their definition of admissible structures and thus the search space over which structures are optimized. The accuracy of structure prediction can be expected to depend on choice of the MCFG: models that are too inclusive likely over-predict pseudoknots, while restrictive models by their definition already exclude more complex pseudoknotted structures. A systematic analysis of the impact of the grammar, however, is difficult since available implementations use incomparable energy parameters. We show here that Algebraic Dynamic Programming over MCFGs naturally disentangles energy models (as specified by the evaluation algebra) and the definition of search space defined by a MCFG. Preliminary computational experiments indicate that the choice of the grammar has an important impact already for short RNA sequences.
Publicado
2022-09-21
Como Citar
EGGERS, Dustyn; HÖNER ZU SIEDERDISSEN, Christian; STADLER, Peter F.. Accuracy of RNA Structure Prediction Depends on the Pseudoknot Grammar. Anais do Simpósio Brasileiro de Bioinformática (BSB), [S.l.], p. 20-31, set. 2022. ISSN 2316-1248. Disponível em: <https://sol.sbc.org.br/index.php/bsb/article/view/22856>. Acesso em: 17 maio 2024.