Unveiling Novel Secondary Metabolite Gene Clusters in a Paecilomyces sp. Strain From The Brazilian Amazon Through Computer Aided Genomic Analysis
Resumo
The genus Paecilomyces represents a promising source of bioactive secondary metabolites, yet it remains largely underexplored from chemical and genomic perspectives. To address this, a comprehensive genome mining approach was employed to characterize a novel Paecilomyces sp. strain, CMIINPA 1390, isolated from the Brazilian Amazon. Utilizing long-read sequencing, a high-quality, near-chromosome level genome assembly was achieved, comprising six linear scaffolds totaling 31.7 Mb and a circular mitochondrial genome of 52094 bp, with a high BUSCO completeness of 98.7%. Genomic analysis identified 42 biosynthetic gene clusters (BGCs), which is consistent with the genus average. Notably, 100% of these BGCs showed low to no correlation with known biosynthetic pathways, revealing the high potential of this species for the biosynthesis of novel molecules. Gene Cluster Family (GCF) analysis, conducted with 10 reference genomes for the Paecilomyces genus, revealed that among the NRPS-related GCFs, this new Amazonian species presents a singleton associated with nidulanin A biosynthesis. This singleton contained all necessary tailoring genes, but the core nonribosomal peptide synthetase (NRPS) gene exhibited a loss of several modules and domains, retaining only a single module out of the four canonical ones related to nidulanin A biosynthesis. This loss suggests a potential evolutionary divergence, possibly resulting in the production of a shorter or structurally simplified molecule. These findings underscore that Paecilomyces sp. CMI-INPA 1390 is a promising source of diverse and potentially novel secondary metabolites and demonstrate the role of genome mining in unveiling the full biosynthetic potential of filamentous fungi.
Referências
Altschul, S. F., Gish, W., Miller, W., Myers, E. W., and Lipman, D. J. (1990). Basic local alignment search tool. Journal of molecular biology, 215(3):403–410.
Blin, K., Shaw, S., Vader, L., Szenei, J., Reitz, Z. L., Augustijn, H. E., Cediel-Becerra, J. D., de Crécy-Lagard, V., Koetsier, R. A., Williams, S. E., et al. (2025). antismash 8.0: extended gene cluster detection capabilities and analyses of chemistry, enzymology, and regulation¡? mode pagerangestyle?¿. Nucleic acids research, page gkaf334.
Doyle, J. J. and Doyle, J. L. (1987). A rapid dna isolation procedure for small quantities of fresh leaf tissue. Phytochemical bulletin.
Eddy, S. R. (2011). Accelerated profile hmm searches. PLoS computational biology, 7(10):e1002195.
Fedorova, N. D., Moktali, V., and Medema, M. H. (2012). Bioinformatics approaches and software for detection of secondary metabolic gene clusters. In Fungal secondary metabolism: methods and protocols, pages 23–45. Springer.
Inglis, P. W., Sarmento, R. B., Gavião, C. F., and Valadares-Inglis, M. C. (2005). Dna fingerprinting of paecilomyces strains of potential use for the biological control of pests. World Journal of Microbiology and Biotechnology, 21(8):1487–1492.
Kolmogorov, M., Bickhart, D. M., Behsaz, B., Gurevich, A., Rayko, M., Shin, S. B., Kuhn, K., Yuan, J., Polevikov, E., Smith, T. P., et al. (2020). metaflye: scalable long-read metagenome assembly using repeat graphs. Nature methods, 17(11):1103–1110.
Manni, M., Berkeley, M. R., Seppey, M., Simão, F. A., and Zdobnov, E. M. (2021). Busco update: novel and streamlined workflows along with broader and deeper phylogenetic coverage for scoring of eukaryotic, prokaryotic, and viral genomes. Molecular biology and evolution, 38(10):4647–4654.
Moreno-Gavíra, A., Huertas, V., Diánez, F., Sánchez-Montesinos, B., and Santos, M. (2020). Paecilomyces and its importance in the biological control of agricultural pests and diseases. Plants, 9(12):1746.
Mózsik, L., Iacovelli, R., Bovenberg, R. A., and Driessen, A. J. (2022). Transcriptional activation of biosynthetic gene clusters in filamentous fungi. Frontiers in Bioengineering and Biotechnology, 10:901037.
Navarro-Muñoz, J. C., Selem-Mojica, N., Mullowney, M. W., Kautsar, S. A., Tryon, J. H., Parkinson, E. I., De Los Santos, E. L., Yeong, M., Cruz-Morales, P., Abubucker, S., et al. (2020). A computational framework to explore large-scale biosynthetic diversity. Nature chemical biology, 16(1):60–68.
Paysan-Lafosse, T., Andreeva, A., Blum, M., Chuguransky, S. R., Grego, T., Pinto, B. L., Salazar, G. A., Bileschi, M. L., Llinares-López, F., Meng-Papaxanthos, L., et al. (2025). The pfam protein families database: embracing ai/ml. Nucleic acids research, 53(D1):D523–D534.
Stanke, M., Keller, O., Gunduz, I., Hayes, A., Waack, S., and Morgenstern, B. (2006). Augustus: ab initio prediction of alternative transcripts. Nucleic acids research, 34(suppl 2):W435–W439.
Terlouw, B. R., Blin, K., Navarro-Munoz, J. C., Avalon, N. E., Chevrette, M. G., Egbert, S., Lee, S., Meijer, D., Recchia, M. J., Reitz, Z. L., et al. (2023). Mibig 3.0: a community-driven effort to annotate experimentally validated biosynthetic gene clusters. Nucleic acids research, 51(D1):D603–D610.
Visagie, C. M., Cruywagen, E. M., and Duong, T. A. (2024). A new paecilomyces from wooden utility poles in south africa. Fungal Systematics and Evolution, 13(1):163–181.
Wang, X., Peng, J., Sun, L., Bonito, G., Guo, Y., Li, Y., and Fu, Y. (2020). Genome sequencing of paecilomyces penicillatus provides insights into its phylogenetic placement and mycoparasitism mechanisms on morel mushrooms. Pathogens, 9(10):834.
