Genome Sequence and Comparative Analysis of Colletotrichum gloeosporioides Isolated from Liriodendron Leaves

Phytopathology. 2020 Jul;110(7):1260-1269. doi: 10.1094/PHYTO-12-19-0452-R. Epub 2020 May 13.

Abstract

Colletotrichum gloeosporioides is a hemibiotrophic pathogen causing significant losses to economically important crops and forest trees, including Liriodendron. To explore the interaction between C. gloeosporioides and Liriodendron and to identify the candidate genes determining the pathogenesis, we sequenced and assembled the whole genome of C. gloeosporioides Lc1 (CgLc1) using PacBio and Illumina next generation sequencing and performed a comparative genomic analysis between CgLc1 and Cg01, the latter being a described endophytic species of the C. gloeosporioides complex. Gene structure prediction identified 15,744 protein-coding genes and 837 noncoding RNAs. Species-specific genes were characterized using an ortholog analysis followed by a pathway enrichment analysis, which showed that genes specific to CgLc1 were enriched for the arginine biosynthetic process. Furthermore, genome synteny analysis revealed that most of the protein-coding genes fell into collinear blocks. However, two clusters of polyketide synthase genes were identified to be specific for CgLc1, suggesting that they might have an important role in virulence control. Transcriptional regulators coexpressed with polyketide synthase genes were detected through a Weighted Correlation Network Analysis. Taken together, this work provides new insight into the virulence- and pathogenesis-associated genes present in C. gloeosporioides and its possible lifestyle.

Keywords: Colletotrichum gloeosporioides; Liriodendron; coexpression network analysis; comparative genomic analysis; polyketide synthase; whole-genome sequencing.

MeSH terms

  • Colletotrichum*
  • Liriodendron*
  • Plant Diseases
  • Plant Leaves
  • Virulence