A draft chromosome-scale genome assembly of a commercial sugarcane

Jeremy R Shearman; Wirulda Pootakham; Chutima Sonthirod; Chaiwat Naktang; Thippawan Yoocha; Duangjai Sangsrakru; Nukoon Jomchai; Sissades Tongsima; Jittima Piriyapongsa; Chumpol Ngamphiw; Nanchaya Wanasen; Kittipat Ukoskit; Prapat Punpee; Peeraya Klomsa-Ard; Klanarong Sriroth; Jisen Zhang; Xingtan Zhang; Ray Ming; Somvong Tragoonrung; Sithichoke Tangphatsornruang

doi:10.1038/s41598-022-24823-0

Sci Rep. 2022 Nov 28;12(1):20474. doi: 10.1038/s41598-022-24823-0.

Authors

Jeremy R Shearman¹, Wirulda Pootakham², Chutima Sonthirod², Chaiwat Naktang², Thippawan Yoocha², Duangjai Sangsrakru², Nukoon Jomchai², Sissades Tongsima³, Jittima Piriyapongsa³, Chumpol Ngamphiw³, Nanchaya Wanasen⁴, Kittipat Ukoskit⁵, Prapat Punpee⁴,⁶, Peeraya Klomsa-Ard⁶, Klanarong Sriroth⁶, Jisen Zhang⁷, Xingtan Zhang⁷, Ray Ming⁷, Somvong Tragoonrung⁴, Sithichoke Tangphatsornruang⁸

Affiliations

¹ National Omics Center, National Science and Technology Development Agency, Pathum Thani, Thailand. jeremy.she@biotec.or.th.
² National Omics Center, National Science and Technology Development Agency, Pathum Thani, Thailand.
³ National Biobank of Thailand, National Science and Technology Development Agency, Pathum Thani, Thailand.
⁴ National Center for Genetic Engineering and Biotechnology, National Science and Technology Development Agency, Pathum Thani, Thailand.
⁵ Department of Biotechnology, Faculty of Science and Technology, Thammasat University, Rangsit Campus, Klong Luang, Pathum Thani, Thailand.
⁶ Crop Production, Mitr Phol Innovation and Research Center, Pathum Thani, Thailand.
⁷ Center for Genomics and Biotechnology, Fujian Agriculture and Forestry University, Fuzhou, Fujian, China.
⁸ National Omics Center, National Science and Technology Development Agency, Pathum Thani, Thailand. sithichoke.tan@biotec.or.th.

Abstract

Sugarcane accounts for a large portion of the world's sugar production. Modern commercial cultivars are complex hybrids of S. officinarum, S. spontaneum, and several other Saccharum species, resulting in an auto-allopolyploid with 8-12 copies of each chromosome. The current genome assembly gold standard is to generate a long read assembly followed by chromatin conformation capture sequencing to scaffold. We used the PacBio RSII and chromatin conformation capture sequencing to sequence and assemble the genome of a South East Asian commercial sugarcane cultivar, known as Khon Kaen 3. The Khon Kaen 3 genome assembled into 104,477 contigs totalling 7 Gb, which scaffolded into 56 pseudochromosomes containing 5.2 Gb of sequence. Genome annotation produced 242,406 genes from 30,927 orthogroups. Aligning the Khon Kaen 3 genome sequence to S. officinarum and S. spontaneum revealed a high level of apparent recombination, indicating a chimeric assembly. This assembly error is explained by high nucleotide identity between S. officinarum and S. spontaneum, where 91.8% of S. spontaneum aligns to S. officinarum at 94% identity. Thus, the subgenomes of commercial sugarcane are so similar that using short reads to correct long PacBio reads produced chimeric long reads. Future attempts to sequence sugarcane must take this information into account.
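
The assembly figures quoted in the abstract imply a few ratios that it does not state directly. The short Python sketch below derives them as a reading aid only. The inputs are the rounded values from the abstract (7 Gb, 5.2 Gb, 56 pseudochromosomes, 242,406 genes, 30,927 orthogroups), so the outputs are approximate. The variable names are illustrative and are not taken from the paper.

```python
# Figures as quoted in the abstract; 7 Gb and 5.2 Gb are rounded values,
# so every derived number below is approximate.
contig_total_gb = 7.0        # total length of the 104,477 contigs
scaffolded_gb = 5.2          # sequence placed in pseudochromosomes
n_pseudochromosomes = 56
n_genes = 242_406
n_orthogroups = 30_927

# Share of the contig assembly that was anchored to pseudochromosomes.
anchored_fraction = scaffolded_gb / contig_total_gb

# Mean pseudochromosome length in Mb (1 Gb = 1000 Mb).
mean_pseudochromosome_mb = scaffolded_gb * 1000 / n_pseudochromosomes

# Mean number of annotated genes per orthogroup.
genes_per_orthogroup = n_genes / n_orthogroups

print(f"Anchored fraction: {anchored_fraction:.1%}")
print(f"Mean pseudochromosome length: {mean_pseudochromosome_mb:.1f} Mb")
print(f"Mean genes per orthogroup: {genes_per_orthogroup:.2f}")
```

On these rounded inputs, roughly three quarters of the contig sequence ends up in pseudochromosomes. The mean genes-per-orthogroup value is a plain average and should not be read as a direct estimate of chromosome copy number, since orthogroups can also contain paralogs.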

MeSH terms

Chromatin
Edible Grain
Saccharum* / genetics
Sequence Analysis, DNA
Thailand

Substances

Chromatin