Telomere-to-telomere genome assembly of Phaeodactylum tricornutum

PeerJ. 2022 Jul 5:10:e13607. doi: 10.7717/peerj.13607. eCollection 2022.

Abstract

Phaeodactylum tricornutum is a marine diatom with a growing genetic toolbox available and is being used in many synthetic biology applications. While most of the genome has been assembled, the currently available genome assembly is not a completed telomere-to-telomere assembly. Here, we used Oxford Nanopore long reads to build a telomere-to-telomere genome for Phaeodactylum tricornutum. We developed a graph-based approach to extract all unique telomeres, and used this information to manually correct assembly errors. In total, we found 25 nuclear chromosomes that comprise all previously assembled fragments, in addition to the chloroplast and mitochondrial genomes. We found that chromosome 19 has filtered long-read coverage and a quality estimate that suggests significantly less haplotype sequence variation than the other chromosomes. This work improves upon the previous genome assembly and provides new opportunities for genetic engineering of this species, including creating designer synthetic chromosomes.

Keywords: Genome assembly; High-molecular weight DNA; Methylation; Nanopore sequencing; Phaeodactylum tricornutum; Telomere-to-telomere; Transposons.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Diatoms* / genetics
  • Genome, Mitochondrial* / genetics
  • Telomere / genetics

Grants and funding

This work was supported by the Ontario Graduate Scholarship (Daniel J. Giguere), Mitacs IT8360 (Samuel S. Slattery and Daniel J. Giguere), Natural Sciences and Engineering Research Council of Canada (NO: RGPIN-03878-2015 - Gregory B. Gloor, RGPIN-2018-06172 -Bogumil J. Karas, RPGIN-2015-04800 - David R. Edgell). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.