De Novo Genome Sequence Assemblies of Gossypium raimondii and Gossypium turneri

G3 (Bethesda). 2019 Oct 7;9(10):3079-3085. doi: 10.1534/g3.119.400392.

Abstract

Cotton is an agriculturally important crop. Because of its importance, a genome sequence of a diploid cotton species (Gossypium raimondii, D-genome) was first assembled using Sanger sequencing data in 2012. Improvements to DNA sequencing technology have improved accuracy and correctness of assembled genome sequences. Here we report a new de novo genome assembly of G. raimondii and its close relative G. turneri The two genomes were assembled to a chromosome level using PacBio long-read technology, HiC, and Bionano optical mapping. This report corrects some minor assembly errors found in the Sanger assembly of G. raimondii We also compare the genome sequences of these two species for gene composition, repetitive element composition, and collinearity. Most of the identified structural rearrangements between these two species are due to intra-chromosomal inversions. More inversions were found in the G. turneri genome sequence than the G. raimondii genome sequence. These findings and updates to the D-genome sequence will improve accuracy and translation of genomics to cotton breeding and genetics.

Keywords: Gossypium raimondii; Gossypium turneri; PacBio; cotton; genome sequence.

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Computational Biology* / methods
  • Genome, Plant*
  • Genomics* / methods
  • Gossypium / classification*
  • Gossypium / genetics*
  • Molecular Sequence Annotation
  • Repetitive Sequences, Nucleic Acid