Resequencing a core collection of upland cotton identifies genomic variation and loci influencing fiber quality and yield

Nat Genet. 2018 Jun;50(6):803-813. doi: 10.1038/s41588-018-0119-7. Epub 2018 May 7.

Abstract

Upland cotton is the most important natural-fiber crop. The genomic variation of diverse germplasms and alleles underpinning fiber quality and yield should be extensively explored. Here, we resequenced a core collection comprising 419 accessions with 6.55-fold coverage depth and identified approximately 3.66 million SNPs for evaluating the genomic variation. We performed phenotyping across 12 environments and conducted genome-wide association study of 13 fiber-related traits. 7,383 unique SNPs were significantly associated with these traits and were located within or near 4,820 genes; more associated loci were detected for fiber quality than fiber yield, and more fiber genes were detected in the D than the A subgenome. Several previously undescribed causal genes for days to flowering, fiber length, and fiber strength were identified. Phenotypic selection for these traits increased the frequency of elite alleles during domestication and breeding. These results provide targets for molecular selection and genetic manipulation in cotton improvement.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles
  • Breeding / methods
  • Cotton Fiber
  • Domestication
  • Genome, Plant / genetics*
  • Genome-Wide Association Study / methods
  • Genomics / methods
  • Gossypium / genetics*
  • Linkage Disequilibrium / genetics
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics*
  • Quantitative Trait Loci / genetics