Target Enrichment Improves Mapping of Complex Traits by Deep Sequencing

G3 (Bethesda). 2015 Nov 3;6(1):67-77. doi: 10.1534/g3.115.023671.

Abstract

Complex traits such as crop performance and human diseases are controlled by multiple genetic loci, many of which have small effects and often go undetected by traditional quantitative trait locus (QTL) mapping. Recently, bulked segregant analysis with large F2 pools and genome-level markers (named extreme-QTL or X-QTL mapping) has been used to identify many QTL. To estimate parameters impacting QTL detection for X-QTL mapping, we simulated the effects of population size, marker density, and sequencing depth of markers on QTL detectability for traits with differing heritabilities. These simulations indicate that a high (>90%) chance of detecting QTL with at least 5% effect requires 5000× sequencing depth for a trait with heritability of 0.4-0.7. For most eukaryotic organisms, whole-genome sequencing at this depth is not economically feasible. Therefore, we tested and confirmed the feasibility of applying deep sequencing of target-enriched markers for X-QTL mapping. We used two traits in Arabidopsis thaliana with different heritabilities: seed size (H(2) = 0.61) and seedling greening in response to salt (H(2) = 0.94). We used a modified G test to identify QTL regions and developed a model-based statistical framework to resolve individual peaks by incorporating recombination rates. Multiple QTL were identified for both traits, including previously undiscovered QTL. We call our method target-enriched X-QTL (TEX-QTL) mapping; this mapping approach is not limited by the genome size or the availability of recombinant inbred populations and should be applicable to many organisms and traits.

Keywords: Arabidopsis thaliana; bulked segregant analysis; salt tolerance; seed size; target-enriched extreme quantitative trait locus mapping.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms
  • Arabidopsis / genetics
  • Arabidopsis / growth & development
  • Arabidopsis / metabolism
  • Chromosome Mapping*
  • Genetic Markers
  • High-Throughput Nucleotide Sequencing*
  • Inheritance Patterns
  • Models, Genetic
  • Phenotype
  • Polymorphism, Single Nucleotide
  • Quantitative Trait Loci*
  • Quantitative Trait, Heritable*
  • ROC Curve
  • Salt Tolerance / genetics
  • Seeds

Substances

  • Genetic Markers