Low-pass sequencing increases the power of GWAS and decreases measurement error of polygenic risk scores compared to genotyping arrays

Genome Res. 2021 Apr;31(4):529-537. doi: 10.1101/gr.266486.120. Epub 2021 Feb 3.

Abstract

Low-pass sequencing (sequencing a genome to an average depth less than 1× coverage) combined with genotype imputation has been proposed as an alternative to genotyping arrays for trait mapping and calculation of polygenic scores. To empirically assess the relative performance of these technologies for different applications, we performed low-pass sequencing (targeting coverage levels of 0.5× and 1×) and array genotyping (using the Illumina Global Screening Array [GSA]) on 120 DNA samples derived from African- and European-ancestry individuals that are part of the 1000 Genomes Project. We then imputed both the sequencing data and the genotyping array data to the 1000 Genomes Phase 3 haplotype reference panel using a leave-one-out design. We evaluated overall imputation accuracy from these different assays as well as overall power for GWAS from imputed data and computed polygenic risk scores for coronary artery disease and breast cancer using previously derived weights. We conclude that low-pass sequencing plus imputation, in addition to providing a substantial increase in statistical power for genome-wide association studies, provides increased accuracy for polygenic risk prediction at effective coverages of ∼0.5× and higher compared to the Illumina GSA.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Genome, Human
  • Genome-Wide Association Study* / methods
  • Genome-Wide Association Study* / standards
  • Genotype*
  • Haplotypes
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • Risk Factors

Associated data

  • figshare/10.6084/m9.figshare.13637892.v1
  • figshare/10.6084/m9.figshare.13641094
  • figshare/10.6084/m9.figshare.13641120