Structural variants in 3000 rice genomes

Genome Res. 2019 May;29(5):870-880. doi: 10.1101/gr.241240.118. Epub 2019 Apr 16.

Abstract

Investigation of large structural variants (SVs) is a challenging yet important task in understanding trait differences in highly repetitive genomes. Combining different bioinformatic approaches for SV detection, we analyzed whole-genome sequencing data from 3000 rice genomes and identified 63 million individual SV calls that grouped into 1.5 million allelic variants. We found enrichment of long SVs in promoters and an excess of shorter variants in 5' UTRs. Across the rice genomes, we identified regions of high SV frequency enriched in stress response genes. We demonstrated how SVs may help in finding causative variants in genome-wide association analysis. These new insights into rice genome biology are valuable for understanding the effects SVs have on gene function, with the prospect of identifying novel agronomically important alleles that can be utilized to improve cultivated rice.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Alleles
  • Chromosome Mapping
  • DNA Transposable Elements
  • Genetic Variation*
  • Genome, Plant*
  • Genome-Wide Association Study / methods
  • Genomic Structural Variation*
  • Genomics / methods*
  • Oryza / genetics*
  • Phenotype
  • Sequence Analysis, DNA / methods
  • Stress, Physiological / genetics

Substances

  • DNA Transposable Elements