Challenges in studying genomic structural variant formation mechanisms: the short-read dilemma and beyond

Bioessays. 2011 Nov;33(11):840-50. doi: 10.1002/bies.201100075. Epub 2011 Sep 30.

Abstract

Next-generation sequencing (NGS) technologies have revolutionised the analysis of genomic structural variants (SVs), providing significant insights into SV de novo formation based on analyses of rearrangement breakpoint junctions. The short DNA reads generated by NGS, however, have also created novel obstacles by biasing the ascertainment of SVs, an aspect that we refer to as the 'short-read dilemma'. For example, recent studies have found that SVs are often complex, with SV formation generating large numbers of breakpoints in a single event (multi-breakpoint SVs) or structurally polymorphic loci having multiple allelic states (multi-allelic SVs). This complexity may be obscured in short reads, unless the data is analysed and interpreted within its wider genomic context. We discuss how novel approaches will help to overcome the short-read dilemma, and how integration of other sources of information, including the structure of chromatin, may help in the future to deepen the understanding of SV formation processes.

Publication types

  • Research Support, Non-U.S. Gov't
  • Review

MeSH terms

  • Alleles
  • DNA / genetics
  • DNA Breaks, Double-Stranded*
  • DNA Repair
  • DNA Replication
  • Databases, Genetic
  • Genetic Loci
  • Genetic Predisposition to Disease / genetics
  • Genome, Human*
  • Genomic Instability
  • Genomic Structural Variation*
  • Humans
  • Mutation Rate
  • Recombination, Genetic
  • Repetitive Sequences, Nucleic Acid
  • Sequence Analysis, DNA
  • Sequence Homology, Nucleic Acid

Substances

  • DNA