Whole-genome sequencing reveals principles of brain retrotransposition in neurodevelopmental disorders

Cell Res. 2018 Feb;28(2):187-203. doi: 10.1038/cr.2018.8. Epub 2018 Jan 12.

Abstract

Neural progenitor cells undergo somatic retrotransposition events, mainly involving L1 elements, which can be potentially deleterious. Here, we analyze the whole genomes of 20 brain samples and 80 non-brain samples, and characterized the retrotransposition landscape of patients affected by a variety of neurodevelopmental disorders including Rett syndrome, tuberous sclerosis, ataxia-telangiectasia and autism. We report that the number of retrotranspositions in brain tissues is higher than that observed in non-brain samples and even higher in pathologic vs normal brains. The majority of somatic brain retrotransposons integrate into pre-existing repetitive elements, preferentially A/T rich L1 sequences, resulting in nested insertions. Our findings document the fingerprints of encoded endonuclease independent mechanisms in the majority of L1 brain insertion events. The insertions are "non-classical" in that they are truncated at both ends, integrate in the same orientation as the host element, and their target sequences are enriched with a CCATT motif in contrast to the classical endonuclease motif of most other retrotranspositions. We show that L1Hs elements integrate preferentially into genes associated with neural functions and diseases. We propose that pre-existing retrotransposons act as "lightning rods" for novel insertions, which may give fine modulation of gene expression while safeguarding from deleterious events. Overwhelmingly uncontrolled retrotransposition may breach this safeguard mechanism and increase the risk of harmful mutagenesis in neurodevelopmental disorders.

MeSH terms

  • Adenine Nucleotides / genetics
  • Ataxia Telangiectasia Mutated Proteins / metabolism
  • Brain / physiopathology*
  • DNA Damage
  • Databases, Genetic
  • Endonucleases / genetics
  • Exons
  • Gene Expression Regulation
  • Genes / genetics
  • Genomics / methods
  • Humans
  • Long Interspersed Nucleotide Elements / genetics*
  • MicroRNAs / genetics
  • Mutation
  • Neurodevelopmental Disorders / genetics*
  • Neurons / metabolism
  • Statistics, Nonparametric
  • Thymine Nucleotides / genetics
  • Whole Genome Sequencing

Substances

  • Adenine Nucleotides
  • MicroRNAs
  • Thymine Nucleotides
  • ATM protein, human
  • Ataxia Telangiectasia Mutated Proteins
  • Endonucleases