Machine learning reveals bilateral distribution of somatic L1 insertions in human neurons and glia

Nat Neurosci. 2021 Feb;24(2):186-196. doi: 10.1038/s41593-020-00767-4. Epub 2021 Jan 11.

Abstract

Retrotransposons can cause somatic genome variation in the human nervous system, which is hypothesized to have relevance to brain development and neuropsychiatric disease. However, the detection of individual somatic mobile element insertions presents a difficult signal-to-noise problem. Using a machine-learning method (RetroSom) and deep whole-genome sequencing, we analyzed L1 and Alu retrotransposition in sorted neurons and glia from human brains. We characterized two brain-specific L1 insertions in neurons and glia from a donor with schizophrenia. There was anatomical distribution of the L1 insertions in neurons and glia across both hemispheres, indicating retrotransposition occurred during early embryogenesis. Both insertions were within the introns of genes (CNNM2 and FRMD4A) inside genomic loci associated with neuropsychiatric disorders. Proof-of-principle experiments revealed these L1 insertions significantly reduced gene expression. These results demonstrate that RetroSom has broad applications for studies of brain development and may provide insight into the possible pathological effects of somatic retrotransposition.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adaptor Proteins, Signal Transducing / genetics
  • Adult
  • Cation Transport Proteins / genetics
  • Embryonic Development / genetics
  • Female
  • Genome / genetics
  • HeLa Cells
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Long Interspersed Nucleotide Elements
  • Machine Learning*
  • Mental Disorders / genetics
  • Mutagenesis, Insertional / genetics*
  • Neuroglia*
  • Neurons*
  • Pregnancy
  • Retroelements
  • Schizophrenia / genetics

Substances

  • Adaptor Proteins, Signal Transducing
  • CNNM2 protein, human
  • Cation Transport Proteins
  • FERM domain-containing protein 4A, human
  • Retroelements