Pseudoalignment tools as an efficient alternative to detect repeated transposable elements in scRNAseq data

Bioinformatics. 2023 Jan 1;39(1):btac737. doi: 10.1093/bioinformatics/btac737.

Abstract

Motivation: Transposable elements (TE) have played a major role in configuring the structures of mammalian genomes through evolution. In normal conditions, the expression of these elements is repressed by different epigenetic regulation mechanisms such as DNA methylation, histone modification and regulation by small RNAs. TE re-activation is associated with stemness potential acquisition, regulation of innate immunity and disease, such as cancer. However, the vast majority of current knowledge in the field is based on bulk expression studies, and very little is known on cell-type- or state-specific expression of TE-derived transcripts. Therefore, cost-efficient single-cell-resolution TE expression analytical approaches are needed.

Results: We have implemented an analytical approach based on pseudoalignment to consensus sequences to incorporate TE expression information to scRNAseq data.

Availability and implementation: All the data and code implemented are available as Supplementary data and in: https://github.com/jmzvillarreal/kallisto_TE_scRNAseq.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • DNA Transposable Elements*
  • Epigenesis, Genetic*
  • Exome Sequencing
  • Mammals / genetics
  • RNA
  • Single-Cell Gene Expression Analysis

Substances

  • DNA Transposable Elements
  • RNA