IDP-ASE: haplotyping and quantifying allele-specific expression at the gene and gene isoform level by hybrid sequencing

Nucleic Acids Res. 2017 Mar 17;45(5):e32. doi: 10.1093/nar/gkw1076.

Abstract

Allele-specific expression (ASE) is a fundamental problem in studying gene regulation and diploid transcriptome profiles, with two key challenges: (i) haplotyping and (ii) estimation of ASE at the gene isoform level. Existing ASE analysis methods are limited by a dependence on haplotyping from laborious experiments or extra genome/family trio data. In addition, there is a lack of methods for gene isoform level ASE analysis. We developed a tool, IDP-ASE, for full ASE analysis. By innovative integration of Third Generation Sequencing (TGS) long reads with Second Generation Sequencing (SGS) short reads, the accuracy of haplotyping and ASE quantification at the gene and gene isoform level was greatly improved as demonstrated by the gold standard data GM12878 data and semi-simulation data. In addition to methodology development, applications of IDP-ASE to human embryonic stem cells and breast cancer cells indicate that the imbalance of ASE and non-uniformity of gene isoform ASE is widespread, including tumorigenesis relevant genes and pluripotency markers. These results show that gene isoform expression and allele-specific expression cooperate to provide high diversity and complexity of gene regulation and expression, highlighting the importance of studying ASE at the gene isoform level. Our study provides a robust bioinformatics solution to understand ASE using RNA sequencing data only.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alleles*
  • Gene Expression Regulation
  • Haplotypes*
  • High-Throughput Nucleotide Sequencing / methods*
  • Human Embryonic Stem Cells / cytology
  • Human Embryonic Stem Cells / metabolism
  • Humans
  • MCF-7 Cells
  • RNA Isoforms / genetics*
  • RNA Isoforms / metabolism
  • RNA, Messenger / genetics*
  • RNA, Messenger / metabolism
  • Sequence Analysis, RNA
  • Transcriptome*

Substances

  • RNA Isoforms
  • RNA, Messenger