A Panel of rSNPs Demonstrating Allelic Asymmetry in Both ChIP-seq and RNA-seq Data and the Search for Their Phenotypic Outcomes through Analysis of DEGs

Int J Mol Sci. 2021 Jul 6;22(14):7240. doi: 10.3390/ijms22147240.

Abstract

Detecting allelic asymmetry in gene expression from RNA-seq data, or in transcription factor binding from ChIP-seq data, is one of the approaches currently used to identify functional genetic variants that can affect gene expression (regulatory SNPs, or rSNPs). In this study, we searched for rSNPs using data for human pulmonary arterial endothelial cells (PAECs) available from the Sequence Read Archive (SRA). Allele-asymmetric binding and expression events were analyzed in paired ChIP-seq data for the H3K4me3 mark and RNA-seq data obtained for 19 individuals. Two statistical approaches, weighted z-scores and predicted probabilities, were used to improve the efficiency of finding rSNPs. In total, we identified 14,266 rSNPs associated with both allele-specific binding and expression. Among them, 645 rSNPs were associated with GWAS phenotypes, 4746 rSNPs were reported as eQTLs by GTEx, and 11,536 rSNPs were located in 374 candidate transcription factor binding motifs. Additionally, we searched for rSNPs associated with gene expression using an SRA RNA-seq dataset for 281 clinically annotated human postmortem brain samples and detected eQTLs for 2505 rSNPs. Based on these results, we conducted Gene Ontology (GO), Disease Ontology (DO), and Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway enrichment analyses and constructed protein-protein interaction networks to represent the top-ranked biological processes with a possible contribution to the phenotypic outcome.
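As an illustration of how allelic asymmetry at a heterozygous SNP might be scored and pooled across individuals, the following is a minimal sketch only. The abstract does not specify the test or the weighting scheme, so everything here is an assumption: an exact two-sided binomial test against a 50:50 allele ratio, conversion to a signed z-score, and a Stouffer-style weighted combination with square-root-of-depth weights. The function names and read counts are hypothetical.

```python
import math
from statistics import NormalDist

def binom_two_sided_p(ref, alt):
    """Exact two-sided binomial p-value for H0: reference-allele fraction = 0.5."""
    n = ref + alt
    k = min(ref, alt)
    # Lower tail P(X <= k) under Binomial(n, 0.5), doubled and capped at 1.
    tail = sum(math.comb(n, i) for i in range(k + 1)) / 2 ** n
    return min(1.0, 2 * tail)

def signed_z(ref, alt):
    """Convert the p-value to a z-score; positive when the reference allele dominates."""
    p = max(binom_two_sided_p(ref, alt), 1e-300)  # guard against underflow to 0
    z = -NormalDist().inv_cdf(p / 2)              # magnitude from the two-sided p-value
    return z if ref >= alt else -z

def weighted_z(counts):
    """Stouffer-style weighted z across individuals.

    Assumption: weights are sqrt(read depth); the paper's weights may differ.
    """
    zs = [signed_z(r, a) for r, a in counts]
    ws = [math.sqrt(r + a) for r, a in counts]
    return sum(w * z for w, z in zip(ws, zs)) / math.sqrt(sum(w * w for w in ws))

# Hypothetical (ref, alt) read counts at one SNP in three heterozygous individuals.
counts = [(30, 10), (28, 12), (35, 9)]
combined = weighted_z(counts)
print(f"combined z = {combined:.2f}")
```

Under this scheme, a SNP showing a consistent bias toward the same allele in several individuals receives a larger combined z-score than any single individual provides, while biases in opposite directions cancel out.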

Keywords: Genotype-Tissue Expression; allele-specific events; eQTLs; enrichment analysis; molecular phenotype; protein-protein interaction networks; regulatory SNPs.

MeSH terms

  • Alleles
  • Brain / physiology
  • Cell Line, Tumor
  • Chromatin Immunoprecipitation / methods
  • Chromatin Immunoprecipitation Sequencing / methods
  • Endothelial Cells / physiology
  • Gene Expression / genetics
  • Gene Ontology
  • Histones
  • Humans
  • Phenotype
  • Polymorphism, Single Nucleotide / genetics*
  • Protein Interaction Maps / genetics
  • Pulmonary Artery
  • RNA-Seq / methods
  • Transcription Factors

Substances

  • Histones
  • Transcription Factors
  • histone H3 trimethyl Lys4