Architecture of population-differentiated polymorphisms in the human genome

PLoS One. 2019 Oct 17;14(10):e0224089. doi: 10.1371/journal.pone.0224089. eCollection 2019.

Abstract

Population variation in disease and other phenotype are partly attributed to single nucleotide polymorphisms (SNPs) in the human genome. Due to selection pressure, two individuals from the same ancestral population have more genetic similarity compared to individuals from further geographic regions. Here, we elucidated the genomic population differentiation pattern, by interrogating >22,000,000 SNPs. Majority of population-differentiated (pd) SNPs (~95%), including the potentially functional (pf) (~84%) subset reside in non-genic regions, compared to the proportion of all SNPs (58%) found in non-genic regions. This suggests that differences between populations are more likely due to differences in gene regulation rather than protein function. Actin Cytoskeleton, Axonal Guidance and Protein Kinase A signaling pathways are enriched with genes carrying at least three pdSNPs (enriched pdGenes), while Antigen Presentation, Hepatic Fibrosis and Huntington Disease Signalling pathways are over-represented by enriched pf-pdGenes. An inverse correlation between chromosome size and the proportion of pd-/pf-pdSNPs was observed. Smaller chromosomes have relatively more of such SNPs including genes carrying these SNPs. Genes associated with common diseases and enriched with these pd-/pfpdSNPs are localized to 11 different chromosomes, with immune-related disease pd/pf-pdGenes mainly residing in chromosome 6 while neurological disease pd/pf-pdGenes residing in smaller chromosomes including chromosome 21/22. The associated diseases were reported to show population differences in incidence, severity and/or etiology. In summary, this study highlights the non-sporadic nature of population differentiation footprint in the human genome, which can potentially lead to the identification of genomic regions that play roles in the manifestation of phenotypic differences, including in disease predisposition and drug response.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Actin Cytoskeleton / genetics
  • Gene Expression Regulation / genetics
  • Genetics, Population
  • Genome, Human*
  • Humans
  • Polymorphism, Single Nucleotide*
  • Signal Transduction / genetics

Grants and funding

This work was supported by BMRC-SERC (112 148 0008) and block funding from National Cancer Center, Singapore and Duke-NUS Graduate Medical School to A/Prof Caroline G.L. LEE.