The population genomics of rhesus macaques (Macaca mulatta) based on whole-genome sequences

Genome Res. 2016 Dec;26(12):1651-1662. doi: 10.1101/gr.204255.116. Epub 2016 Oct 17.

Abstract

Rhesus macaques (Macaca mulatta) are the most widely used nonhuman primate in biomedical research, have the largest natural geographic distribution of any nonhuman primate, and have been the focus of much evolutionary and behavioral investigation. Consequently, rhesus macaques are one of the most thoroughly studied nonhuman primate species. However, little is known about genome-wide genetic variation in this species. A detailed understanding of extant genomic variation among rhesus macaques has implications for the use of this species as a model for studies of human health and disease, as well as for evolutionary population genomics. Whole-genome sequencing analysis of 133 rhesus macaques revealed more than 43.7 million single-nucleotide variants, including thousands predicted to alter protein sequences, transcript splicing, and transcription factor binding sites. Rhesus macaques exhibit 2.5-fold higher overall nucleotide diversity and slightly elevated putative functional variation compared with humans. This functional variation in macaques provides opportunities for analyses of coding and noncoding variation, and its cellular consequences. Despite modestly higher levels of nonsynonymous variation in the macaques, the estimated distribution of fitness effects and the ratio of nonsynonymous to synonymous variants suggest that purifying selection has had stronger effects in rhesus macaques than in humans. Demographic reconstructions indicate this species has experienced a consistently large but fluctuating population size. Overall, the results presented here provide new insights into the population genomics of nonhuman primates and expand genomic information directly relevant to primate models of human disease.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Evolution, Molecular
  • Female
  • Genetic Fitness
  • High-Throughput Nucleotide Sequencing / methods*
  • Macaca mulatta / classification
  • Macaca mulatta / genetics*
  • Models, Animal
  • Polymorphism, Single Nucleotide
  • Population Density
  • Whole Genome Sequencing / methods*