High-throughput sequencing of a South American Amerindian

PLoS One. 2013 Dec 30;8(12):e83340. doi: 10.1371/journal.pone.0083340. eCollection 2013.

Abstract

The emergence of next-generation sequencing technologies allowed access to the vast amounts of information that are contained in the human genome. This information has contributed to the understanding of individual and population-based variability and improved the understanding of the evolutionary history of different human groups. However, the genome of a representative of the Amerindian populations had not been previously sequenced. Thus, the genome of an individual from a South American tribe was completely sequenced to further the understanding of the genetic variability of Amerindians. A total of 36.8 giga base pairs (Gbp) were sequenced and aligned with the human genome. These Gbp corresponded to 95.92% of the human genome with an estimated miscall rate of 0.0035 per sequenced bp. The data obtained from the alignment were used for SNP (single-nucleotide) and INDEL (insertion-deletion) calling, which resulted in the identification of 502,017 polymorphisms, of which 32,275 were potentially new high-confidence SNPs and 33,795 new INDELs, specific of South Native American populations. The authenticity of the sample as a member of the South Native American populations was confirmed through the analysis of the uniparental (maternal and paternal) lineages. The autosomal comparison distinguished the investigated sample from others continental populations and revealed a close relation to the Eastern Asian populations and Aboriginal Australian. Although, the findings did not discard the classical model of America settlement; it brought new insides to the understanding of the human population history. The present study indicates a remarkable genetic variability in human populations that must still be identified and contributes to the understanding of the genetic variability of South Native American populations and of the human populations history.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Cluster Analysis
  • DNA, Mitochondrial
  • Genetic Linkage
  • Genetics, Population
  • Genome, Human*
  • High-Throughput Nucleotide Sequencing*
  • Humans
  • INDEL Mutation
  • Indians, South American / genetics*
  • Molecular Sequence Annotation
  • Polymorphism, Single Nucleotide
  • Population Groups / genetics

Substances

  • DNA, Mitochondrial

Grants and funding

The work was supported by the Rede Paraense de Genômica e Proteômica (Governo do Para/Secretaria de Estado de Desenvolvimento, Ciência e Tecnologia/Fundação de Amparo a Pesquisa do Estado do Pará (FAPESPA)), Pró-Reitoria de Pesquisa e Pós-Graduação/Universidade Federal do Pará, Fundação de Amparo e Desenvolvimento da Pesquisa, Coordenação de Aperfeicoamento de Pessoal de Nivel Superior (Capes), FAPESPA and Conselho Nacional de Desenvolvimento Científico e Tecnológico. LG was supported by a grant from CAPES/Brasil. The funders had no role in the study design, the data collection and analysis, the decision to publish, or the preparation of the manuscript.