Systematic artifacts in metagenomes from complex microbial communities

ISME J. 2009 Nov;3(11):1314-7. doi: 10.1038/ismej.2009.72. Epub 2009 Jul 9.

Abstract

Metagenomics is providing an unprecedented view of the taxonomic diversity, metabolic potential and ecological role of microbial communities in biomes as diverse as the mammalian gastrointestinal tract, the marine water column and soils. However, we have found a systematic error in metagenomes generated by 454-based pyrosequencing that leads to an overestimation of gene and taxon abundance; between 11% and 35% of sequences in a typical metagenome are artificial replicates. Here we document the error in several published and original datasets and offer a web-based solution (http://microbiomes.msu.edu/replicates) for identifying and removing these artifacts.
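
The idea of an artificial replicate can be illustrated with a minimal sketch. This is not the method behind the web tool named above, which the abstract does not describe. It assumes, for illustration only, that a replicate is any read sharing its first 20 bases with an earlier read; the function names, the `prefix_len` parameter and the example reads are all hypothetical.

```python
from collections import defaultdict


def find_prefix_replicates(reads, prefix_len=20):
    """Split read IDs into (kept, removed) using a shared-prefix heuristic.

    ASSUMPTION: reads whose first `prefix_len` bases match (case-insensitive)
    are treated as replicates of one another. This is an illustrative
    simplification, not the criterion used by the published tool.
    Reads shorter than `prefix_len` are grouped only with exact duplicates.
    """
    groups = defaultdict(list)
    for read_id, seq in reads:
        key = seq[:prefix_len].upper()
        groups[key].append(read_id)

    kept, removed = [], []
    for ids in groups.values():
        kept.append(ids[0])        # first read seen represents the group
        removed.extend(ids[1:])    # later reads are flagged as replicates
    return kept, removed


def replicate_fraction(reads, prefix_len=20):
    """Return the fraction of reads flagged as replicates (0.0 if no reads)."""
    kept, removed = find_prefix_replicates(reads, prefix_len)
    total = len(kept) + len(removed)
    return len(removed) / total if total else 0.0


if __name__ == "__main__":
    # Hypothetical reads: r2 and r4 start with the same 20 bases as r1.
    example_reads = [
        ("r1", "ACGTACGTACGTACGTACGTAAA"),
        ("r2", "ACGTACGTACGTACGTACGTAAACCC"),
        ("r3", "TTTTACGTACGTACGTACGTAAA"),
        ("r4", "acgtacgtacgtacgtacgtGG"),
    ]
    print(find_prefix_replicates(example_reads))  # (['r1', 'r3'], ['r2', 'r4'])
    print(replicate_fraction(example_reads))      # 0.5
```

Counting each prefix group once rather than once per read is what removes the inflation in gene and taxon abundance that the abstract reports. A prefix match alone can also flag genuine natural duplicates, so a real analysis would need a stricter criterion.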

Publication types

  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Base Sequence
  • Databases, Genetic / standards*
  • Metagenome*
  • Metagenomics / standards*
  • Molecular Sequence Data
  • Sequence Alignment
  • Sequence Analysis, DNA / standards
  • Soil Microbiology*