Finding your way through Pneumocystis sequences in the NCBI gene database

J Eukaryot Microbiol. 2014 Sep-Oct;61(5):537-55. doi: 10.1111/jeu.12132. Epub 2014 Jul 24.

Abstract

Pneumocystis sequences can be downloaded from GenBank for purposes such as primer/probe design or phylogenetic studies. Owing to changes in nomenclature and assignment, the available sequences are annotated with heterogeneous information, which makes practical use difficult. The aim of this study was a descriptive evaluation of different parameters of 532 Pneumocystis sequences of mitochondrial and ribosomal origin downloaded from GenBank, with regard to completeness and information content. Pneumocystis sequences were characterized by up to four different names. Official changes in nomenclature have been only partly implemented, and the use of the "forma specialis", a special feature of Pneumocystis, has been established only fragmentarily in the database. Hints of a mitochondrial or ribosomal genomic origin could be found but are easily overlooked, which makes it possible to download the wrong reference material. The host specification was either unavailable or varied in the language used and in where it appeared (in the title or in one of several subtitles), which limits the applicability of these sequences in phylogenetic studies. The declaration of products and of geographic origin was incomplete. The print version of this manuscript is complemented by an online database that contains detailed information on every accession number included in the meta-analysis.
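The kind of completeness check described in the abstract can be illustrated with a minimal sketch. It is not the authors' method. It assumes a record is available as GenBank flat-file text and that the relevant information sits in `/key="value"` qualifiers. The function names (`qualifiers`, `completeness`, `missing_fields`), the choice of qualifiers, and the sample record are hypothetical. The sample was invented for illustration and is not one of the 532 sequences evaluated.

```python
import re

# Assumption: these qualifier keys stand in for the parameters named in the
# abstract. Geographic origin is looked up under two keys because newer
# records are assumed to use /geo_loc_name where older ones use /country.
FIELDS = {
    "organism": ("organism",),
    "host": ("host",),
    "geographic origin": ("country", "geo_loc_name"),
    "genomic origin": ("organelle",),
    "product": ("product",),
}

# Matches quoted qualifiers such as /host="Homo sapiens" at the start of a
# feature-table line; values may wrap over several lines.
QUALIFIER = re.compile(r'^\s+/(\w+)="([^"]*)"', re.MULTILINE)


def qualifiers(record_text):
    """Return the first value found for each quoted qualifier key."""
    found = {}
    for key, value in QUALIFIER.findall(record_text):
        # Collapse line wraps and runs of spaces inside long values.
        found.setdefault(key, " ".join(value.split()))
    return found


def completeness(record_text):
    """Map each parameter label to its value, or None if it is absent."""
    quals = qualifiers(record_text)
    report = {}
    for label, keys in FIELDS.items():
        report[label] = next((quals[k] for k in keys if k in quals), None)
    return report


def missing_fields(record_text):
    """List the parameter labels for which the record gives no value."""
    return [label for label, value in completeness(record_text).items()
            if value is None]


# Invented feature-table fragment, for illustration only.
SAMPLE = '''\
FEATURES             Location/Qualifiers
     source          1..300
                     /organism="Pneumocystis carinii f. sp. hominis"
                     /organelle="mitochondrion"
                     /mol_type="genomic DNA"
                     /host="Homo sapiens"
     rRNA            <1..>300
                     /product="large subunit ribosomal RNA"
'''
```

For the invented fragment, `missing_fields(SAMPLE)` returns `['geographic origin']`, and `completeness(SAMPLE)["genomic origin"]` returns `'mitochondrion'`. A real evaluation would also have to inspect the record title and free-text fields, where the abstract notes that host and origin hints can be placed.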

Keywords: GenBank; geographic origin; host specification; information content; meta-analysis; mitochondrial and ribosomal genomic origin; nomenclature; online database.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Base Sequence
  • DNA, Fungal / chemistry
  • DNA, Fungal / genetics
  • DNA, Ribosomal / chemistry
  • DNA, Ribosomal / genetics
  • Databases, Nucleic Acid*
  • Molecular Sequence Data
  • Pneumocystis / chemistry
  • Pneumocystis / classification
  • Pneumocystis / genetics*
  • Sequence Alignment

Substances

  • DNA, Fungal
  • DNA, Ribosomal