The MAR databases: development and implementation of databases specific for marine metagenomics

Nucleic Acids Res. 2018 Jan 4;46(D1):D692-D699. doi: 10.1093/nar/gkx1036.

Abstract

We introduce the marine databases; MarRef, MarDB and MarCat (https://mmp.sfb.uit.no/databases/), which are publicly available resources that promote marine research and innovation. These data resources, which have been implemented in the Marine Metagenomics Portal (MMP) (https://mmp.sfb.uit.no/), are collections of richly annotated and manually curated contextual (metadata) and sequence databases representing three tiers of accuracy. While MarRef is a database for completely sequenced marine prokaryotic genomes, which represent a marine prokaryote reference genome database, MarDB includes all incomplete sequenced prokaryotic genomes regardless level of completeness. The last database, MarCat, represents a gene (protein) catalog of uncultivable (and cultivable) marine genes and proteins derived from marine metagenomics samples. The first versions of MarRef and MarDB contain 612 and 3726 records, respectively. Each record is built up of 106 metadata fields including attributes for sampling, sequencing, assembly and annotation in addition to the organism and taxonomic information. Currently, MarCat contains 1227 records with 55 metadata fields. Ontologies and controlled vocabularies are used in the contextual databases to enhance consistency. The user-friendly web interface lets the visitors browse, filter and search in the contextual databases and perform BLAST searches against the corresponding sequence databases. All contextual and sequence databases are freely accessible and downloadable from https://s1.sfb.uit.no/public/mar/.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Aquatic Organisms / genetics*
  • Aquatic Organisms / microbiology
  • Archaeal Proteins / genetics
  • Bacterial Proteins / genetics
  • Base Sequence
  • DNA, Archaeal / genetics
  • DNA, Bacterial / genetics
  • Databases, Genetic*
  • Databases, Nucleic Acid*
  • Databases, Protein*
  • Fresh Water / microbiology
  • Genes, Archaeal*
  • Genes, Bacterial*
  • Genome
  • Internet
  • Metagenomics*
  • Molecular Sequence Annotation
  • Seawater / microbiology
  • Soil Microbiology
  • User-Computer Interface
  • Water Microbiology
  • Web Browser

Substances

  • Archaeal Proteins
  • Bacterial Proteins
  • DNA, Archaeal
  • DNA, Bacterial