Genome Warehouse: A Public Repository Housing Genome-scale Data

Genomics Proteomics Bioinformatics. 2021 Aug;19(4):584-589. doi: 10.1016/j.gpb.2021.04.001. Epub 2021 Jun 24.

Abstract

The Genome Warehouse (GWH) is a public repository housing genome assembly data for a wide range of species and delivering a series of web services for genome data submission, storage, release, and sharing. As one of the core resources in the National Genomics Data Center (NGDC), part of the China National Center for Bioinformation (CNCB; https://ngdc.cncb.ac.cn), GWH accepts both full and partial (chloroplast, mitochondrion, and plasmid) genome sequences with different assembly levels, as well as an update of existing genome assemblies. For each assembly, GWH collects detailed genome-related metadata of biological project, biological sample, and genome assembly, in addition to genome sequence and annotation. To archive high-quality genome sequences and annotations, GWH is equipped with a uniform and standardized procedure for quality control. Besides basic browse and search functionalities, all released genome sequences and annotations can be visualized with JBrowse. By May 21, 2021, GWH has received 19,124 direct submissions covering a diversity of 1108 species and has released 8772 of them. Collectively, GWH serves as an important resource for genome-scale data management and provides free and publicly accessible data to support research activities throughout the world. GWH is publicly accessible at https://ngdc.cncb.ac.cn/gwh.

Keywords: Genome Warehouse; Genome annotation; Genome sequence; Genome submission; Quality control.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • China
  • Databases, Genetic*
  • Genome
  • Genomics / methods
  • Housing*