Genetic variations and diseases in UniProtKB/Swiss-Prot: the ins and outs of expert manual curation

Hum Mutat. 2014 Aug;35(8):927-35. doi: 10.1002/humu.22594. Epub 2014 Jun 24.

Abstract

During the last few years, next-generation sequencing (NGS) technologies have accelerated the detection of genetic variants resulting in the rapid discovery of new disease-associated genes. However, the wealth of variation data made available by NGS alone is not sufficient to understand the mechanisms underlying disease pathogenesis and manifestation. Multidisciplinary approaches combining sequence and clinical data with prior biological knowledge are needed to unravel the role of genetic variants in human health and disease. In this context, it is crucial that these data are linked, organized, and made readily available through reliable online resources. The Swiss-Prot section of the Universal Protein Knowledgebase (UniProtKB/Swiss-Prot) provides the scientific community with a collection of information on protein functions, interactions, biological pathways, as well as human genetic diseases and variants, all manually reviewed by experts. In this article, we present an overview of the information content of UniProtKB/Swiss-Prot to show how this knowledgebase can support researchers in the elucidation of the mechanisms leading from a molecular defect to a disease phenotype.

Keywords: UniProtKB/Swiss-Prot; controlled vocabulary; database; disease; functional annotation; genetic variants; manual curation.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Review

MeSH terms

  • Amino Acid Sequence
  • Databases, Protein / statistics & numerical data*
  • Genetic Association Studies*
  • Genetic Variation
  • Genetics, Medical*
  • Genome, Human
  • High-Throughput Nucleotide Sequencing
  • Humans
  • Internet
  • Knowledge Bases*
  • Molecular Sequence Annotation
  • Molecular Sequence Data
  • Proteome*
  • Software*
  • Terminology as Topic

Substances

  • Proteome