SNVDis: a proteome-wide analysis service for evaluating nsSNVs in protein functional sites and pathways

Genomics Proteomics Bioinformatics. 2013 Apr;11(2):122-6. doi: 10.1016/j.gpb.2012.10.003. Epub 2012 Dec 5.

Abstract

Amino acid changes due to non-synonymous variation are included as annotations for individual proteins in UniProtKB/Swiss-Prot and RefSeq which present biological data in a protein- or gene-centric fashion. Unfortunately, proteome-wide analysis of non-synonymous single-nucleotide variations (nsSNVs) is not easy to perform because information on nsSNVs and functionally important sites are not well integrated both within and between databases and their search engines. We have developed SNVDis that allows evaluation of proteome-wide nsSNV distribution in functional sites, domains and pathways. More specifically, we have integrated human-specific data from major variation databases (UniProtKB, dbSNP and COSMIC), comprehensive sequence feature annotation from UniProtKB, Pfam, RefSeq, Conserved Domain Database (CDD) and pathway information from Protein ANalysis THrough Evolutionary Relationships (PANTHER) and mapped all of them in a uniform and comprehensive way to the human reference proteome provided by UniProtKB/Swiss-Prot. Integrated information of active sites, pathways, binding sites, domains, which are extracted from a number of different sources, provides a detailed overview of how nsSNVs are distributed over the human proteome and pathways and how they intersect with functional sites of proteins. Additionally, it is possible to find out whether there is an over- or under-representation of nsSNVs in specific domains, pathways or user-defined protein lists. The underlying datasets are updated once every 3months. SNVDis is freely available at http://hive.biochemistry.gwu.edu/tool/snvdis.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Binding Sites
  • Databases, Protein
  • Humans
  • Internet
  • Polymorphism, Single Nucleotide*
  • Proteins / analysis
  • Proteins / chemistry*
  • Proteins / genetics
  • Proteome
  • Proteomics / methods*

Substances

  • Proteins
  • Proteome