SpliceProt: a protein sequence repository of predicted human splice variants

Proteomics. 2014 Feb;14(2-3):181-5. doi: 10.1002/pmic.201300078.

Abstract

The mechanism of alternative splicing in the transcriptome may increase the proteome diversity in eukaryotes. In proteomics, several studies aim to use protein sequence repositories to annotate MS experiments or to detect differentially expressed proteins. However, the available protein sequence repositories are not designed to fully detect protein isoforms derived from mRNA splice variants. To foster knowledge for the field, here we introduce SpliceProt, a new protein sequence repository of transcriptome experimental data used to investigate for putative splice variants in human proteomes. Current version of SpliceProt contains 159 719 non-redundant putative polypeptide sequences. The assessment of the potential of SpliceProt in detecting new protein isoforms resulting from alternative splicing was performed by using publicly available proteomics data. We detected 173 peptides hypothetically derived from splice variants, which 54 of them are not present in UniprotKB/TrEMBL sequence repository. In comparison to other protein sequence repositories, SpliceProt contains a greater number of unique peptides and is able to detect more splice variants. Therefore, SpliceProt provides a solution for the annotation of proteomics experiments regarding splice isofoms. The repository files containing the translated sequences of the predicted splice variants and a visualization tool are freely available at http://lbbc.inca.gov.br/spliceprot.

Keywords: Alternative splicing; Bioinformatics.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Alternative Splicing*
  • Amino Acid Sequence
  • Animals
  • Computer Simulation
  • Databases, Protein*
  • Humans
  • Peptides / chemistry*
  • Peptides / genetics
  • Protein Isoforms / chemistry*
  • Protein Isoforms / genetics
  • Proteomics / methods*

Substances

  • Peptides
  • Protein Isoforms