Full-length transcriptome-based H-InvDB throws a new light on chromosome-centric proteomics

J Proteome Res. 2013 Jan 4;12(1):62-6. doi: 10.1021/pr300861a. Epub 2012 Dec 17.

Abstract

H-Invitational Database (H-InvDB; http://hinv.jp/ ) is an integrated database of all human genes and transcripts that started in an international collaborative research project for establishing a functional annotation database of human full-length cDNAs. Because H-InvDB contains an abundance of information for human transcripts, including not only well-characterized protein-coding transcripts but also those without experimental evidence at the protein level, this will be a useful information resource for identifying novel and uncharacterized human proteins (so-called missing proteins). By extending predicted protein data in H-InvDB, we developed the H-Inv Extended Protein Database (H-EPD; http://hinv.jp/hinv/h-epd/ ). From now on, we plan to carry out a database-driven proteome research that makes full use of H-EPD to promote discoveries in the current and future C-HPP. Furthermore, we will push forward with the integration of genome, transcriptome, and proteome databases using a unique tool for connecting distributed databases and would like to develop a knowledge discovery system by incorporating data mining tools.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA, Complementary* / genetics
  • DNA, Complementary* / metabolism
  • Databases, Factual
  • Gene Expression
  • Gene Expression Profiling*
  • Genome, Human
  • Human Genome Project
  • Humans
  • Mass Spectrometry
  • Proteins* / classification
  • Proteins* / genetics
  • Proteins* / metabolism
  • Proteome*

Substances

  • DNA, Complementary
  • Proteins
  • Proteome