Functional characterization and high-throughput proteomic analysis of interrupted genes in the archaeon Sulfolobus solfataricus

J Proteome Res. 2010 May 7;9(5):2496-507. doi: 10.1021/pr901166q.

Abstract

Sequenced genomes often reveal interrupted coding sequences that complicate the annotation process and the subsequent functional characterization of the genes. In the past, interrupted genes were generally considered to be the result of sequencing errors or pseudogenes, that is, gene remnants with little or no biological importance. However, recent lines of evidence support the hypothesis that these coding sequences can be functional; thus, it is crucial to understand whether interrupted genes are expressed in vivo. We addressed this issue by experimentally demonstrating the existence of functional disrupted genes in archaeal genomes. We discovered previously unknown disrupted genes that have interrupted homologues in distantly related species of archaea. The combination of a RT-PCR strategy with shotgun proteomics demonstrates that interrupted genes in the archaeon Sulfolobus solfataricus are expressed in vivo. In addition, the sequence of the peptides determined by LCMSMS and experiments of in vitro translation allows us to identify a gene expressed by programmed -1 frameshifting. Our findings will enable an accurate reinterpretation of archaeal interrupted genes shedding light on their function and on archaeal genome evolution.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Amino Acid Sequence
  • Archaeal Proteins / chemistry*
  • Archaeal Proteins / genetics
  • Base Sequence
  • Chromatography, Liquid
  • Genes, Archaeal*
  • High-Throughput Screening Assays / methods*
  • Molecular Sequence Data
  • Peptide Mapping
  • Proteome / analysis*
  • Proteomics / methods*
  • Pseudogenes
  • Reverse Transcriptase Polymerase Chain Reaction
  • Sulfolobus solfataricus / genetics*
  • Tandem Mass Spectrometry
  • Transketolase / chemistry
  • Transketolase / genetics

Substances

  • Archaeal Proteins
  • Proteome
  • Transketolase