Advancing Trypanosoma brucei genome annotation through ribosome profiling and spliced leader mapping

Mol Biochem Parasitol. 2015 Aug;202(2):1-10. doi: 10.1016/j.molbiopara.2015.09.002. Epub 2015 Sep 21.

Abstract

Since the initial publication of the trypanosomatid genomes, curation has been ongoing. Here we make use of existing Trypanosoma brucei ribosome profiling data to provide evidence of ribosome occupancy (and likely translation) of mRNAs from 225 currently unannotated coding sequences (CDSs). A small number of these putative genes correspond to extra copies of previously annotated genes, but 85% are novel. The median size of these novels CDSs is small (81 aa), indicating that past annotation work has excelled at detecting large CDSs. Of the unique CDSs confirmed here, over half have candidate orthologues in other trypanosomatid genomes, most of which were not yet annotated as protein-coding genes. Nonetheless, approximately one-third of the new CDSs were found only in T. brucei subspecies. Using ribosome footprints, RNA-Seq and spliced leader mapping data, we updated previous work to definitively revise the start sites for 414 CDSs as compared to the current gene models. The data pointed to several regions of the genome that had sequence errors that altered coding region boundaries. Finally, we consolidated this data with our previous work to propose elimination of 683 putative genes as protein-coding and arrive at a view of the translatome of slender bloodstream and procyclic culture form T. brucei.

Keywords: De novo gene evolution; Genome annotation; Ribosome profiling; Translation; Trypanosomes.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Codon, Initiator / genetics*
  • Evolution, Molecular
  • Genes, Protozoan*
  • Molecular Sequence Annotation
  • Open Reading Frames / genetics
  • RNA, Spliced Leader / genetics*
  • Ribosomes / metabolism*
  • Sequence Analysis, RNA
  • Trypanosoma brucei brucei / genetics*

Substances

  • Codon, Initiator
  • RNA, Spliced Leader