Random sequencing of Paramecium somatic DNA

Eukaryot Cell. 2002 Jun;1(3):341-52. doi: 10.1128/EC.1.3.341-352.2002.

Abstract

We report a random survey of 1 to 2% of the somatic genome of the free-living ciliate Paramecium tetraurelia by single-run sequencing of the ends of plasmid inserts. As in all ciliates, the germ line genome of Paramecium (100 to 200 Mb) is reproducibly rearranged at each sexual cycle to produce a somatic genome of expressed or potentially expressed genes, stripped of repeated sequences, transposons, and AT-rich unique sequence elements limited to the germ line. We found the somatic genome to be compact (>68% coding, estimated from the sequence of several complete library inserts) and to feature uniformly small introns (18 to 35 nucleotides). This facilitated gene discovery: 722 open reading frames (ORFs) were identified by similarity with known proteins, and 119 novel ORFs were tentatively identified by internal comparison of the data set. We determined the phylogenetic position of Paramecium with respect to eukaryotes whose genomes have been sequenced by the distance matrix neighbor-joining method by using random combined protein data from the project. The unrooted tree obtained is very robust and in excellent agreement with accepted topology, providing strong support for the quality and consistency of the data set. Our study demonstrates that a random survey of the somatic genome of Paramecium is a good strategy for gene discovery in this organism.

Publication types

  • Comparative Study

MeSH terms

  • Amino Acid Sequence
  • Animals
  • Base Sequence
  • Codon / genetics
  • DNA, Protozoan / genetics*
  • Databases, Nucleic Acid
  • Genome, Protozoan
  • Introns
  • Models, Molecular
  • Molecular Sequence Data
  • Open Reading Frames
  • Paramecium tetraurelia / genetics*
  • Phylogeny
  • Protein Structure, Tertiary
  • Proteome
  • Protozoan Proteins / chemistry
  • Protozoan Proteins / genetics
  • Sequence Analysis, DNA

Substances

  • Codon
  • DNA, Protozoan
  • Proteome
  • Protozoan Proteins