Mimivirus reveals Mre11/Rad50 fusion proteins with a sporadic distribution in eukaryotes, bacteria, viruses and plasmids

Virol J. 2011 Sep 7:8:427. doi: 10.1186/1743-422X-8-427.

Abstract

Background: The Mre11/Rad50 complex and the homologous SbcD/SbcC complex in bacteria play crucial roles in the metabolism of DNA double-strand breaks, including DNA repair, genome replication, homologous recombination and non-homologous end-joining in cellular life forms and viruses. Here we investigated the amino acid sequence of the Mimivirus R555 gene product, originally annotated as a Rad50 homolog, and later shown to have close homologs in marine microbial metagenomes.

Results: Our bioinformatics analysis revealed that R555 protein sequence is constituted from the fusion of an N-terminal Mre11-like domain with a C-terminal Rad50-like domain. A systematic database search revealed twelve additional cases of Mre11/Rad50 (or SbcD/SbcC) fusions in a wide variety of unrelated organisms including unicellular and multicellular eukaryotes, the megaplasmid of a bacterium associated to deep-sea hydrothermal vents (Deferribacter desulfuricans) and the plasmid of Clostridium kluyveri. We also showed that R555 homologs are abundant in the metagenomes from different aquatic environments and that they most likely belong to aquatic viruses. The observed phyletic distribution of these fusion proteins suggests their recurrent creation and lateral gene transfers across organisms.

Conclusions: The existence of the fused version of protein sequences is consistent with known functional interactions between Mre11 and Rad50, and the gene fusion probably enhanced the opportunity for lateral transfer. The abundance of the Mre11/Rad50 fusion genes in viral metagenomes and their sporadic phyletic distribution in cellular organisms suggest that viruses, plasmids and transposons played a crucial role in the formation of the fusion proteins and their propagation into cellular genomes.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acid Anhydride Hydrolases
  • Animals
  • Aquatic Organisms / genetics*
  • Archaeal Proteins / genetics
  • Bacteria / genetics
  • Bacterial Proteins / genetics
  • Computational Biology
  • DNA Breaks, Double-Stranded
  • DNA Repair
  • DNA Repair Enzymes / chemistry
  • DNA Repair Enzymes / genetics*
  • DNA-Binding Proteins / chemistry
  • DNA-Binding Proteins / genetics*
  • Databases, Genetic
  • Deoxyribonucleases / genetics
  • Endodeoxyribonucleases / chemistry
  • Endodeoxyribonucleases / genetics*
  • Escherichia coli Proteins / genetics
  • Exodeoxyribonucleases / chemistry
  • Exodeoxyribonucleases / genetics*
  • Exonucleases / genetics
  • Gene Transfer, Horizontal
  • Genome, Viral*
  • Humans
  • Metagenome*
  • Metagenomics
  • Mimiviridae / chemistry
  • Mimiviridae / genetics*
  • Phylogeny
  • Plasmids / chemistry
  • Plasmids / genetics
  • Saccharomyces cerevisiae Proteins / chemistry
  • Saccharomyces cerevisiae Proteins / genetics*
  • Viral Fusion Proteins / chemistry
  • Viral Fusion Proteins / genetics*
  • Viral Proteins / chemistry
  • Viral Proteins / genetics*

Substances

  • Archaeal Proteins
  • Bacterial Proteins
  • DNA-Binding Proteins
  • Escherichia coli Proteins
  • Saccharomyces cerevisiae Proteins
  • SbcC protein, Bacteria
  • Viral Fusion Proteins
  • Viral Proteins
  • Deoxyribonucleases
  • Endodeoxyribonucleases
  • Exodeoxyribonucleases
  • Exonucleases
  • MRE11 protein, S cerevisiae
  • sbcD protein, E coli
  • Acid Anhydride Hydrolases
  • RAD50 protein, human
  • DNA Repair Enzymes