An effective method for detecting gene conversion events in whole genomes

J Comput Biol. 2010 Sep;17(9):1281-97. doi: 10.1089/cmb.2010.0103.

Abstract

Gene conversion events are often overlooked in analyses of genome evolution. In a conversion event, an interval of DNA sequence (not necessarily containing a gene) overwrites a highly similar sequence. The event creates relationships among genomic intervals that can confound attempts to identify orthologs and to transfer functional annotation between genomes. Here we examine 1,616,329 paralogous pairs of mouse genomic intervals, and detect conversion events in about 7.5% of them. Properties of the putative gene conversions are analyzed, such as the lengths of the paralogous pairs and the spacing between their sources and targets. Our approach is illustrated using conversion events in primate CCL gene clusters. Source code for our program is included in the 3SEQ_2D package, which is freely available at www.bx.psu.edu/miller_lab/ .

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, N.I.H., Intramural

MeSH terms

  • Algorithms
  • Animals
  • Base Sequence
  • Evolution, Molecular
  • Gene Conversion*
  • Genome*
  • Mice
  • Molecular Sequence Data
  • Multigene Family
  • Phylogeny
  • Primates / genetics
  • Sequence Alignment
  • Sequence Analysis, DNA / methods*