Genome Rearrangements on Multigenomic Models: Applications of Graph Convexity Problems

J Comput Biol. 2019 Nov;26(11):1214-1222. doi: 10.1089/cmb.2019.0091. Epub 2019 May 22.

Abstract

Genome rearrangements are events where large blocks of DNA exchange pieces during evolution. The analysis of such events is a tool for understanding evolutionary genomics, in whose context many rearrangement distances have been proposed, based on finding the minimum number of rearrangements to transform one genome into another, using some predefined operation. However, when more than two genomes are considered, we have new challenging problems. Studying such problems from a combinatorial point of view has been shown to be a useful tool to approach such problems, for example, the reconstruction of phylogenetic trees. We focus on genome rearrangement problems related to graph convexity. Such an approach is in connection with some other well-known studies on multigenomic models, for example, those based on the median and on the closest string. We propose an association between graph convexities and genome rearrangements in such a way that graph convexity problems deal with input sets of vertices and try to answer questions concerning the closure of such inputs. The concept of closure is useful for studies on genome rearrangement by suggesting mechanisms to reduce the genomic search space. Regarding the computational complexity, and considering the Hamming distance on strings, we solve the following problems: decide if a given set is convex; compute the interval and the convex hull of a given set; and determine the convexity number, interval number, and hull number of a Hamming graph. All such problems are solved for three types of convexities: geodetic, monophonic, and P3. Considering the Cayley distance on permutations, we solve the convexity number and interval determination problems for the geodetic convexity.

Keywords: Cayley distance; Hamming distance; closest string problem; genome rearrangement; graph convexity; median problem.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Computational Biology / methods*
  • Evolution, Molecular*
  • Gene Rearrangement / genetics*
  • Genomics / methods
  • Phylogeny*