A Path-Deformation Framework for Determining Weighted Genome Rearrangement Distance

Front Genet. 2020 Sep 24:11:1035. doi: 10.3389/fgene.2020.01035. eCollection 2020.

Abstract

Measuring the distance between two bacterial genomes under the inversion process is usually done by assuming all inversions to occur with equal probability. Recently, an approach to calculating inversion distance using group theory was introduced, and is effective for the model in which only very short inversions occur. In this paper, we show how to use the group-theoretic framework to establish minimal distance for any weighting on the set of inversions, generalizing previous approaches. To do this we use the theory of rewriting systems for groups, and exploit the Knuth-Bendix algorithm, the first time this theory has been introduced into genome rearrangement problems. The central idea of the approach is to use existing group theoretic methods to find an initial path between two genomes in genome space (for instance using only short inversions), and then to deform this path to optimality using a confluent system of rewriting rules generated by the Knuth-Bendix algorithm.

Keywords: Knuth-Bendix algorithm; genome rearrangement; group theory; inversion; rewriting systems.