Tree diet: reducing the treewidth to unlock FPT algorithms in RNA bioinformatics

Algorithms Mol Biol. 2022 Apr 2;17(1):8. doi: 10.1186/s13015-022-00213-z.

Abstract

Hard graph problems are ubiquitous in Bioinformatics, inspiring the design of specialized Fixed-Parameter Tractable algorithms, many of which rely on a combination of tree-decomposition and dynamic programming. The time/space complexities of such approaches hinge critically on low values for the treewidth tw of the input graph. In order to extend their scope of applicability, we introduce the TREE-DIET problem, i.e. the removal of a minimal set of edges such that a given tree-decomposition can be slimmed down to a prescribed treewidth [Formula: see text]. Our rationale is that the time gained thanks to a smaller treewidth in a parameterized algorithm compensates the extra post-processing needed to take deleted edges into account. Our core result is an FPT dynamic programming algorithm for TREE-DIET, using [Formula: see text] time and space. We complement this result with parameterized complexity lower-bounds for stronger variants (e.g., NP-hardness when [Formula: see text] or [Formula: see text] is constant). We propose a prototype implementation for our approach which we apply on difficult instances of selected RNA-based problems: RNA design, sequence-structure alignment, and search of pseudoknotted RNAs in genomes, revealing very encouraging results. This work paves the way for a wider adoption of tree-decomposition-based algorithms in Bioinformatics.

Keywords: FPT algorithms; RNA; RNA design; structure sequence alignment; treewidth.