SuPreMo: a computational tool for streamlining in silico perturbation using sequence-based predictive models

bioRxiv [Preprint]. 2023 Nov 5:2023.11.03.565556. doi: 10.1101/2023.11.03.565556.

Abstract

Computationally editing genome sequences is a common bioinformatics task, but current approaches have limitations, such as incompatibility with structural variants, challenges in identifying responsible sequence perturbations, and the need for vcf file inputs and phased data. To address these bottlenecks, we present Sequence Mutator for Predictive Models (SuPreMo), a scalable and comprehensive tool for performing in silico mutagenesis. We then demonstrate how pairs of reference and perturbed sequences can be used with machine learning models to prioritize pathogenic variants or discover new functional sequences.

Publication types

  • Preprint