MPF-BML: a standalone GUI-based package for maximum entropy model inference

Bioinformatics. 2020 Apr 1;36(7):2278-2279. doi: 10.1093/bioinformatics/btz925.

Abstract

Summary: Learning underlying correlation patterns in data is a central problem across scientific fields. Maximum entropy models present an important class of statistical approaches for addressing this problem. However, accurately and efficiently inferring model parameters are a major challenge, particularly for modern high-dimensional applications such as in biology, for which the number of parameters is enormous. Previously, we developed a statistical method, minimum probability flow-Boltzmann Machine Learning (MPF-BML), for performing fast and accurate inference of maximum entropy model parameters, which was applied to genetic sequence data to estimate the fitness landscape for the surface proteins of human immunodeficiency virus and hepatitis C virus. To facilitate seamless use of MPF-BML and encourage more widespread application to data in diverse fields, we present a standalone cross-platform package of MPF-BML which features an easy-to-use graphical user interface. The package only requires the input data (protein sequence data or data of multiple configurations of a complex system with large number of variables) and returns the maximum entropy model parameters.

Availability and implementation: The MPF-BML software is publicly available under the MIT License at https://github.com/ahmedaq/MPF-BML-GUI.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Entropy
  • Humans
  • Machine Learning
  • Proteins*
  • Software*

Substances

  • Proteins