Identification of the Structure of a Probabilistic Boolean Network From Samples Including Frequencies of Outcomes

IEEE Trans Neural Netw Learn Syst. 2019 Aug;30(8):2383-2396. doi: 10.1109/TNNLS.2018.2884454. Epub 2018 Dec 24.

Abstract

We study the problem of identifying the structure of a probabilistic Boolean network (PBN), a probabilistic model of biological networks, from a given set of samples. This problem can be regarded as an identification of a set of Boolean functions from samples. Existing studies on the identification of the structure of a PBN only use information on the occurrences of samples. In this paper, we also make use of the frequencies of occurrences of subtuples, information that is obtainable from the samples. We show that under this model, it is possible to identify a PBN from among a class of PBNs, for much broader classes of PBNs. In particular, we prove that, under a reasonable assumption, the structure of a PBN can be identified from among the class of PBNs that have at most three functions assigned to each node, but that identification may be impossible if four or more functions are assigned to each node. We also analyze the sample complexity for exactly identifying the structure of a PBN, and present an efficient algorithm for the identification of a PBN consisting of threshold functions from samples.

Publication types

  • Research Support, Non-U.S. Gov't