Full-Span Log-Linear Model and Fast Learning Algorithm

Kazuya Takabatake; Shotaro Akaho

doi:10.1162/neco_a_01496

Full-Span Log-Linear Model and Fast Learning Algorithm

Neural Comput. 2022 May 19;34(6):1398-1424. doi: 10.1162/neco_a_01496.

Authors

Kazuya Takabatake¹, Shotaro Akaho²

Affiliations

¹ Human Informatics and Interaction Research Institute, National Institute of Advanced Industrial Science and Technology, Tsukuba, 305-8568, Japan k.takabatake@aist.go.jp.
² Human Informatics and Interaction Research Institute, National Institute of Advanced Industrial Science and Technology, Tsukuba, 305-8568, Japan s.akaho@aist.go.jp.

PMID: 35534007
DOI: 10.1162/neco_a_01496

Abstract

The full-span log-linear (FSLL) model introduced in this letter is considered an nth order Boltzmann machine, where n is the number of all variables in the target system. Let X=(X0,…,Xn-1) be finite discrete random variables that can take |X|=|X0|…|Xn-1| different values. The FSLL model has |X|-1 parameters and can represent arbitrary positive distributions of X. The FSLL model is a highest-order Boltzmann machine; nevertheless, we can compute the dual parameter of the model distribution, which plays important roles in exponential families in O(|X|log|X|) time. Furthermore, using properties of the dual parameters of the FSLL model, we can construct an efficient learning algorithm. The FSLL model is limited to small probabilistic models up to |X|≈225; however, in this problem domain, the FSLL model flexibly fits various true distributions underlying the training data without any hyperparameter tuning. The experiments showed that the FSLL successfully learned six training data sets such that |X|=220 within 1 minute with a laptop PC.