Convolutional Analysis Operator Learning: Dependence on Training Data

Il Yong Chun; David Hong; Ben Adcock; Jeffrey A Fessler

doi:10.1109/lsp.2019.2921446

Convolutional Analysis Operator Learning: Dependence on Training Data

IEEE Signal Process Lett. 2019 Aug;26(8):1137-1141. doi: 10.1109/lsp.2019.2921446. Epub 2019 Jun 7.

Authors

Il Yong Chun¹, David Hong¹, Ben Adcock², Jeffrey A Fessler¹

Affiliations

¹ Department of Electrical Engineering and Computer Science, The University of Michigan, Ann Arbor, MI 48019 USA.
² Department of Mathematics, Simon Fraser University, Burnaby, BC V5A 1S6 Canada.

Abstract

Convolutional analysis operator learning (CAOL) enables the unsupervised training of (hierarchical) convolutional sparsifying operators or autoencoders from large datasets. One can use many training images for CAOL, but a precise understanding of the impact of doing so has remained an open question. This paper presents a series of results that lend insight into the impact of dataset size on the filter update in CAOL. The first result is a general deterministic bound on errors in the estimated filters, and is followed by a bound on the expected errors as the number of training samples increases. The second result provides a high probability analogue. The bounds depend on properties of the training data, and we investigate their empirical values with real data. Taken together, these results provide evidence for the potential benefit of using more training data in CAOL.

Grants and funding

U01 EB018753/EB/NIBIB NIH HHS/United States