Fundamental limits in structured principal component analysis and how to reach them

Jean Barbier; Francesco Camilli; Marco Mondelli; Manuel Sáenz

doi:10.1073/pnas.2302028120

Fundamental limits in structured principal component analysis and how to reach them

Proc Natl Acad Sci U S A. 2023 Jul 25;120(30):e2302028120. doi: 10.1073/pnas.2302028120. Epub 2023 Jul 18.

Authors

Jean Barbier¹, Francesco Camilli¹, Marco Mondelli², Manuel Sáenz³

Affiliations

¹ Quantitative Life Sciences and Mathematics Sections, International Centre for Theoretical Physics, Trieste 34151, Italy.
² Institute of Science and Technology Austria, Klosterneuburg 3400, Austria.
³ Centro de Matemática, Universidad de La República, Montevideo 11400, Uruguay.

Abstract

How do statistical dependencies in measurement noise influence high-dimensional inference? To answer this, we study the paradigmatic spiked matrix model of principal components analysis (PCA), where a rank-one matrix is corrupted by additive noise. We go beyond the usual independence assumption on the noise entries, by drawing the noise from a low-order polynomial orthogonal matrix ensemble. The resulting noise correlations make the setting relevant for applications but analytically challenging. We provide characterization of the Bayes optimal limits of inference in this model. If the spike is rotation invariant, we show that standard spectral PCA is optimal. However, for more general priors, both PCA and the existing approximate message-passing algorithm (AMP) fall short of achieving the information-theoretic limits, which we compute using the replica method from statistical physics. We thus propose an AMP, inspired by the theory of adaptive Thouless-Anderson-Palmer equations, which is empirically observed to saturate the conjectured theoretical limit. This AMP comes with a rigorous state evolution analysis tracking its performance. Although we focus on specific noise distributions, our methodology can be generalized to a wide class of trace matrix ensembles at the cost of more involved expressions. Finally, despite the seemingly strong assumption of rotation-invariant noise, our theory empirically predicts algorithmic performance on real data, pointing at strong universality properties.

Keywords: approximate message passing; high-dimensional inference; principal components analysis; replica method; structured data.

Abstract

Grants and funding