Generalization in quantum machine learning from few training data

Matthias C Caro; Hsin-Yuan Huang; M Cerezo; Kunal Sharma; Andrew Sornborger; Lukasz Cincio; Patrick J Coles

doi:10.1038/s41467-022-32550-3

Generalization in quantum machine learning from few training data

Nat Commun. 2022 Aug 22;13(1):4919. doi: 10.1038/s41467-022-32550-3.

Authors

Matthias C Caro^{1

2}, Hsin-Yuan Huang^{3

4}, M Cerezo^{5

6}, Kunal Sharma⁷, Andrew Sornborger^{5

8}, Lukasz Cincio⁹, Patrick J Coles⁹

Affiliations

¹ Department of Mathematics, Technical University of Munich, Garching, Germany. caro@ma.tum.de.
² Munich Center for Quantum Science and Technology (MCQST), Munich, Germany. caro@ma.tum.de.
³ Institute for Quantum Information and Matter, Caltech, Pasadena, CA, USA.
⁴ Department of Computing and Mathematical Sciences, Caltech, Pasadena, CA, USA.
⁵ Information Sciences, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA.
⁶ Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA.
⁷ Joint Center for Quantum Information and Computer Science, University of Maryland, College Park, MD, 20742, USA.
⁸ Quantum Science Center, Oak Ridge, TN, 37931, USA.
⁹ Theoretical Division, Los Alamos National Laboratory, Los Alamos, NM, 87545, USA.

Abstract

Modern quantum machine learning (QML) methods involve variationally optimizing a parameterized quantum circuit on a training data set, and subsequently making predictions on a testing data set (i.e., generalizing). In this work, we provide a comprehensive study of generalization performance in QML after training on a limited number N of training data points. We show that the generalization error of a quantum machine learning model with T trainable gates scales at worst as [Formula: see text]. When only K ≪ T gates have undergone substantial change in the optimization process, we prove that the generalization error improves to [Formula: see text]. Our results imply that the compiling of unitaries into a polynomial number of native gates, a crucial application for the quantum computing industry that typically uses exponential-size training data, can be sped up significantly. We also show that classification of quantum states across a phase transition with a quantum convolutional neural network requires only a very small training data set. Other potential applications include learning quantum error correcting codes or quantum dynamical simulation. Our work injects new hope into the field of QML, as good generalization is guaranteed from few training data.

Abstract

Grants and funding