Adaptive dimensionality reduction for neural network-based online principal component analysis

PLoS One. 2021 Mar 30;16(3):e0248896. doi: 10.1371/journal.pone.0248896. eCollection 2021.

Abstract

"Principal Component Analysis" (PCA) is an established linear technique for dimensionality reduction. It performs an orthonormal transformation to replace possibly correlated variables with a smaller set of linearly independent variables, the so-called principal components, which capture a large portion of the data variance. The problem of finding the optimal number of principal components has been widely studied for offline PCA. However, when working with streaming data, the optimal number changes continuously. This requires to update both the principal components and the dimensionality in every timestep. While the continuous update of the principal components is widely studied, the available algorithms for dimensionality adjustment are limited to an increment of one in neural network-based and incremental PCA. Therefore, existing approaches cannot account for abrupt changes in the presented data. The contribution of this work is to enable in neural network-based PCA the continuous dimensionality adjustment by an arbitrary number without the necessity to learn all principal components. A novel algorithm is presented that utilizes several PCA characteristics to adaptivly update the optimal number of principal components for neural network-based PCA. A precise estimation of the required dimensionality reduces the computational effort while ensuring that the desired amount of variance is kept. The computational complexity of the proposed algorithm is investigated and it is benchmarked in an experimental study against other neural network-based and incremental PCA approaches where it produces highly competitive results.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Neural Networks, Computer*
  • Principal Component Analysis*

Grants and funding

This work was supported by the "Europäischer Fonds für regionale Entwicklung Nordrhein-Westfalen" (EFRE-NRW - https://www.efre.nrw.de) funding programme "Forschungsinfrastrukturen" (grant no. 34.EFRE-0300119). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.