Generalized Low-Rank Update: Model Parameter Bounds for Low-Rank Training Data Modifications

Hiroyuki Hanada; Noriaki Hashimoto; Kouichi Taji; Ichiro Takeuchi

doi:10.1162/neco_a_01619

Generalized Low-Rank Update: Model Parameter Bounds for Low-Rank Training Data Modifications

Neural Comput. 2023 Nov 7;35(12):1970-2005. doi: 10.1162/neco_a_01619.

Authors

Hiroyuki Hanada¹, Noriaki Hashimoto², Kouichi Taji³, Ichiro Takeuchi^{4

5}

Affiliations

¹ Center for Advanced Intelligence Project, RIKEN, Tokyo 103-0027, Japan hiroyuki.hanada@riken.jp.
² Center for Advanced Intelligence Project, RIKEN, Tokyo 103-0027, Japan noriaki.hashimoto.jv@riken.jp.
³ Department of Mechanical Systems Engineering, Nagoya University, Nagoya 464-8603, Japan taji@nagoya-u.jp.
⁴ Department of Mechanical Systems Engineering, Nagoya University, Nagoya 464-8603, Japan.
⁵ Center for Advanced Intelligence Project, RIKEN, Tokyo 103-0027, Japan ichiro.takeuchi@mae.nagoya-u.ac.jp.

PMID: 37844324
DOI: 10.1162/neco_a_01619

Abstract

In this study, we have developed an incremental machine learning (ML) method that efficiently obtains the optimal model when a small number of instances or features are added or removed. This problem holds practical importance in model selection, such as cross-validation (CV) and feature selection. Among the class of ML methods known as linear estimators, there exists an efficient model update framework, the low-rank update, that can effectively handle changes in a small number of rows and columns within the data matrix. However, for ML methods beyond linear estimators, there is currently no comprehensive framework available to obtain knowledge about the updated solution within a specific computational complexity. In light of this, our study introduces a the generalized low-rank update (GLRU) method, which extends the low-rank update framework of linear estimators to ML methods formulated as a certain class of regularized empirical risk minimization, including commonly used methods such as support vector machines and logistic regression. The proposed GLRU method not only expands the range of its applicability but also provides information about the updated solutions with a computational complexity proportional to the number of data set changes. To demonstrate the effectiveness of the GLRU method, we conduct experiments showcasing its efficiency in performing cross-validation and feature selection compared to other baseline methods.