Comprehensive Multiview Representation Learning via Deep Autoencoder-Like Nonnegative Matrix Factorization

IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):5953-5967. doi: 10.1109/TNNLS.2023.3304626. Epub 2024 May 2.

Abstract

Learning a comprehensive representation from multiview data is crucial in many real-world applications. Multiview representation learning (MRL) based on nonnegative matrix factorization (NMF) has been widely adopted because it projects high-dimensional data into a lower-dimensional space with good interpretability. However, most prior NMF-based MRL techniques are shallow models that ignore hierarchical information. Although deep matrix factorization (DMF)-based methods have been proposed recently, most of them focus only on the consistency of multiple views and involve cumbersome clustering steps. To address the above issues, in this article, we propose a novel model termed deep autoencoder-like NMF for MRL (DANMF-MRL), which obtains the representation matrix through a deep encoding stage and decodes it back to the original data. In this way, through a DANMF-based framework, we can simultaneously consider multiview consistency and complementarity, allowing for a more comprehensive representation. We further propose a one-step DANMF-MRL, which learns the latent representation and the final clustering label matrix in a unified framework. In this approach, the two steps can negotiate with each other to fully exploit the latent clustering structure, avoid the tedious separate clustering step of previous methods, and achieve optimal clustering performance. Furthermore, two efficient iterative optimization algorithms are developed to solve the proposed models, both with theoretical convergence analysis. Extensive experiments on five benchmark datasets demonstrate the superiority of our approaches against other state-of-the-art MRL methods.
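
To make the autoencoder-like NMF idea concrete, the sketch below shows a minimal single-view version under illustrative assumptions: the data X is approximated by a deep decoder product W1 W2 ... Wp H while the representation H is simultaneously tied to the encoder product Wp^T ... W1^T X, and all factors are kept nonnegative. This is not the paper's exact algorithm or update rules; it uses plain projected gradient descent, and the function name, layer sizes, and step size are hypothetical choices for the example. The multiview coupling and the one-step clustering term of DANMF-MRL are omitted.

```python
import numpy as np

def danmf_sketch(X, layer_sizes, n_iter=300, lr=1e-4, seed=0):
    """Minimal single-view sketch of autoencoder-like deep NMF (illustrative only).

    Objective (assumed for this sketch, not the paper's exact formulation):
        0.5*||X - W1...Wp H||_F^2 + 0.5*||H - Wp^T...W1^T X||_F^2
    minimized by projected gradient descent with nonnegativity clipping.
    """
    rng = np.random.default_rng(seed)
    d, n = X.shape
    dims = [d] + list(layer_sizes)                      # e.g. [d, 100, 50]
    W = [0.1 * rng.random((dims[i], dims[i + 1])) for i in range(len(layer_sizes))]
    H = 0.1 * rng.random((dims[-1], n))

    def chain(mats, rows):
        # Product of a list of matrices; identity when the list is empty.
        if not mats:
            return np.eye(rows)
        return mats[0] if len(mats) == 1 else np.linalg.multi_dot(mats)

    for _ in range(n_iter):
        # --- update the representation H with the layer factors fixed ---
        D = chain(W, d)                                 # decoder product W1 W2 ... Wp
        grad_H = D.T @ (D @ H - X) + (H - D.T @ X)
        H = np.maximum(H - lr * grad_H, 0.0)            # projected (nonnegative) step

        # --- update each layer factor Wi with H fixed ---
        R_dec = D @ H - X                               # decoder residual
        R_enc = H - D.T @ X                             # encoder residual
        grad_D = R_dec @ H.T - X @ R_enc.T              # gradient w.r.t. the full product
        for i in range(len(W)):
            L = chain(W[:i], d)                         # W1 ... W_{i-1}
            Rm = chain(W[i + 1:], dims[i + 1])          # W_{i+1} ... Wp
            # Chain rule through the product; grad_D is reused across the sweep for simplicity.
            W[i] = np.maximum(W[i] - lr * (L.T @ grad_D @ Rm.T), 0.0)

    return W, H
```

In a multiview setting, one could run such a factorization per view and couple the per-view representations through consistency and complementarity terms, as the paper describes at a high level; the final representation H (or a shared variant of it) would then be fed to, or jointly optimized with, a clustering objective in the one-step model.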