Biclustering analysis of functionals via penalized fusion

Kuangnan Fang; Yuanxing Chen; Shuangge Ma; Qingzhao Zhang

doi:10.1016/j.jmva.2021.104874

Biclustering analysis of functionals via penalized fusion

J Multivar Anal. 2022 May:189:104874. doi: 10.1016/j.jmva.2021.104874. Epub 2021 Oct 29.

Authors

Kuangnan Fang¹, Yuanxing Chen¹, Shuangge Ma², Qingzhao Zhang³

Affiliations

¹ Department of Statistics and Data Science, School of Economics, Xiamen University, China.
² Department of Biostatistics, Yale University, United States of America.
³ MOE Key Laboratory of Econometrics, Department of Statistics and Data Science, School of Economics, Wang Yanan Institute for Studies in Economics, and Fujian Key Lab of Statistics, Xiamen University, China.

Abstract

In biomedical data analysis, clustering is commonly conducted. Biclustering analysis conducts clustering in both the sample and covariate dimensions and can more comprehensively describe data heterogeneity. In most of the existing biclustering analyses, scalar measurements are considered. In this study, motivated by time-course gene expression data and other examples, we take the "natural next step" and consider the biclustering analysis of functionals under which, for each covariate of each sample, a function (to be exact, its values at discrete measurement points) is present. We develop a doubly penalized fusion approach, which includes a smoothness penalty for estimating functionals and, more importantly, a fusion penalty for clustering. Statistical properties are rigorously established, providing the proposed approach a strong ground. We also develop an effective ADMM algorithm and accompanying R code. Numerical analysis, including simulations, comparisons, and the analysis of two time-course gene expression data, demonstrates the practical effectiveness of the proposed approach.

Keywords: 62R10; Biclustering; Functional data; Penalized fusion; primary 62H30.

Grants and funding

R01 CA204120/CA/NCI NIH HHS/United States