Merging or ensembling: integrative analysis in multiple neuroimaging studies

Biometrics. 2024 Jan 29;80(1):ujae003. doi: 10.1093/biomtc/ujae003.

Abstract

The aim of this paper is to systematically investigate merging and ensembling methods for spatially varying coefficient mixed effects models (SVCMEM) in order to carry out integrative learning of neuroimaging data obtained from multiple biomedical studies. The "merged" approach involves training a single learning model using a comprehensive dataset that encompasses information from all the studies. Conversely, the "ensemble" approach involves creating a weighted average of distinct learning models, each developed from an individual study. We systematically investigate the prediction accuracy of the merged and ensemble learners under the presence of different degrees of interstudy heterogeneity. Additionally, we establish asymptotic guidelines for making strategic decisions about when to employ either of these models in different scenarios, along with deriving optimal weights for the ensemble learner. To validate our theoretical results, we perform extensive simulation studies. The proposed methodology is also applied to 3 large-scale neuroimaging studies.

Keywords: ensemble learner; interstudy heterogeneity; merged learner; neuroimaging; spatially varying coefficient mixed effects model.

MeSH terms

  • Computer Simulation
  • Learning*
  • Neuroimaging*