Flexible copula model for integrating correlated multi-omics data from single-cell experiments

Biometrics. 2023 Jun;79(2):1559-1572. doi: 10.1111/biom.13701. Epub 2022 Jun 22.

Abstract

With recent advances in technologies to profile multi-omics data at the single-cell level, integrative multi-omics data analysis has been increasingly popular. It is increasingly common that information such as methylation changes, chromatin accessibility, and gene expression are jointly collected in a single-cell experiment. In biomedical studies, it is often of interest to study the associations between various data types and to examine how these associations might change according to other factors such as cell types and gene regulatory components. However, since each data type usually has a distinct marginal distribution, joint analysis of these changes of associations using multi-omics data is statistically challenging. In this paper, we propose a flexible copula-based framework to model covariate-dependent correlation structures independent of their marginals. In addition, the proposed approach could jointly combine a wide variety of univariate marginal distributions, either discrete or continuous, including the class of zero-inflated distributions. The performance of the proposed framework is demonstrated through a series of simulation studies. Finally, it is applied to a set of experimental data to investigate the dynamic relationship between single-cell RNA sequencing, chromatin accessibility, and DNA methylation at different germ layers during mouse gastrulation.

Keywords: Gaussian copula regression; dynamic association; integrative multi-omics data analysis; liquid association; single-cell experiment; zero-inflated model.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Animals
  • Chromatin / genetics
  • Computer Simulation
  • DNA Methylation*
  • Mice
  • Multiomics*

Substances

  • Chromatin