Interpretable factor models of single-cell RNA-seq via variational autoencoders

Valentine Svensson; Adam Gayoso; Nir Yosef; Lior Pachter

doi:10.1093/bioinformatics/btaa169

Interpretable factor models of single-cell RNA-seq via variational autoencoders

Bioinformatics. 2020 Jun 1;36(11):3418-3421. doi: 10.1093/bioinformatics/btaa169.

Authors

Valentine Svensson¹, Adam Gayoso², Nir Yosef^{2

3

4}, Lior Pachter^{1

5}

Affiliations

¹ Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, CA 91125, USA.
² Center for Computational Biology.
³ Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, CA 91125, USA.
⁴ Chan Zuckerberg Biohub, San Francisco, CA 94158, USA.
⁵ Department of Computing and Mathematical Sciences, California Institute of Technology, Pasadena, CA 91125, USA.

Abstract

Motivation: Single-cell RNA-seq makes possible the investigation of variability in gene expression among cells, and dependence of variation on cell type. Statistical inference methods for such analyses must be scalable, and ideally interpretable.

Results: We present an approach based on a modification of a recently published highly scalable variational autoencoder framework that provides interpretability without sacrificing much accuracy. We demonstrate that our approach enables identification of gene programs in massive datasets. Our strategy, namely the learning of factor models with the auto-encoding variational Bayes framework, is not domain specific and may be useful for other applications.

Availability and implementation: The factor model is available in the scVI package hosted at https://github.com/YosefLab/scVI/.

Contact: v@nxn.se.

Supplementary information: Supplementary data are available at Bioinformatics online.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Bayes Theorem
Exome Sequencing
RNA-Seq*
Sequence Analysis, RNA
Single-Cell Analysis*
Software

Abstract

Publication types

MeSH terms

Grants and funding