Cell annotation using scRNA-seq data: A protein-protein interaction network approach

MethodsX. 2023 Apr 10:10:102179. doi: 10.1016/j.mex.2023.102179. eCollection 2023.

Abstract

Pathway analysis is an important step in the interpretation of single cell transcriptomic data, as it provides powerful information to detect which cellular processes are active in each individual cell. We have recently developed a protein-protein interaction network-based framework to quantify pluripotency associated pathways from scRNA-seq data. On this occasion, we extend this approach to quantify the activity of a pathway associated with any biological process, or even any list of genes. A systems-level characterization of pathway activities across multiple cell types provides a broadly applicable tool for the analysis of pathways in both healthy and disease conditions. Dysregulated cellular functions are a hallmark of a wide spectrum of human disorders, including cancer and autoimmune diseases. Here, we illustrate our method by analyzing various biological processes in healthy and cancer breast samples. Using this approach we found that tumor breast cells, even when they form a single group in the UMAP space, keep diverse biological programs active in a differentiated manner within the cluster.•We implement a protein-protein interaction network-based approach to quantify the activity of different biological processes.•The methodology can be used for cell annotation in scRNA-seq studies and is freely available as R package.

Keywords: Biological Processes; Breast cancer; Cell annotation; ORIGINS2; Protein-protein interaction networks; scRNA-seq.