A Normalization-Free and Nonparametric Method Sharpens Large-Scale Transcriptome Analysis and Reveals Common Gene Alteration Patterns in Cancers

Qi-Gang Li; Yong-Han He; Huan Wu; Cui-Ping Yang; Shao-Yan Pu; Song-Qing Fan; Li-Ping Jiang; Qiu-Shuo Shen; Xiao-Xiong Wang; Xiao-Qiong Chen; Qin Yu; Ying Li; Chang Sun; Xiangting Wang; Jumin Zhou; Hai-Peng Li; Yong-Bin Chen; Qing-Peng Kong

doi:10.7150/thno.19425

A Normalization-Free and Nonparametric Method Sharpens Large-Scale Transcriptome Analysis and Reveals Common Gene Alteration Patterns in Cancers

Theranostics. 2017 Jul 8;7(11):2888-2899. doi: 10.7150/thno.19425. eCollection 2017.

Authors

Qi-Gang Li^{1

2}, Yong-Han He^{1

2}, Huan Wu^{1

2

3}, Cui-Ping Yang⁴, Shao-Yan Pu^{1

2}, Song-Qing Fan⁵, Li-Ping Jiang^{4

3}, Qiu-Shuo Shen^{4

3}, Xiao-Xiong Wang^{1

2

3}, Xiao-Qiong Chen^{1

2}, Qin Yu^{1

2

3}, Ying Li⁶, Chang Sun⁷, Xiangting Wang⁸, Jumin Zhou⁴, Hai-Peng Li⁹, Yong-Bin Chen⁴, Qing-Peng Kong^{1

2}

Affiliations

¹ State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.
² KIZ/CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming 650223, China.
³ Kunming College of Life Science, University of Chinese Academy of Sciences, Beijing 100049, China.
⁴ Key Laboratory of Animal Models and Human Disease Mechanisms, Kunming Institute of Zoology, Chinese Academy of Sciences, Kunming 650223, China.
⁵ Department of Pathology, the Second Xiangya Hospital, Central South University, Changsha 410013, China.
⁶ Farm Animal Genetic Resources Exploration and Innovation Key Laboratory of Sichuan Province, Sichuan Agricultural University, Chengdu 611130, China.
⁷ Laboratory for Conservation and Utilization of Bio-Resources, Yunnan University, Kunming 650091, China.
⁸ School of Life Sciences, University of Science and Technology of China, Hefei 230027, China.
⁹ Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China.

Abstract

Heterogeneity in transcriptional data hampers the identification of differentially expressed genes (DEGs) and understanding of cancer, essentially because current methods rely on cross-sample normalization and/or distribution assumption-both sensitive to heterogeneous values. Here, we developed a new method, Cross-Value Association Analysis (CVAA), which overcomes the limitation and is more robust to heterogeneous data than the other methods. Applying CVAA to a more complex pan-cancer dataset containing 5,540 transcriptomes discovered numerous new DEGs and many previously rarely explored pathways/processes; some of them were validated, both in vitro and in vivo, to be crucial in tumorigenesis, e.g., alcohol metabolism (ADH1B), chromosome remodeling (NCAPH) and complement system (Adipsin). Together, we present a sharper tool to navigate large-scale expression data and gain new mechanistic insights into tumorigenesis.

Keywords: Cross-Value Association Analysis; heterogeneity.; normalization-free; pan-cancer; transcriptome.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Computational Biology / methods*
Gene Expression Profiling / methods*
Genes, Neoplasm*
Humans
Neoplasms / pathology*