A Computational Method of Defining Potential Biomarkers based on Differential Sub-Networks

Xin Huang; Xiaohui Lin; Jun Zeng; Lichao Wang; Peiyuan Yin; Lina Zhou; Chunxiu Hu; Weihong Yao

doi:10.1038/s41598-017-14682-5

A Computational Method of Defining Potential Biomarkers based on Differential Sub-Networks

Sci Rep. 2017 Oct 30;7(1):14339. doi: 10.1038/s41598-017-14682-5.

Authors

Xin Huang¹, Xiaohui Lin², Jun Zeng³, Lichao Wang³, Peiyuan Yin³, Lina Zhou³, Chunxiu Hu³, Weihong Yao¹

Affiliations

¹ School of Computer Science & Technology, Dalian University of Technology, 116024, Dalian, China.
² School of Computer Science & Technology, Dalian University of Technology, 116024, Dalian, China. datas@dlut.edu.cn.
³ CAS Key Laboratory of Separation Science for Analytical Chemistry, Dalian Institute of Chemical Physics, Chinese Academy of Sciences, Dalian, 116023, China.

Abstract

Analyzing omics data from a network-based perspective can facilitate biomarker discovery. To improve disease diagnosis and identify prospective information indicating the onset of complex disease, a computational method for identifying potential biomarkers based on differential sub-networks (PB-DSN) is developed. In PB-DSN, Pearson correlation coefficient (PCC) is used to measure the relationship between feature ratios and to infer potential networks. A differential sub-network is extracted to identify crucial information for discriminating different groups and indicating the emergence of complex diseases. Subsequently, PB-DSN defines potential biomarkers based on the topological analysis of these differential sub-networks. In this study, PB-DSN is applied to handle a static genomics dataset of small, round blue cell tumors and a time-series metabolomics dataset of hepatocellular carcinoma. PB-DSN is compared with support vector machine-recursive feature elimination, multivariate empirical Bayes statistics, analyzing time-series data based on dynamic networks, molecular networks based on PCC, PinnacleZ, graph-based iterative group analysis, KeyPathwayMiner and BioNet. The better performance of PB-DSN not only demonstrates its effectiveness for the identification of discriminative features that facilitate disease classification, but also shows its potential for the identification of warning signals.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Algorithms
Animals
Area Under Curve
Bayes Theorem
Biomarkers / metabolism
Carcinoma, Hepatocellular / pathology
Computational Biology / methods*
Genomics / methods*
Humans
Liver Neoplasms / pathology
Metabolomics / methods*
Support Vector Machine

Substances

Biomarkers