Computational Methods for Identifying Similar Diseases

Liang Cheng; Hengqiang Zhao; Pingping Wang; Wenyang Zhou; Meng Luo; Tianxin Li; Junwei Han; Shulin Liu; Qinghua Jiang

doi:10.1016/j.omtn.2019.09.019

Mol Ther Nucleic Acids. 2019 Dec 6:18:590-604. doi: 10.1016/j.omtn.2019.09.019. Epub 2019 Sep 28.

Authors

Liang Cheng¹, Hengqiang Zhao¹, Pingping Wang², Wenyang Zhou², Meng Luo², Tianxin Li², Junwei Han³, Shulin Liu⁴, Qinghua Jiang⁵

Affiliations

¹ College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China.
² School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China.
³ College of Bioinformatics Science and Technology, Harbin Medical University, Harbin 150081, China. Electronic address: hanjunwei1981@163.com.
⁴ Systemomics Center, College of Pharmacy, and Genomics Research Center (State-Province Key Laboratories of Biomedicine-Pharmaceutics of China), Harbin Medical University, Harbin, Heilongjiang, China; Department of Microbiology, Immunology and Infectious Diseases, University of Calgary, Calgary, AB, Canada. Electronic address: slliu@hrbmu.edu.cn.
⁵ School of Life Science and Technology, Harbin Institute of Technology, Harbin, Heilongjiang, China. Electronic address: qhjiang@hit.edu.cn.

Abstract

Although our knowledge of human diseases has increased dramatically, the molecular basis, phenotypic traits, and therapeutic targets of most diseases remain unclear. A growing number of studies have observed that similar diseases are often caused by similar molecules, can be diagnosed by similar markers or phenotypes, or can be cured by similar drugs. The identification of diseases similar to known ones has therefore attracted considerable attention worldwide. To this end, associations between diseases at the molecular, phenotypic, and taxonomic levels have been used to measure the pairwise similarity between diseases. Three strategies are used to assess the performance of these methods: category-based, simulated-patient-based, and benchmark-data-based. To facilitate the calculation of disease similarity scores, researchers have designed dozens of tools that implement these methods. Disease similarity has already proved useful in predicting noncoding RNA (ncRNA) function and therapeutic drugs for diseases. In this article, we review disease similarity methods, evaluation strategies, tools, and their applications in the biomedical community. We further evaluate the performance of frequently used methods with a benchmark-data-based strategy and discuss the current limitations and future trends in calculating disease similarity.
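
As a rough illustration of what a molecular-level pairwise similarity can look like, the sketch below scores each pair of diseases by the Jaccard overlap of their associated gene sets. This is an assumption made for illustration only: the abstract does not say which measures the review covers, and the function names, disease labels, and gene identifiers are all hypothetical placeholders, not real associations.

```python
from itertools import combinations


def jaccard(a, b):
    """Jaccard index of two sets; defined as 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def pairwise_similarity(disease_genes):
    """Score every unordered pair of diseases by gene-set overlap.

    disease_genes maps a disease label to the set of genes associated
    with it. Keys of the result are (d1, d2) tuples in sorted order.
    """
    return {
        (d1, d2): jaccard(disease_genes[d1], disease_genes[d2])
        for d1, d2 in combinations(sorted(disease_genes), 2)
    }


# Toy input: placeholder labels and gene IDs, not real disease-gene data.
toy = {
    "disease_A": {"G1", "G2", "G3"},
    "disease_B": {"G2", "G3", "G4"},
    "disease_C": {"G5"},
}

scores = pairwise_similarity(toy)
for pair, score in sorted(scores.items()):
    print(pair, score)
```

In this toy input, disease_A and disease_B share two of four distinct genes, so they score 0.5, while disease_C shares none with either and scores 0.0. A plain overlap measure like this ignores the functional relationships between genes, so it should be read as a baseline sketch rather than as any of the reviewed methods.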

Keywords: disease similarity; molecular basis; ncRNA function; phenotypic traits; therapeutic drugs.

Publication types

Review