NCPHLDA: a novel method for human lncRNA-disease association prediction based on network consistency projection

Mol Omics. 2019 Dec 2;15(6):442-450. doi: 10.1039/c9mo00092e.

Abstract

In recent years, an increasing number of biological experiments and clinical reports have shown that lncRNA is closely related to the development of various complex human diseases. Therefore, studying the relationship between lncRNA and disease is necessary. Doing so not only helps to understand the disease mechanism, but also facilitates the diagnosis, treatment, and prognosis of disease. However, understanding the relationship between lncRNA and disease through biological experiments and clinical studies requires considerable time and money. Over the years, many researchers have developed computational methods to predict potential lncRNA-disease associations. In this study, on the basis of the assumption that functionally similar lncRNAs tend to associate with phenotypically similar diseases, and vice versa, we propose a novel computational method called network consistency projection for human lncRNA-disease associations (NCPHLDA) to predict potential lncRNA-disease associations. This method integrates a lncRNA cosine similarity network, a disease cosine similarity network, and the known lncRNA-disease association network. NCPHLDA is not only a parameterless method but also does not require a negative sample. More importantly, NCPHLDA can predict lncRNA without any known associated diseases. AUC values of 0.9273 and 0.9179 ± 0.0043 are obtained by implementing leave-one-out cross-validation and 5-fold cross-validation for NCPHLDA, respectively. Case studies of three diseases (breast cancer, cervical cancer, and hepatocellular carcinoma) indicate that NCPHLDA has reliable predictive performance. The source code of NCPHLDA is freely available at https://github.com/bryanze/NCPHLDA.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Computational Biology / methods*
  • Gene Expression Regulation
  • Gene Regulatory Networks
  • Genetic Predisposition to Disease*
  • Genome-Wide Association Study / methods
  • Humans
  • RNA, Long Noncoding / genetics*
  • ROC Curve
  • Software*

Substances

  • RNA, Long Noncoding