DNILMF-LDA: Prediction of lncRNA-Disease Associations by Dual-Network Integrated Logistic Matrix Factorization and Bayesian Optimization

Genes (Basel). 2019 Aug 12;10(8):608. doi: 10.3390/genes10080608.

Abstract

Identifying associations between lncRNAs and diseases can help understand disease-related lncRNAs and facilitate disease diagnosis and treatment. The dual-network integrated logistic matrix factorization (DNILMF) model has been used for drug-target interaction prediction, and good results have been achieved. We firstly applied DNILMF to lncRNA-disease association prediction (DNILMF-LDA). We combined different similarity kernel matrices of lncRNAs and diseases by using nonlinear fusion to extract the most important information in fused matrices. Then, lncRNA-disease association networks and similarity networks were built simultaneously. Finally, the Gaussian process mutual information (GP-MI) algorithm of Bayesian optimization was adopted to optimize the model parameters. The 10-fold cross-validation result showed that the area under receiving operating characteristic (ROC) curve (AUC) value of DNILMF-LDA was 0.9202, and the area under precision-recall (PR) curve (AUPR) was 0.5610. Compared with LRLSLDA, SIMCLDA, BiwalkLDA, and TPGLDA, the AUC value of our method increased by 38.81%, 13.07%, 8.35%, and 6.75%, respectively. The AUPR value of our method increased by 52.66%, 40.05%, 37.01%, and 44.25%. These results indicate that DNILMF-LDA is an effective method for predicting the associations between lncRNAs and diseases.

Keywords: Bayesian optimization; dual-network integrated logistic matrix factorization; lncRNA and disease associations.

MeSH terms

  • Bayes Theorem
  • Gene Regulatory Networks
  • Genetic Predisposition to Disease*
  • Genome-Wide Association Study / methods*
  • Humans
  • RNA, Long Noncoding / genetics*
  • RNA, Long Noncoding / metabolism
  • Software*

Substances

  • RNA, Long Noncoding