Analysis of cancer-related lncRNAs using gene ontology and KEGG pathways

Artif Intell Med. 2017 Feb:76:27-36. doi: 10.1016/j.artmed.2017.02.001. Epub 2017 Feb 13.

Abstract

Background: Cancer is a disease that involves abnormal cell growth and can invade or metastasize to other tissues. It is known that several factors are related to its initiation, proliferation, and invasiveness. Recently, it has been reported that long non-coding RNAs (lncRNAs) can participate in specific functional pathways and further regulate the biological function of cancer cells. Studies on lncRNAs are therefore helpful for uncovering the underlying mechanisms of cancer biological processes.

Methods: We investigated cancer-related lncRNAs using gene ontology (GO) terms and KEGG pathway enrichment scores of neighboring genes that are co-expressed with the lncRNAs by extracting important GO terms and KEGG pathways that can help us identify cancer-related lncRNAs. The enrichment theory of GO terms and KEGG pathways was adopted to encode each lncRNA. Then, feature selection methods were employed to analyze these features and obtain the key GO terms and KEGG pathways.

Results: The analysis indicated that the extracted GO terms and KEGG pathways are closely related to several cancer associated processes, such as hormone associated pathways, energy associated pathways, and ribosome associated pathways. And they can accurately predict cancer-related lncRNAs.

Conclusions: This study provided novel insight of how lncRNAs may affect tumorigenesis and which pathways may play important roles during it. These results could help understanding the biological mechanisms of lncRNAs and treating cancer.

Keywords: Cancer-related lncRNA; Dagging; Gene ontology; Incremental feature selection; KEGG pathway; Minimum redundancy maximum relevance.

MeSH terms

  • Computational Biology
  • Gene Ontology*
  • Humans
  • Neoplasms / genetics*
  • RNA, Long Noncoding*
  • Signal Transduction

Substances

  • RNA, Long Noncoding