Neighborhood rough set reduction-based gene selection and prioritization for gene expression profile analysis and molecular cancer classification

J Biomed Biotechnol. 2010;2010:726413. doi: 10.1155/2010/726413. Epub 2010 Jun 23.

Abstract

Selecting reliable cancer biomarkers is crucial for precise, gene expression profile-based diagnosis of cancer type and for successful treatment. However, current studies are confronted with overfitting and the curse of dimensionality in tumor classification, and with false positives in the identification of cancer biomarkers. Here, we developed a novel gene-ranking method based on neighborhood rough set reduction for molecular cancer classification from gene expression profiles. Compared with other methods such as PAM, ClaNC, the Kruskal-Wallis rank sum test, and Relief-F, our method achieves higher tumor classification accuracy using only a few top-ranked genes. Moreover, although the selected genes are not typical known oncogenes, a search of the scientific literature and an analysis of their protein interaction partners indicate that they play a crucial role in tumor occurrence, so they may be used as candidate cancer biomarkers.
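The abstract does not spell out the ranking procedure, so the following is only a minimal sketch of how a neighborhood rough set gene ranking is commonly set up, not the authors' implementation. It assumes expression values scaled to [0, 1], Euclidean distance, a neighborhood radius `delta`, and a greedy forward search in which the order of selection serves as the rank. The names `dependency`, `rank_genes`, `delta`, and `top_k` are hypothetical and chosen for illustration.

```python
import numpy as np


def dependency(X, y, genes, delta):
    """Fraction of samples whose delta-neighborhood (over `genes`) is class-pure.

    X: (n_samples, n_genes) expression matrix, assumed scaled to [0, 1].
    y: (n_samples,) class labels.
    genes: list of column indices defining the gene subspace.
    """
    sub = X[:, genes]
    # Pairwise Euclidean distances between samples in the chosen subspace.
    dist = np.sqrt(((sub[:, None, :] - sub[None, :, :]) ** 2).sum(axis=2))
    neighbors = dist <= delta
    same_class = y[:, None] == y[None, :]
    # A sample belongs to the positive region if every neighbor shares its label.
    pure = np.all(~neighbors | same_class, axis=1)
    return pure.mean()


def rank_genes(X, y, delta=0.1, top_k=5):
    """Greedy forward selection: repeatedly add the gene that most raises dependency.

    Returns gene indices in the order they were selected (assumed ranking).
    """
    selected = []
    remaining = list(range(X.shape[1]))
    while remaining and len(selected) < top_k:
        scores = [dependency(X, y, selected + [g], delta) for g in remaining]
        best = int(np.argmax(scores))
        selected.append(remaining.pop(best))
    return selected
```

Under these assumptions, a gene whose values separate the classes cleanly yields a dependency close to 1 and is picked early, while an uninformative gene leaves neighborhoods mixed and scores near 0. The choice of `delta` strongly affects the result and would need tuning on real data.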

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Databases, Genetic
  • Gene Expression Profiling*
  • Gene Expression Regulation, Neoplastic*
  • Genes, Neoplasm / genetics*
  • Humans
  • Male
  • Models, Genetic*
  • Neoplasms / classification*
  • Neoplasms / genetics*
  • Prostatic Neoplasms / genetics
  • Protein Binding