Two stages weighted sampling strategy for detecting the relation between gene expression and disease

Int J Data Min Bioinform. 2015;12(2):207-23. doi: 10.1504/ijdmb.2015.069417.

Abstract

Most microarray data analyses focus on selecting relevant genes and measuring the classification accuracy achieved with the selected genes. This paper aims to detect the relation between gene expression levels and the classes of a cancer (or a disease) to assist researchers with initial diagnosis. The proposed method is called the Two Stages Weighted Sampling strategy (TSWS strategy). According to the results, the TSWS strategy performs better than other existing methods in terms of classification accuracy and the number of selected relevant genes. Furthermore, the TSWS strategy can also be used to understand and detect the relation between gene expression levels and the classes of a cancer (or a disease).

MeSH terms

  • Animals
  • Databases, Genetic*
  • Gene Expression Regulation, Neoplastic*
  • Humans
  • Neoplasms* / classification
  • Neoplasms* / genetics
  • Neoplasms* / metabolism
  • Oligonucleotide Array Sequence Analysis*