Identifying subgroup markers in heterogeneous populations

Nucleic Acids Res. 2013 Nov;41(21):e200. doi: 10.1093/nar/gkt845. Epub 2013 Sep 22.

Abstract

Traditional methods that aim to identify biomarkers that distinguish between two groups, like Significance Analysis of Microarrays or the t-test, perform optimally when such biomarkers show homogeneous behavior within each group and differential behavior between the groups. However, in many applications, this is not the case. Instead, a subgroup of samples in one group shows differential behavior with respect to all other samples. To successfully detect markers showing such imbalanced patterns of differential signal, a different approach is required. We propose a novel method, specifically designed for the Detection of Imbalanced Differential Signal (DIDS). We use an artificial dataset and a human breast cancer dataset to measure its performance and compare it with three traditional methods and four approaches that take imbalanced signal into account. Supported by extensive experimental results, we show that DIDS outperforms all other approaches in terms of power and positive predictive value. In a mouse breast cancer dataset, DIDS is the only approach that detects a functionally validated marker of chemotherapy resistance. DIDS can be applied to any continuous value data, including gene expression data, and in any context where imbalanced differential signal is manifested.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Algorithms*
  • Animals
  • Biomarkers, Tumor / genetics
  • Biomarkers, Tumor / metabolism*
  • Breast Neoplasms / genetics
  • Breast Neoplasms / metabolism
  • Female
  • Gene Expression*
  • Humans
  • Mammary Neoplasms, Experimental / genetics
  • Mammary Neoplasms, Experimental / metabolism
  • Mice
  • Receptor, ErbB-2 / analysis

Substances

  • Biomarkers, Tumor
  • Receptor, ErbB-2