Using prior knowledge and rule induction methods to discover molecular markers of prognosis in lung cancer

AMIA Annu Symp Proc. 2005:2005:256-60.

Abstract

An iterative computational scientific discovery approach is proposed and applied to gene expression data for resectable lung adenocarcinoma patients. We use genes learned from the C5.0 rule induction algorithm, clinical features and prior knowledge derived from a network of interacting genes as represented in a database obtained with PathwayAssist to discover markers for prognosis in the gene expression data. This is done in an iterative fashion with machine learning techniques seeding the prior knowledge. This research illustrates the utility of combining signaling networks and machine learning techniques to produce simple prognostic classifiers.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adenocarcinoma / genetics*
  • Adenocarcinoma / mortality
  • Artificial Intelligence*
  • Biomarkers, Tumor*
  • Gene Expression
  • Gene Expression Profiling
  • Humans
  • Lung Neoplasms / genetics*
  • Lung Neoplasms / mortality
  • Prognosis
  • Survival Analysis

Substances

  • Biomarkers, Tumor