Random forests for microarrays

Adele Cutler; John R Stevens

doi:10.1016/S0076-6879(06)11023-X

Random forests for microarrays

Methods Enzymol. 2006:411:422-32. doi: 10.1016/S0076-6879(06)11023-X.

Authors

Adele Cutler¹, John R Stevens

Affiliation

¹ Department of Mathematics and Statistics, Utah State University, Logan, UT, USA.

PMID: 16939804
DOI: 10.1016/S0076-6879(06)11023-X

Abstract

Random Forests is a powerful multipurpose tool for predicting and understanding data. If gene expression data come from known groups or classes (e.g., tumor patients and controls), Random Forests can rank the genes in terms of their usefulness in separating the groups. When the groups are unknown, Random Forests uses an intrinsic measure of the similarity of the genes to extract useful multivariate structure, including clusters. This chapter summarizes the Random Forests methodology and illustrates its use on freely available data sets.

Publication types

Review

MeSH terms

Animals
Data Interpretation, Statistical
Humans
Oligonucleotide Array Sequence Analysis / methods*
Oligonucleotide Array Sequence Analysis / statistics & numerical data*
Terminology as Topic*