Stacked generalization: an introduction to super learning

Ashley I Naimi; Laura B Balzer

doi:10.1007/s10654-018-0390-z

Eur J Epidemiol. 2018 May;33(5):459-464. doi: 10.1007/s10654-018-0390-z. Epub 2018 Apr 10.

Authors

Ashley I Naimi¹, Laura B Balzer²

Affiliations

¹ Department of Epidemiology, University of Pittsburgh, 130 DeSoto Street 503 Parran Hall, Pittsburgh, PA, 15261, USA. ashley.naimi@pitt.edu.
² Department of Biostatistics and Epidemiology, University of Massachusetts, Amherst, MA, USA.

Abstract

Stacked generalization is an ensemble method that allows researchers to combine several different prediction algorithms into one. Since its introduction in the early 1990s, the method has evolved into a family of related methods, one of which is the "Super Learner". Super Learner uses V-fold cross-validation to build the optimal weighted combination of predictions from a library of candidate algorithms. Optimality is defined by a user-specified objective function, such as minimizing mean squared error or maximizing the area under the receiver operating characteristic curve. Although Super Learner is relatively simple in nature, its use by epidemiologists has been hampered by limited understanding of its conceptual and technical details. We work step-by-step through two examples to illustrate concepts and address common concerns.

Keywords: Ensemble learning; Machine learning; Stacked generalization; Stacked regression; Super Learner.
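
The procedure summarized in the abstract (V-fold cross-validation over a library of candidate algorithms, followed by a weighted combination chosen to minimize an objective function) can be sketched in a few lines. What follows is a minimal illustration under stated assumptions, not the authors' implementation: the three candidate algorithms, the simulated data, the function names, and the coarse grid search over convex weights are all hypothetical choices made for this example. The objective is mean squared error, and the grid search stands in for whatever constrained optimizer a real implementation would use.

```python
import numpy as np

# --- Candidate library (hypothetical choices for illustration) ---
# Each "fit" function takes training data and returns a prediction function.

def fit_mean(X, y):
    mu = y.mean()
    return lambda X_new: np.full(len(X_new), mu)

def fit_linear(X, y):
    design = lambda Z: np.column_stack([np.ones(len(Z)), Z])
    beta, *_ = np.linalg.lstsq(design(X), y, rcond=None)
    return lambda X_new: design(X_new) @ beta

def fit_quadratic(X, y):
    design = lambda Z: np.column_stack([np.ones(len(Z)), Z, Z ** 2])
    beta, *_ = np.linalg.lstsq(design(X), y, rcond=None)
    return lambda X_new: design(X_new) @ beta

LIBRARY = [fit_mean, fit_linear, fit_quadratic]

def cross_validated_predictions(X, y, library, n_folds=5, seed=0):
    """Return an (n, k) matrix of out-of-fold predictions, one column per candidate."""
    rng = np.random.default_rng(seed)
    folds = np.array_split(rng.permutation(len(y)), n_folds)
    Z = np.empty((len(y), len(library)))
    for held_out in folds:
        train = np.setdiff1d(np.arange(len(y)), held_out)
        for j, fit in enumerate(library):
            # Fit on the training folds only, predict the held-out fold.
            Z[held_out, j] = fit(X[train], y[train])(X[held_out])
    return Z

def simplex_grid(k, steps):
    """Yield all tuples of k non-negative integers that sum to `steps`."""
    if k == 1:
        yield (steps,)
        return
    for i in range(steps + 1):
        for rest in simplex_grid(k - 1, steps - i):
            yield (i,) + rest

def fit_weights(Z, y, steps=20):
    """Grid-search non-negative weights summing to 1 that minimize cross-validated MSE."""
    best_w, best_mse = None, np.inf
    for counts in simplex_grid(Z.shape[1], steps):
        w = np.array(counts) / steps
        mse = np.mean((y - Z @ w) ** 2)
        if mse < best_mse:
            best_w, best_mse = w, mse
    return best_w, best_mse

def super_learner(X, y, library=LIBRARY, n_folds=5):
    Z = cross_validated_predictions(X, y, library, n_folds)
    w, cv_mse = fit_weights(Z, y)
    # Refit every candidate on the full data, then combine with the learned weights.
    fitted = [fit(X, y) for fit in library]
    predict = lambda X_new: np.column_stack([f(X_new) for f in fitted]) @ w
    return predict, w, cv_mse

# Simulated data (an assumption for this sketch): a noisy quadratic relationship.
rng = np.random.default_rng(1)
X = rng.uniform(-2, 2, size=(200, 1))
y = 1.0 + X[:, 0] ** 2 + rng.normal(scale=0.3, size=200)

predict, weights, cv_mse = super_learner(X, y)
Z = cross_validated_predictions(X, y, LIBRARY)
candidate_mse = np.mean((y[:, None] - Z) ** 2, axis=0)
```

Because the grid includes the weight vectors that put all mass on a single candidate, the combined cross-validated error in this sketch cannot exceed that of the best individual candidate evaluated on the same folds. Swapping the mean-squared-error line in `fit_weights` for another loss would change the definition of optimality accordingly.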

Publication types

Review

MeSH terms

Algorithms
Generalization, Psychological*
Humans
Machine Learning / statistics & numerical data*
Models, Statistical
