Van der Laan, Mark J.; Polley, Eric C.; Hubbard, Alan E. Super learner. (English) Zbl 1166.62387 Stat. Appl. Genet. Mol. Biol. 6, No. 1, Article No. 25 (2007). Summary: When trying to learn a model for the prediction of an outcome given a set of covariates, a statistician has many estimation procedures in their toolbox. A few examples of these candidate learners are: least squares, least angle regression, random forests, and spline regression. Previous articles theoretically validated the use of cross validation to select an optimal learner among many candidate learners. Motivated by this use of cross validation, we propose a new prediction method for creating a weighted combination of many candidate learners to build the super learner. This article proposes a fast algorithm for constructing a super learner in prediction which uses V-fold cross-validation to select weights to combine an initial set of candidate learners. In addition, this paper contains a practical demonstration of the adaptivity of this so called super learner to various true data generating distributions. This approach for construction of a super learner generalizes to any parameter which can be defined as a minimizer of a loss function. Cited in 1 ReviewCited in 52 Documents MSC: 62P99 Applications of statistics 68T35 Theory of languages and software systems (knowledge-based systems, expert systems, etc.) for artificial intelligence 65C60 Computational problems in statistics (MSC2010) Software:SuperLearner PDF BibTeX XML Cite \textit{M. J. Van der Laan} et al., Stat. Appl. Genet. Mol. Biol. 6, No. 1, Article No. 25 (2007; Zbl 1166.62387) Full Text: DOI Link OpenURL