zbMATH — the first resource for mathematics

Comments on: “A random forest guided tour”. (English) Zbl 1402.62131
Summary: This paper is a comment on the survey paper by G. Biau and E. Scornet [ibid. 25, No. 2, 197–227 (2016; Zbl 1402.62133)] about random forests. We focus on the problem of quantifying the impact of each ingredient of random forests on their performance. We show that such a quantification is possible for a simple pure forest, leading to conclusions that could apply more generally. Then, we consider “hold-out” random forests, which are a good middle point between “toy” pure forests and L. Breiman’s original random forests [Mach. Learn. 45, No. 1, 5–32 (2001; Zbl 1007.68152)].

62H30 Classification and discrimination; cluster analysis (statistical aspects)
62G08 Nonparametric regression and quantile regression
68T05 Learning and adaptive systems in artificial intelligence
R; randomForest
Full Text: DOI
[1] Arlot S (2008) \(V\)-fold cross-validation improved: \(V\)-fold penalization. arXiv:0802.0566v2 · Zbl 1402.62133
[2] Arlot S, Genuer R (2014) Analysis of purely random forests bias. arXiv:1407.3939v1 · Zbl 1402.62131
[3] Biau, G, Analysis of a random forests model, J Mach Learn Res, 13, 1063-1095, (2012) · Zbl 1283.62127
[4] Biau, G; Scornet, E, A random forest guided tour, TEST, (2016) · Zbl 1402.62133
[5] Bühlmann, P; Yu, B, Analyzing bagging, Ann Stat, 30, 927-961, (2002) · Zbl 1029.62037
[6] Friedman JH (1991) Multivariate adaptive regression splines. Ann Stat, pp 1-67
[7] Genuer, R, Variance reduction in purely random forests, J Nonparametric Stat, 24, 543-562, (2012) · Zbl 1254.62050
[8] Genuer R, Poggi JM, Tuleau C (2008) Random forests: some methodological insights. arXiv:0811.3619
[9] Liaw A, Wiener M (2002) Classification and regression by randomforest. R News 2(3):18-22. http://CRAN.R-project.org/doc/Rnews/
[10] R Core Team (2015) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. http://www.R-project.org/
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.