×

zbMATH — the first resource for mathematics

Random forest-based approach for physiological functional variable selection for driver’s stress level classification. (English) Zbl 1427.62140
Summary: This paper deals with physiological functional variables selection for driver’s stress level classification using random forests. Our analysis is performed on experimental data extracted from the drivedb open database available on PhysioNet website. The physiological measurements of interest are: electrodermal activity captured on the driver’s left hand and foot, electromyogram, respiration, and heart rate, collected from ten driving experiments carried out in three types of routes (rest area, city, and highway). The contributions of this work touch on the method as well as the application aspects. From a methodological viewpoint, the physiological signals are considered as functional variables, decomposed on a wavelet basis and then analyzed in search of most relevant variables. On the application side, the proposed approach provides a “blind” procedure for driver’s stress level classification, giving close performances to those resulting from the expert-based approach, when applied to the drivedb database. It also suggests new physiological features based on the wavelet levels corresponding to the functional variables wavelet decomposition. Finally, the proposed approach provides a ranking of physiological variables according to their importance in stress level classification. For the case under study, results suggest that the electromyogram and the heart rate signals are less relevant compared to the electrodermal and the respiration signals. Furthermore, the electrodermal activity measured on the driver’s foot was found more relevant than the one captured on the hand. Finally, the proposed approach also provided an order of relevance of the wavelet features.
MSC:
62P30 Applications of statistics in engineering and industry; control charts
62H30 Classification and discrimination; cluster analysis (statistical aspects)
42C40 Nontrigonometric harmonic analysis involving wavelets and other special systems
62P10 Applications of statistics to biology and medical sciences; meta analysis
PDF BibTeX XML Cite
Full Text: DOI
References:
[1] Akbas, A., Evaluation of the physiological data indicating the dynamic stress level of drivers, Sci Res Essays, 6, 430-439, (2011)
[2] Alkali AH, Saatchi R, Elphick H, Burke D (2014) Short-time Fourier and wavelet transform analysis of respiration signal obtained by thermal imaging. In: 2014 9th International Symposium on Communication Systems, Networks & Digital Sign (CSNDSP). IEEE, pp 183-187. https://doi.org/10.1109/CSNDSP.2014.6923821
[3] Auret, L.; Aldrich, C., Empirical comparison of tree ensemble variable importance measures, Chemometr Intell Lab Syst, 105, 157-170, (2011)
[4] Ayata D, Yaslan Y, Kamasak M (2016) Emotion recognition via random forest and galvanic skin response: comparison of time based feature sets, window sizes and wavelet approaches. In: 2016 Medical Technologies National Congress (TIPTEKNO). IEEE, pp 1-4. https://doi.org/10.1109/TIPTEKNO.2016.7863130
[5] Bach, FR, Consistency of the group lasso and multiple kernel learning, J Mach Learn Res, 9, 1179-1225, (2008) · Zbl 1225.68147
[6] Bostrom J (2005) Emotion-sensing PCs could feel your stress. PC World
[7] Boucsein W (2012) Electrodermal activity. Springer, Berlin
[8] Breiman, L., Random forests, Mach Learn, 45, 5-32, (2001) · Zbl 1007.68152
[9] Breiman L, Friedman J, Stone CJ, Olshen RA (1984) Classification and regression trees. The wadsworth and Brooks-Cole statistics-probability series. Taylor & Francis, London
[10] Breiman L, Cutler A (2015) Randomforest: Breiman and cutler’s random forests for classification and regression. R Package Version 46-12 http://cran.r-project.org/package=randomForest
[11] Chaudhary, R., Electrocardiogram comparison of stress recognition in automobile drivers on matlab, Adv Electron Electr Eng, 3, 1007-1012, (2013)
[12] Deng Y, Wu Z, Chu C, Yang T (2012) Evaluating feature selection for stress identification. In: Information Reuse and Integration (IRI), 2012 IEEE 13th international conference on, pp 584-591. https://doi.org/10.1109/IRI.2012.6303062
[13] Díaz-Uriarte, R.; Andrés, SA, Gene selection and classification of microarray data using random forest, BMC Bioinform, 7, 1-13, (2006)
[14] El Haouij N, Poggi JM, Sevestre-Ghalila S, Ghozi R, Jaïdane M (2018) AffectiveROAD system and database to assess driver’s attention. In: SAC 2018: symposium on applied computing, April 9-13, Pau. https://doi.org/10.1145/3167132.3167395
[15] Ferraty F, Vieu P (2006) Nonparametric functional data analysis: theory and practice (springer series in statistics). Springer-Verlag New York Inc., Secaucus · Zbl 1119.62046
[16] Genuer, R.; Poggi, JM; Tuleau-Malot, C., Variable selection using random forests, Pattern Recognit Lett, 31, 2225-2236, (2010)
[17] Genuer, R.; Poggi, JM; Tuleau-Malot, C., VSURF: an R package for variable selection using random forests, R J, 7, 19-33, (2015)
[18] Goldberger, A.; Amaral, L.; Glass, L.; Hausdorff, J.; Ivanov, P.; Mark, R.; Mietus, J.; Moody, G.; Peng, CK; Stanley, H., Physiobank, physiotoolkit, and physionet: components of a new research resource for complex physiologic signals, Circulation, 101, e215-e220, (2000)
[19] Granero, AC; Fuentes-Hurtado, F.; Naranjo Ornedo, V.; Guixeres Provinciale, J.; Ausín, JM; Alcañiz Raya, M., a Comparison of physiological signal analysis techniques and classifiers for automatic emotional evaluation of audiovisual contents, Front Comput Neurosci, 10, 74, (2016)
[20] Gregorutti B (2016) RFgroove: importance measure and selection for groups of variables with random forests. R Package Version 11 http://cran.r-project.org/package=RFgroove
[21] Gregorutti, B.; Michel, B.; Saint-Pierre, P., Grouped variable importance with random forests and application to multiple functional data analysis, Comput Stat Data Anal, 90, 15-35, (2015) · Zbl 06921402
[22] Gregorutti, B.; Michel, B.; Saint-Pierre, P., Correlation and variable importance in random forests, Stat Comput, (2016) · Zbl 06737690
[23] Guendil Z, Lachiri Z, Maaoui C, Pruski A (2015) Emotion recognition from physiological signals using fusion of wavelet based features. In: 2015 7th International Conference on Modelling, Identification and Control (ICMIC), IEEE, pp 1-6. https://doi.org/10.1109/ICMIC.2015.7409485
[24] Guyon, I.; Weston, J.; Barnhill, S.; Vapnik, V., Gene selection for cancer classification using support vector machines, Mach Learn, 46, 389-422, (2002) · Zbl 0998.68111
[25] Hastie T, Tibshirani R, Friedman J (2001) The elements of statistical learning. Springer series in statistics. Springer New York Inc., New York · Zbl 0973.62007
[26] Healey JA (2000) Wearable and automotive systems for affect recognition from physiology. Ph.D. Thesis, MIT Department of Electrical Engineering and Computer Science
[27] Healey, JA; Picard, RW, Detecting stress during real-world driving tasks using physiological sensors, IEEE Trans Intell Transp Syst, 6, 156-166, (2005)
[28] Horberry, T.; Anderson, J.; Regan, MA; Triggs, TJ; Brown, J., Driver distraction: the effects of concurrent in-vehicle tasks, road environment complexity and age on driving performance, Accid Anal Prev, 38, 185-191, (2006)
[29] Imam, MH; Karmakar, CK; Khandoker, AH; Palaniswami, M., Effect of ECG-derived respiration (EDR) on modeling ventricular repolarization dynamics in different physiological and psychological conditions, Med Biol Eng Comput, 52, 851-860, (2014)
[30] Jolliffe I (2012) Principal Component Analysis. Springer, Berlin · Zbl 1011.62064
[31] Karmakar, C.; Imam, MH; Khandoker, A.; Palaniswami, M., Influence of psychological stress on QT interval, Computing in cardiology, 2014, 1009-1012, (2014)
[32] Lin, HP; Lin, HY; Lin, WL; Huang, ACW, Effects of stress, depression, and their interaction on heart rate, skin conductance, finger temperature, and respiratory rate: sympathetic-parasympathetic hypothesis of stress and depression, J Clin Psychol, 67, 1080-1091, (2011)
[33] Louppe G, Wehenkel L, Sutera A, Geurts P (2013) Understanding variable importances in forests of randomized trees
[34] Lykken, DT, Range correction applied to heart rate and to GSR data, Psychophysiology, 9, 373-379, (1972)
[35] Mallat, SG, A theory for multiresolution signal decomposition: the wavelet representation, IEEE Trans Pattern Anal Mach Intell, 11, 674-693, (1989) · Zbl 0709.94650
[36] Nicodemus, KK; Malley, JD; Strobl, C.; Ziegler, A., The behaviour of random forest permutation-based variable importance measures under predictor correlation, BMC Bioinform, 11, 1-13, (2010)
[37] Picard, RW; Fedor, S.; Ayzenberg, Y., Multiple arousal theory and daily-life electrodermal activity asymmetry, Emot Rev, 8, 62-75, (2016)
[38] Poggi JM, Tuleau C (2007) Classification of objectivization data using cart and wavelets. In: Proceedings of the IASC 07. Aveiro, pp 1-8
[39] R Core Team (2016) R: A language and environment for statistical computing. In: R foundation for statistical computing. Vienna. www.r-project.org
[40] Ramsay JO, Silverman BW (2002) Applied functional data analysis: methods and case studies, vol 77. Springer, New York · Zbl 1011.62002
[41] Ramsay JO, Silverman BW (2005) Functional data analysis. Springer, New York. https://doi.org/10.1007/b98888 · Zbl 1079.62006
[42] Rigas G, Katsis C, Bougia P, Fotiadis D (2008) A reasoning-based framework for car drivers stress prediction. In: Control and automation, 2008 16th mediterranean conference on. pp 627-632. https://doi.org/10.1109/MED.2008.4602162
[43] Sharma, N.; Gedeon, T., Objective measures, sensors and computational techniques for stress recognition and classification: a survey, Comput Methods Programs Biomed, 108, 1287-301, (2012)
[44] Sidek, KA; Khalil, I., Automobile driver recognition under different physiological conditions using the electrocardiogram, PC World, 38, 753-756, (2011)
[45] Singh RR, Conjeti S, Banerjee R (2012) Biosignal based on-road stress monitoring for automotive drivers. In: 2012 National Conference on Communications (NCC), IEEE, pp 1-5. https://doi.org/10.1109/NCC.2012.6176845
[46] Singh, M.; Queyam, AB, Stress detection in automobile drivers using physiological parameters: a review, Int J Electron Eng, 5, 1-5, (2013)
[47] Smart, RG; Cannon, E.; Howard, A.; Frise, P.; Mann, RE, Can we design cars to prevent road rage?, Int J Veh Inf Commun Syst, 1, 44-55, (2005)
[48] Strobl C, Zeileis A (2008) Danger: high power!? exploring the statistical properties of a test for random forest variable importance. In: Proceedings of 18th international conference on computational statistics
[49] Tao J, Tan T (2005) Affective computing: a review. In: International conference on affective computing and intelligent interaction. Springer, pp 981-995
[50] Ullah, S.; Finch, CF, Applications of functional data analysis: a systematic review, BMC Med Res Methodol, 13, 43, (2013)
[51] Dooren, M.; Vries, JJ; Janssen, JH, Emotional sweating across the body: Comparing 16 different skin conductance measurement locations, Physiol Behav, 106, 298-304, (2012)
[52] Verikas, A.; Gelzinis, A.; Bacauskiene, M., Mining data with random forests: a survey and results of new tests, Pattern Recognit, 44, 330-349, (2011)
[53] Yang K, Yoon H, Shahabi C (2005) A supervised feature subset selection technique for multivariate time series. In: Proceedings of the workshop on feature selection for data mining: interfacing machine learning with statistics, pp 92-101
[54] Zhang L, Tamminedi T, Ganguli A, Yosiphon G, Yadegar J (2010) Hierarchical multiple sensor fusion using structurally learned Bayesian network. In: Wireless health 2010 on—WH ’10. ACM Press, New York, p 174. https://doi.org/10.1145/1921081.1921102
[55] Zhu R, Zeng D, Kosorok MR (2012) Reinforcement learning trees. Technical reports on University of North Carolina
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.