Multiple mediation analysis for interval-valued data. (English) Zbl 1437.62194

Summary: Mediation analysis is an important statistical approach to evaluate the relationships among observed variables. The most commonly used models for mediation analysis handle single-valued variables. However, there are several circumstances (e.g., dimensionality reduction of large datasets, clinical patient courses, repeated measures, masked data, uncertain data) in which the collected information can be represented more naturally by means of intervals. In these cases, standard mediation analyses can be ill-suited. Although interval-valued variables can be transformed into standard single-valued variables, such procedures may mask some relevant information provided by intervals. In this article, we present a novel and simple model (IMedA) to perform mediation analysis on interval-valued variables which is based on both the symbolic regression approach and the regression based mediation framework. We also generalize Stolzenberg’s decomposition of effects to cope with interval-valued data. We further introduce a specific variance based decomposition procedure to descriptively evaluate the sizes of such effects. Finally, to better highlight the IMedA features we apply our model to a real case study from behavioral contexts.


62H12 Estimation in multivariate analysis
62P10 Applications of statistics to biology and medical sciences; meta analysis


SODAS; bmem ; DYFRAT; lavaan
Full Text: DOI


[1] Alarcon, Gm, A meta-analysis of burnout with job demands, resources, and attitudes, J Vocat Behav, 79, 2, 549-562 (2011)
[2] Alkhamisi, M., Simulation study of new estimators combining the sur ridge regression and the restricted least squares methodologies, Stat Pap, 51, 3, 651-672 (2010) · Zbl 1247.62176
[3] Alwin, Df; Hauser, Rm, The decomposition of effects in path analysis, Am Sociol Rev, 40, 37-47 (1975)
[4] Arndt, S.; Turvey, C.; Coryell, Wh; Dawson, Jd; Leon, Ac; Akiskal, Hs, Charting patients’ course: a comparison of statistics used to summarize patient course in longitudinal and repeated measures studies, J Psychiatr Res, 34, 2, 105-113 (2000)
[5] Augustin, T., Expected utility within a generalized concept of probabilitya comprehensive framework for decision making under ambiguity, Stat Pap, 43, 1, 5-22 (2002) · Zbl 1020.62005
[6] Avanzi, L.; Van Dick, R.; Fraccaroli, F.; Sarchielli, G., The downside of organizational identification: relations between identification, workaholism and well-being, Work Stress, 26, 3, 289-307 (2012)
[7] Avanzi, L.; Balducci, C.; Fraccaroli, F., Contribution to the italian validation of the copenhagen burnout inventory (cbi), Psicol Della Salute, 2, 120-135 (2013)
[8] Baron, Rm; Kenny, Da, The moderator-mediator variable distinction in social psychological research: conceptual, strategic, and statistical considerations, J Pers Soc Psychol, 51, 6, 1173-1182 (1986)
[9] Billard, L.; Diday, E.; Jajuga, K.; Sokołowski, A.; Bock, Hh, Symbolic regression analysis, Classification, clustering, and data analysis: recent advances and applications, 281-288 (2002), Berlin: Springer, Berlin
[10] Billard, L.; Diday, E., From the statistics of data to the statistics of knowledge: symbolic data analysis, J Am Stat Assoc, 98, 462, 470-487 (2003)
[11] Blanco-Fernández, A.; Winker, P., Data generation processes and statistical management of interval data, AStA Adv Stat Anal, 100, 4, 475-494 (2016) · Zbl 1443.62017
[12] Blanco-Fernández, A.; Colubi, A.; García-Bárzana, M., A set arithmetic-based linear regression model for modelling interval-valued responses through real-valued variables, Inf Sci, 247, 109-122 (2013) · Zbl 1321.62078
[13] Blanco-Fernández, A.; Colubi, A.; González-Rodríguez, G., Linear regression analysis for interval-valued data based on set arithmetic: a review, Towards advanced data analysis by combining soft computing and statistics, 19-31 (2013), Berlin: Springer, Berlin · Zbl 1348.62191
[14] Bliese, Pd; Castro, Ca, Role clarity, work overload and organizational support: multilevel evidence of the importance of support, Work Stress, 14, 1, 65-73 (2000)
[15] Bollen, Ka; Stine, R., Direct and indirect effects: classical and bootstrap estimates of variability, Sociol Methodol, 20, 1, 15-140 (1990)
[16] Caffo, B.; Chen, S.; Stewart, W.; Bolla, K.; Yousem, D.; Davatzikos, C.; Schwartz, Bs, Are brain volumes based on magnetic resonance imaging mediators of the associations of cumulative lead dose with cognitive function?, Am J Epidemiol, 167, 4, 429-437 (2008)
[17] Calcagnì, A.; Lombardi, L., Dynamic fuzzy rating tracker (dyfrat): a novel methodology for modeling real-time dynamic cognitive processes in rating scales, Appl Soft Comput, 24, 948-961 (2014)
[18] Calcagnì, Antonio; Lombardi, Luigi; Sulpizio, Simone, Analyzing spatial data from mouse tracker methodology: An entropic approach, Behavior Research Methods, 49, 6, 2012-2030 (2017)
[19] Carpita, M.; Ciavolino, E., A generalized maximum entropy estimator to simple linear measurement error model with a composite indicator, Adv Data Anal Classif, 11, 1, 139-158 (2017) · Zbl 07061443
[20] Choi, Jin-Young; Lee, Myoung-Jae, Regression discontinuity: review with extensions, Statistical Papers, 58, 4, 1217-1246 (2016) · Zbl 1382.62015
[21] Claessens, Bj; Van Eerde, W.; Rutte, Cg; Roe, Ra, Planning behavior and perceived control of time at work, J Organ Behav, 25, 8, 937-950 (2004)
[22] Couso, I.; Dubois, D., Statistical reasoning with set-valued information: Ontic vs. epistemic views, Int J Approx Reason, 55, 7, 1502-1518 (2014) · Zbl 1407.62032
[23] Diday, E.; Noirhomme-Fraiture, M., Symbolic data analysis and the SODAS software (2008), New York: Wiley, New York · Zbl 1275.62029
[24] Edwards, Ja; Webster, S.; Van Laar, D.; Easton, S., Psychometric analysis of the uk health and safety executive’s management standards work-related stress indicator tool, Work Stress, 22, 2, 96-107 (2008)
[25] Edwards, Jr; Lambert, Ls, Methods for integrating moderation and mediation: a general analytical framework using moderated path analysis, Psychol Methods, 12, 1, 1-22 (2007)
[26] Everitt, B., The analysis of repeated measures: a practical review with examples, Stat, 44, 1, 113-135 (1995)
[27] Fairchild, Aj; Mackinnon, Dp; Taborga, Mp; Taylor, Ab, R2 effect-size measures for mediation analysis, Behav Res Methods, 41, 2, 486-498 (2009)
[28] Fields, Gs, Accounting for income inequality and its change: a new method, with application to the distribution of earnings in the united states, Res Labor Econ, 22, 1-38 (2003)
[29] Fishburn, Pc, Interval representations for interval orders and semiorders, J Math Psychol, 10, 1, 91-105 (1973) · Zbl 0264.06003
[30] Fisher, Cd; To, Ml, Using experience sampling methodology in organizational behavior, J Organ Behav, 33, 7, 865-877 (2012)
[31] Frison, L.; Pocock, Sj, Repeated measures in clinical trials: analysis using mean summary statistics and its implications for design, Stat Med, 11, 13, 1685-1704 (1992)
[32] Gómez, G.; Calle, Ml; Oller, R., Frequentist and bayesian approaches for interval-censored data, Stat Pap, 45, 2, 139-173 (2004) · Zbl 1052.62096
[33] Halff, Hm; Ortony, A.; Anderson, Rc, A context-sensitive representation of word meanings, Memory Cogn, 4, 4, 378-383 (1976)
[34] Hayes, Af; Preacher, Kj, Quantifying and testing indirect effects in simple mediation models when the constituent paths are nonlinear, Multivar Behav Res, 45, 4, 627-660 (2010)
[35] Imai, K.; Van Dyk, Da, Causal inference with general treatment regimes, J Am Stat Assoc, 99, 467, 854-866 (2004) · Zbl 1117.62361
[36] Imai, K.; Keele, L.; Yamamoto, T., Identification, inference and sensitivity analysis for causal mediation effects, Stat Sci, 25, 1, 51-71 (2010) · Zbl 1328.62478
[37] Johnson, A.; Mulder, B.; Sijbinga, A.; Hulsebos, L., Action as a window to perception: measuring attention with mouse movements, Appl Cogn Psychol, 26, 5, 802-809 (2012)
[38] Judd, Cm; Kenny, Da, Process analysis estimating mediation in treatment evaluations, Eval Rev, 5, 5, 602-619 (1981)
[39] Kiers, Ha, Setting up alternating least squares and iterative majorization algorithms for solving various matrix optimization problems, Comput Stat Data Anal, 41, 1, 157-170 (2002) · Zbl 1018.65074
[40] Kim, H.; Park, H., Sparse non-negative matrix factorizations via alternating non-negativity-constrained least squares for microarray data analysis, Bioinformatics, 23, 12, 1495-1502 (2007)
[41] Kristal, Ar; Glanz, K.; Tilley, Bc; Li, S., Mediating factors in dietary change: understanding the impact of a worksite nutrition intervention, Health Educ Behav, 27, 1, 112-125 (2000)
[42] Kristensen, Ts; Borritz, M.; Villadsen, E.; Christensen, Kb, The copenhagen burnout inventory: a new tool for the assessment of burnout, Work Stress, 19, 3, 192-207 (2005)
[43] Lima Neto, Eda; De Carvalho, Fda, Centre and range method for fitting a linear regression model to symbolic interval data, Comput Stat Data Anal, 52, 3, 1500-1515 (2008) · Zbl 1452.62493
[44] Lima Neto, Eda; De Carvalho, Fda, Constrained linear regression models for symbolic interval-valued variables, Comput Stat Data Anal, 54, 2, 333-347 (2010) · Zbl 1464.62055
[45] Little, R., Statistical analysis of masked data, J Off Stat, 9, 2, 407-426 (1993)
[46] Luce, Rd, Semiorders and a theory of utility discrimination, Econometrica, 24, 2, 178-191 (1956) · Zbl 0071.14006
[47] Luo, P.; Geng, Z., Causal mediation analysis for survival outcome with unobserved mediator-outcome confounders, Comput Stat Data Anal, 93, 336-347 (2016) · Zbl 1468.62127
[48] Mackinnon, D., Introduction to statistical mediation analysis (2008), New York: Routledge, New York
[49] Mackinnon, Dp; Fairchild, Aj, Current directions in mediation analysis, Curr Dir Psychol Sci, 18, 1, 16-20 (2009)
[50] Mood, A.; Graybill, F., Introduction to the theory of statistics (1950), New York: McGraw-Hill, New York
[51] Moore, Re, Interval analysis (1966), New York: Prentice-Hall Englewood Cliffs, New York
[52] Nkurunziza, S.; Ejaz Ahmed, S., Estimation strategies for the regression coefficient parameter matrix in multivariate multiple regression, Stat Neerl, 65, 4, 387-406 (2011)
[53] Parchami, A.; Taheri, Sm; Mashinchi, M., Testing fuzzy hypotheses based on vague observations: a p-value approach, Stat Pap, 53, 2, 469-484 (2012) · Zbl 1440.62104
[54] Preacher, Kj; Hayes, Af, Asymptotic and resampling strategies for assessing and comparing indirect effects in multiple mediator models, Behav Res Methods, 40, 3, 879-891 (2008)
[55] Preacher, Kj; Kelley, K., Effect size measures for mediation models: quantitative strategies for communicating indirect effects, Psychol Methods, 16, 2, 93 (2011)
[56] Rosseel, Y., Lavaan: an R package for structural equation modeling, J Stat Softw, 48, 2, 1-36 (2012)
[57] Salicone, S., Measurement uncertainty: an approach via the mathematical theory of evidence (2007), New York: Springer, New York · Zbl 1144.62001
[58] Sawyer, Je, Goal and process clarity: specification of multiple constructs of role ambiguity and a structural equation model of their antecedents and consequences, J Appl Psychol, 77, 2, 130 (1992)
[59] Seibold, Dr; Mcphee, Rd, Commonality analysis: a method for decomposing explained variance in multiple regression analyses, Human Commun Res, 5, 4, 355-365 (1979)
[60] Senn, S.; Stevens, L.; Chaturvedi, N., Repeated measures in clinical trials: simple strategies for analysis using summary measures, Stat Med, 19, 6, 861-877 (2000)
[61] Sobel, Me, Asymptotic confidence intervals for indirect effects in structural equation models, Sociol Methodol, 13, 1982, 290-312 (1982)
[62] Stolzenberg, Rm, The measurement and decomposition of causal effects in nonlinear and nonadditive models, Sociol Methodol, 11, 459-488 (1980)
[63] Sutton, S., Predicting and explaining intentions and behavior: How well are we doing?, J Appl Soc Psychol, 28, 15, 1317-1338 (1998)
[64] Takane, Y.; Young, Fw; De Leeuw, J., Nonmetric individual differences multidimensional scaling: an alternating least squares method with optimal scaling features, Psychometrika, 42, 1, 7-67 (1977) · Zbl 0354.92048
[65] Taris, Tw; De Lange, Ah; Kompier, Ma; Houdmont, J.; Leka, S., Research methods in occupational health psychology, Occupational health psychology, 269-297 (2010), Hoboken: Wiley-Blackwell, Hoboken
[66] Taylor, Ab; Mackinnon, D.; Tein, Jy, Tests of the three-path mediated effect, Organ Res Methods, 11, 2, 241-269 (2008)
[67] Timmerman, Me; Kiers, Ha, Three-way component analysis with smoothness constraints, Comput Stat Data Anal, 40, 3, 447-470 (2002) · Zbl 1180.62089
[68] Toderi, S.; Balducci, C.; Edwards, Ja; Sarchielli, G.; Broccoli, M.; Mancini, G., Psychometric properties of the uk and italian versions of the hse stress indicator tool, Eur J Psychol Assess, 29, 1, 72-79 (2013)
[69] Wardle, J.; Carnell, S.; Haworth, Cm; Farooqi, Is; O’Rahilly, S.; Plomin, R., Obesity associated genetic variation in fto is associated with diminished satiety, J Clin Endocrinol Metab, 93, 9, 3640-3643 (2008)
[70] Yahya, W.; Olaifa, J., A note on ridge regression modeling techniques, Electron J Appl Stat Anal, 7, 2, 343-361 (2014)
[71] Yuan, Kh; Cheng, Y.; Maxwell, S., Moderation analysis using a two-level regression model, Psychometrika, 79, 4, 701-732 (2013) · Zbl 1303.62110
[72] Zhang, Z.; Wang, L., Methods for mediation analysis with missing data, Psychometrika, 78, 1, 154-184 (2013) · Zbl 1284.62773
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.