Matching for balance, pairing for heterogeneity in an observational study of the effectiveness of for-profit and not-for-profit high schools in Chile. (English) Zbl 1454.62510

Summary: Conventionally, the construction of a pair-matched sample selects treated and control units and pairs them in a single step with a view to balancing observed covariates \(\mathbf{x}\) and reducing the heterogeneity or dispersion of treated-minus-control response differences, \(Y\). In contrast, the method of cardinality matching developed here first selects the maximum number of units subject to covariate balance constraints and, with a balanced sample for \(\mathbf{x}\) in hand, then separately pairs the units to minimize heterogeneity in \(Y\). Reduced heterogeneity of pair differences in responses \(Y\) is known to reduce sensitivity to unmeasured biases, so one might hope that cardinality matching would succeed at both tasks, balancing \(\mathbf{x}\), stabilizing \(Y\). We use cardinality matching in an observational study of the effectiveness of for-profit and not-for-profit private high schools in Chile – a controversial subject in Chile – focusing on students who were in government run primary schools in 2004 but then switched to private high schools. By pairing to minimize heterogeneity in a cardinality match that has balanced covariates, a meaningful reduction in sensitivity to unmeasured biases is obtained.


62P25 Applications of statistics to social sciences
Full Text: DOI arXiv Euclid


[1] Angrist, J. D., Pathak, P. A. and Walters, C. R. (2013). Explaining charter school effectiveness. Am. Econ. J. 5 1-27.
[2] Baiocchi, M. (2011). Designing robust studies using propensity score and prognostic score matching. Chapter 3 in Methodologies for Observational Studies of Health Care Policy . Ph.D. thesis, Dept. Statistics, The Wharton School, Univ. Pennsylvania, Philadelphia, PA.
[3] Bellei, C. (2009). Does lengthening the school day increase students academic achievement? Results from a natural experiment in Chile. Econ. Educ. Rev. 28 629-640.
[4] Brown, B. M. (1981). Symmetric quantile averages and related estimators. Biometrika 68 235-242.
[5] Cornfield, J., Haenszel, W., Hammond, E., Lilienfeld, A., Shimkin, M. and Wynder, E. (1959). Smoking and lung cancer. J. Natl. Cancer Inst. 22 173-203.
[6] Cox, D. R. (1958). Planning of Experiments. A Wiley Publication in Applied Statistics . Wiley, New York. · Zbl 0084.15802
[7] Crump, R. K., Hotz, V. J., Imbens, G. W. and Mitnik, O. A. (2009). Dealing with limited overlap in estimation of average treatment effects. Biometrika 96 187-199. · Zbl 1163.62083
[8] Deyo, R. A., Cherkin, D. C. and Ciol, M. A. (1992). Adapting a clinical comorbidity index for use with ICD-9-CM administrative databases. J. Clin. Epidemiol. 45 613-619.
[9] Elacqua, G. (2009). The Impact of School Choice and Public Policy on Segregation : Evidence from Chile . Centro de Políticas Comparadas de Educación, Univ. Diego Portales, Santiago, Chile.
[10] Fisher, R. A. (1935). The Design of Experiments . Oliver & Boyd, Edinburgh. · Zbl 0011.03205
[11] Gastwirth, J. L. (1992). Methods for assessing the sensitivity of statistical comparisons used in Title VII cases to omitted variables. Jurimetrics 33 19-34.
[12] Hansen, B. B. (2007). Optmatch: Flexible, optimal matching for observational studies. R News 7 18-24. (Package optmatch in R).
[13] Hansen, B. B. (2008). The prognostic analogue of the propensity score. Biometrika 95 481-488. · Zbl 1437.62485
[14] Hill, J. and Su, Y.-S. (2013). Assessing lack of common support in causal inference using Bayesian nonparametrics: Implications for evaluating the effect of breastfeeding on children’s cognitive outcomes. Ann. Appl. Stat. 7 1386-1420. · Zbl 1283.62220
[15] Hodges, J. L. Jr. and Lehmann, E. L. (1963). Estimates of location based on rank tests. Ann. Math. Statist. 34 598-611. · Zbl 0203.21105
[16] Hosman, C. A., Hansen, B. B. and Holland, P. W. (2010). The sensitivity of linear regression coefficients’ confidence limits to the omission of a confounder. Ann. Appl. Stat. 4 849-870. · Zbl 1194.62089
[17] Iacus, S. M., King, G. and Porro, G. (2009). Software for coarsened exact matching. J. Stat. Softw. 30 1-27.
[18] Knaus, W. A., Draper, E. A., Wagner, D. P. and Zimmerman, J. E. (1985). APACHE II: A severity of disease classification system. Crit. Care Med. 13 818-829.
[19] Lehmann, E. L. (1975). Nonparametrics . Holden-Day, San Francisco, CA. · Zbl 0354.62038
[20] Lu, B., Greevy, R., Xu, X. and Beck, C. (2011). Optimal nonbipartite matching and its statistical applications. Amer. Statist. 65 21-30. (Package nbpmatching in R). · Zbl 05886130
[21] Marcus, S. M. (1997). Using omitted variable bias to assess uncertainty in the estimation of an AIDS education treatment effect. J. Educ. Statist. 22 193-201.
[22] Maritz, J. S. (1979). A note on exact robust confidence intervals for location. Biometrika 66 163-166. · Zbl 0396.62026
[23] Neyman, J. (1923, 1990). On the application of probability theory to agricultural experiments. Statist. Sci. 5 463-480.
[24] Rosenbaum, P. R. (1987). Sensitivity analysis for certain permutation inferences in matched observational studies. Biometrika 74 13-26. · Zbl 0605.62130
[25] Rosenbaum, P. R. (1993). Hodges-Lehmann point estimates of treatment effect in observational studies. J. Amer. Statist. Assoc. 88 1250-1253. · Zbl 0800.62138
[26] Rosenbaum, P. R. (2002). Observational Studies , 2nd ed. Springer, New York. · Zbl 0985.62091
[27] Rosenbaum, P. R. (2004). Design sensitivity in observational studies. Biometrika 91 153-164. · Zbl 1132.62363
[28] Rosenbaum, P. R. (2005). Heterogeneity and causality: Unit heterogeneity and design sensitivity in observational studies. Amer. Statist. 59 147-152. · Zbl 05680635
[29] Rosenbaum, P. R. (2007). Sensitivity analysis for \(m\)-estimates, tests, and confidence intervals in matched observational studies. Biometrics 63 456-464. (R package sensitivitymv). · Zbl 1140.62084
[30] Rosenbaum, P. R. (2010a). Design of Observational Studies. Springer Series in Statistics . Springer, New York. · Zbl 1308.62005
[31] Rosenbaum, P. R. (2010b). Design sensitivity and efficiency in observational studies. J. Amer. Statist. Assoc. 105 692-702. · Zbl 1392.62326
[32] Rosenbaum, P. R. (2011). A new U-statistic with superior design sensitivity in matched observational studies. Biometrics 67 1017-1027. · Zbl 1226.62125
[33] Rosenbaum, P. R. (2012a). Testing one hypothesis twice in observational studies. Biometrika 99 763-774. · Zbl 1323.62038
[34] Rosenbaum, P. R. (2012b). Optimal matching of an optimally chosen subset in observational studies. J. Comput. Graph. Statist. 21 57-71.
[35] Rosenbaum, P. R. (2013). Impact of multiple matched controls on design sensitivity in observational studies. Biometrics 69 118-127. · Zbl 1272.62081
[36] Rosenbaum, P. and Rubin, D. (1983). Assessing sensitivity to an unobserved binary covariate in an observational study with binary outcome. J. Roy. Statist. Soc. Ser. B 45 212-218.
[37] Rosenbaum, P. R. and Silber, J. H. (2009). Amplification of sensitivity analysis in matched observational studies. J. Amer. Statist. Assoc. 104 1398-1405. · Zbl 1205.62180
[38] Rubin, D. B. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. J. Ed. Psych. 66 688-701.
[39] Rubin, D. B. (1979). Using multivariate matched sampling and regression adjustment to control bias in observational studies. J. Amer. Statist. Assoc. 74 318-328. · Zbl 0413.62047
[40] Small, D. S. (2007). Sensitivity analysis for instrumental variables regression with overidentifying restrictions. J. Amer. Statist. Assoc. 102 1049-1058. · Zbl 1333.62295
[41] Stephenson, W. R. (1981). A general class of one-sample nonparametric test statistics based on subsamples. J. Amer. Statist. Assoc. 76 960-966. · Zbl 0481.62039
[42] Stuart, E. A. (2010). Matching methods for causal inference: A review and a look forward. Statist. Sci. 25 1-21. · Zbl 1328.62007
[43] Traskin, M. and Small, D. S. (2011). Defining the study population for an observational study to ensure suffient overlap: A tree approach. Statist. Biosci. 3 94-118.
[44] Wang, L. and Krieger, A. M. (2006). Causal conclusions are most sensitive to unobserved binary covariates. Stat. Med. 25 2257-2271.
[45] Welch, B. L. (1937). On the \(z\)-test in randomized blocks. Biometrika 29 21-52. · Zbl 0017.12602
[46] Wolfe, D. A. (1974). A characterization of population weighted-symmetry and related results. J. Amer. Statist. Assoc. 69 819-822. · Zbl 0291.62018
[47] Yanagawa, T. (1984). Case-control studies: Assessing the effect of a confounding factor. Biometrika 71 191-194. · Zbl 0532.62087
[48] Yang, D., Small, D. S., Silber, J. H. and Rosenbaum, P. R. (2012). Optimal matching with minimal deviation from fine balance in a study of obesity and surgical outcomes. Biometrics 68 628-636. (R package finebalance). · Zbl 1274.62910
[49] Yu, B. B. and Gastwirth, J. L. (2005). Sensitivity analysis for trend tests: Application to the risk of radiation exposure. Biostatistics 6 201-209. · Zbl 1071.62112
[50] Zubizarreta, J. R. (2012). Using mixed integer programming for matching in an observational study of kidney failure after surgery. J. Amer. Statist. Assoc. 107 1360-1371. (R software mipmatch at ). · Zbl 1258.62119
[51] Zubizarreta, J. R., Paredes, R. D. and Rosenbaum, P. R. (2014). Supplement to: “Matching for balance, pairing for heterogeneity in an observational study of the effectiveness of for-profit and not-for-profit high schools in Chile.” . · Zbl 1454.62510
[52] Zubizarreta, J. R., Reinke, C. E., Kelz, R. R., Silber, J. H. and Rosenbaum, P. R. (2011). Matching for several sparse nominal variables in a case-control study of readmission following surgery. Amer. Statist. 65 229-238.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.