Analysing the causal effect of London cycle superhighways on traffic congestion. (English) Zbl 1498.62304

Summary: Transport operators have a range of intervention options available to improve or enhance their networks. Such interventions are often made in the absence of sound evidence on resulting outcomes. Cycling superhighways were promoted as a sustainable and healthy travel mode, one of the aims of which was to reduce traffic congestion. Estimating the impacts that cycle superhighways have on congestion is complicated due to the nonrandom assignment of such intervention over the transport network. In this paper we analyse the causal effect of cycle superhighways utilising preintervention and postintervention information on traffic and road characteristics along with socioeconomic factors. We propose a modeling framework based on the propensity score and outcome regression model. The method is also extended to the doubly robust set-up. Simulation results show the superiority of the performance of the proposed method over existing competitors. The method is applied to analyse a real dataset on the London transport network. The methodology proposed can assist in effective decision making to improve network performance.


62P20 Applications of statistics to economics
62G05 Nonparametric estimation
62G35 Nonparametric robustness


Full Text: DOI arXiv


[1] Abadie, A. (2005). Semiparametric difference-in-differences estimators. Rev. Econ. Stud. 72 1-19. · Zbl 1112.62132
[2] Abadie, A. and Imbens, G. W. (2006). Large sample properties of matching estimators for average treatment effects. Econometrica 74 235-267. · Zbl 1112.62042
[3] Austin, P. C. (2011). An introduction to propensity score methods for reducing the effects of confounding in observational studies. Multivar. Behav. Res. 46 399-424.
[4] Badoe, D. A. and Miller, E. J. (2000). Transportation-land-use interaction: Empirical findings in North America, and their implications for modeling. Transp. Res., Part D, Transp. Environ. 5 235-263.
[5] Bang, H. and Robins, J. M. (2005). Doubly robust estimation in missing data and causal inference models. Biometrics 61 962-972. · Zbl 1087.62121
[6] Barkley, B. G., Hudgens, M. G., Clemens, J. D., Ali, M. and Emch, M. E. (2020). Causal inference from observational studies with clustered interference, with application to a cholera vaccine study. Ann. Appl. Stat. 14 1432-1448. · Zbl 1470.62152
[7] Belloni, A., Chernozhukov, V. and Hansen, C. (2014). High-dimensional methods and inference on structural and treatment effects. J. Econ. Perspect. 28 29-50. · Zbl 1409.62142
[8] Belloni, A., Chernozhukov, V., Fernández-Val, I. and Hansen, C. (2017). Program evaluation and causal inference with high-dimensional data. Econometrica 85 233-298. · Zbl 1410.62197
[9] Blunden, M. (2016). Cycle superhighways make traffic worse in the city, report reveals. Evening Stand. Oct 5.
[10] Branas, C. C., Cheney, R. A., MacDonald, J. M., Tam, V. W., Jackson, T. D. and Ten Have, T. R. (2011). A difference-in-differences analysis of health, safety, and greening vacant urban space. Am. J. Epidemiol. 174 1296-1306.
[11] Callaway, B. and Sant’Anna, P. H. C. (2020). Difference-in-differences with multiple time periods. J. Econometrics.
[12] Card, D. and Krueger, A. B. (1994). Minimum wages and employment: A case study of the fast food industry in New Jersey and Pennsylvania. Am. Econ. Rev. 84 772-93.
[13] Chaisemartin, D. and D’Haultfoeuille, X. (2020). Two-way fixed effects estimators with heterogeneous treatment effects. Am. Econ. Rev. Available at https://arxiv.org/abs/1803.08807.
[14] Chen, C., Jia, Z. and Varaiya, P. (2001). Causes and cures of highway congestion. IEEE Control Syst. Mag. 21 26-32.
[15] Daniel, R. M., Cousens, S. N., De Stavola, B. L., Kenward, M. G. and Sterne, J. A. C. (2013). Methods for dealing with time-dependent confounding. Stat. Med. 32 1584-1618.
[16] Daw, J. R. and Hatfield, L. A. (2018). Matching and regression to the mean in difference-in-differences analysis. Health Serv. Res. 53 4138-4156.
[17] Diggle, P. J., Heagerty, P. J., Liang, K.-Y. and Zeger, S. L. (2002). Analysis of Longitudinal Data, 2nd ed. Oxford Statistical Science Series 25. Oxford Univ. Press, Oxford.
[18] Flores, C. A., Flores-Lagunes, A., Gonzalez, A. and Neumann, T. C. (2012). Estimating the effects of length of exposure to instruction in a training program: The case of job corps. Rev. Econ. Stat. 94 153-171.
[19] George, K. A. (1970). Transportation compatible land uses and bus-stop location. Trans. Built Environ. 44.
[20] Graham, D. J., McCoy, E. J. and Stephens, D. A. (2016). Approximate Bayesian inference for doubly robust estimation. Bayesian Anal. 11 47-69. · Zbl 1357.62186
[21] Hahn, J. (1998). On the role of the propensity score in efficient semiparametric estimation of average treatment effects. Econometrica 66 315-331. · Zbl 1055.62572
[22] Harman, J. S., Lemak, C. H., Al-Amin, M., Hall, A. G. and Duncan, R. P. (2011). Changes in per member per month expenditures after implementation of Florida’s medicaid reform demonstration. Health Serv. Res. 43 787-804.
[23] Heckman, J., Ichimura, H., Smith, J. and Todd, P. (1998). Characterizing selection bias using experimental data. Econometrica 66 1017-1098. · Zbl 1055.62573
[24] Hernán, M. A. and Robins, J. M. (2020). Causal Inference: What If. CRC Press/CRC, Boca Raton.
[25] Hirano, K., Imbens, G. W. and Ridder, G. (2003). Efficient estimation of average treatment effects using the estimated propensity score. Econometrica 71 1161-1189. · Zbl 1152.62328
[26] Horvitz, D. G. and Thompson, D. J. (1952). A generalization of sampling without replacement from a finite universe. J. Amer. Statist. Assoc. 47 663-685. · Zbl 0047.38301
[27] Huber, P. J. (1967). The behavior of maximum likelihood estimates under nonstandard conditions. In Proc. Fifth Berkeley Sympos. Math. Statist. and Probability (Berkeley, Calif., 1965/66), Vol. I: Statistics 221-233. Univ. California Press, Berkeley, Calif. · Zbl 0212.21504
[28] Hudgens, M. G. and Halloran, M. E. (2008). Toward causal inference with interference. J. Amer. Statist. Assoc. 103 832-842. · Zbl 1471.62507
[29] Jin, J. and Rafferty, P. (2017). Does congestion negatively affect income growth and employment growth? Empirical evidence from US metropolitan regions. Transp. Policy 55 1-8.
[30] Kang, J. D. Y. and Schafer, J. L. (2007). Demystifying double robustness: A comparison of alternative strategies for estimating a population mean from incomplete data. Statist. Sci. 22 523-539. · Zbl 1246.62073
[31] Lechner, L. (2001). The estimation of causal effects by difference-in-difference methods. Found Trends Econom. 4 165-224. · Zbl 1232.91567
[32] Lee, B. K., Lessler, J. and Stuart, E. A. (2010). Improving propensity score weighting using machine learning. Stat. Med. 29 337-346.
[33] Li, H., Graham, D. J. and Liu, P. (2017). Safety effects of the London cycle superhighways on cycle collisions. Accident Anal. Prev. 90 90-101.
[34] Lindner, S. and McConnell, K. J. (2018). Difference-in-differences and matching on outcomes: A tale of two unobservables. Health Serv. Outcomes Res. Methodol. 19 127-144.
[35] Lunceford, J. K. and Davidian, M. (2004). Stratification and weighting via the propensity score in estimation of causal treatment effects: A comparative study. Stat. Med. 23 2937-2960.
[36] McCulloch, C. E. and Neuhaus, J. M. (2011). Misspecifying the shape of a random effects distribution: Why getting it wrong may not matter. Statist. Sci. 26 388-402. · Zbl 1246.62169
[37] Moodie, E. E. M., Saarela, O. and Stephens, D. A. (2018). A doubly robust weighting estimator of the average treatment effect on the treated. Stat 7 e205.
[38] Norman, W. (2017). Bike lanes don’t clog up our roads, they keep London moving. The Gaurdian Dec 1.
[39] Retallack, A. E. and Ostendorf, B. (2019). Current understanding of the effects of congestion on traffic accidents. Int. J. Environ. Res. Public Health 16.
[40] Robins, J. M., Hernán, M. A. and Brumback, B. (2000). Marginal structural models and causal inference in epidemiology. Epidemiology 11 550-560.
[41] Robins, J. M. and Rotnitzky, A. (2001). A comment on “Inference for semiparametric models: Some questions and an answer”. Statist. Sinica 11 920-936.
[42] Rosenbaum, P. R. and Rubin, D. B. (1983). The central role of the propensity score in observational studies for causal effects. Biometrika 70 41-55. · Zbl 0522.62091
[43] Rubin, D. B. (1978). Bayesian inference for causal effects: The role of randomization. Ann. Statist. 6 34-58. · Zbl 0383.62021
[44] Sant’Anna, P. H. C. and Zhao, J. (2020). Doubly robust difference-in-differences estimators. J. Econometrics 219 101-122. · Zbl 1464.62254
[45] Scharfstein, D. O., Rotnitzky, A. and Robins, J. M. (1999). Adjusting for nonignorable drop-out using semiparametric nonresponse models. J. Amer. Statist. Assoc. 94 1096-1146. With comments and a rejoinder by the authors. · Zbl 1072.62644
[46] Serfling, R. J. (1980). Approximation Theorems of Mathematical Statistics. Wiley Series in Probability and Mathematical Statistics. Wiley, New York. · Zbl 0538.62002
[47] Skrondal, A. and Rabe-Hesketh, S. (2009). Prediction in multilevel generalized linear models. J. Roy. Statist. Soc. Ser. A 172 659-687.
[48] Slawson, N. (2017). Traffic jams on major UK roads cost economy around £9bn. The Gaurdian Oct 18.
[49] Tan, Z. (2020). Model-assisted inference for treatment effects using regularized calibrated estimation with high-dimensional data. Ann. Statist. 48 811-837. · Zbl 1476.62045
[50] Tchetgen Tchetgen, E. J. and VanderWeele, T. J. (2012). On causal inference in the presence of interference. Stat. Methods Med. Res. 21 55-75.
[51] The Department of Transport (2018). Traffic statistics-methodology review-alternative data sources.
[52] Transport for London (2010). Cycling revolution London.
[53] Transport for London (2011). Barclays cycle superhighways evaluation of pilot routes 3 and 7.
[54] Transport for London (2014). Number of daily cycle journeys in London.
[55] Westreich, D., Lessler, J. and Funk, M. J. (2010). Propensity score estimation: Neural networks, support vector machines, decision trees (CART), and meta-classifiers as alternatives to logistic regression. J. Clin. Epidemiol. 63 826-833.
[56] Wharam, J. F., Landon, B. E., Galbraith, A. A., Kleinman, K. P., Soumerai, S. B. and Ross-Degnan, D. (2007). Emergency department use and subsequent hospitalizations among members of a high-deductible health plan. JAMA 297 1093-1102.
[57] Yuan, K., Knoop, V. L. and Hoogendoorn, S. P. (2015). Capacity drop: Relationship between speed in congestion and the queue discharge rate. Transp. Res. Rec. 2491 72-80.
[58] Zhang, K., Sun, D. J., Shen, S. and Zhu, Y. (2017). Analyzing spatiotemporal congestion pattern on urban roads based on taxi GPS data. J. Transp. Land Use 10 675-694
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.