×

Bayesian method for causal inference in spatially-correlated multivariate time series. (English) Zbl 1409.62178

Summary: Measuring the causal impact of an advertising campaign on sales is an essential task for advertising companies. Challenges arise when companies run advertising campaigns in multiple stores which are spatially correlated, and when the sales data have a low signal-to-noise ratio which makes the advertising effects hard to detect. This paper proposes a solution to address both of these challenges. A novel Bayesian method is proposed to detect weaker impacts and a multivariate structural time series model is used to capture the spatial correlation between stores through placing a \(\mathcal{G}\)-Wishart prior on the precision matrix. The new method is to compare two posterior distributions of a latent variable – one obtained by using the observed data from the test stores and the other one obtained by using the data from their counterfactual potential outcomes. The counterfactual potential outcomes are estimated from the data of synthetic controls, each of which is a linear combination of sales figures at many control stores over the causal period. Control stores are selected using a revised Expectation-Maximization variable selection (EMVS) method. A two-stage algorithm is proposed to estimate the parameters of the model. To prevent the prediction intervals from being explosive, a stationarity constraint is imposed on the local linear trend of the model through a recently proposed prior. The benefit of using this prior is discussed in this paper. A detailed simulation study shows the effectiveness of using our proposed method to detect weaker causal impact. The new method is applied to measure the causal effect of an advertising campaign for a consumer product sold at stores of a large national retail chain.

MSC:

62M10 Time series, auto-correlation, regression, etc. in statistics (GARCH)
62F15 Bayesian inference
62P20 Applications of statistics to economics

Software:

CausalImpact; EMVS
PDFBibTeX XMLCite
Full Text: DOI arXiv Euclid

References:

[1] Abadie, A. (2005). Semiparametric difference-in-differences estimators. The Review of Economic Studies72, 1-19. · Zbl 1112.62132
[2] Abadie, A. and J. Gardeazabal (2003). The economic costs of conflict: A case study of the basque country. American Economics Review105, 113-132.
[3] Atay-Kayis, A. and H. Massam (2005). A Monte Carlo method for computing the marginal likelihood in nondecomposable Gaussian graphical models. Biometrika92, 317-335. · Zbl 1094.62028
[4] Barber, R. F. and M. Drton (2015). High-dimensional Ising model selection with Bayesian information criteria. Electronic Journal of Statistics9, 249-275. · Zbl 1309.62050
[5] Bojinov, I. and N. Shephard (2017). Time series experiments and causal estimands: exact randomization tests and trading. Arxiv:1706.07840v2. · Zbl 1428.62385
[6] Bonhomme, S. and U. Sauder (2011). Recovering distributions in difference-in-differences models: A comparison of selective and comprehensive schooling. The Review of Economics and Statistics93(2), 479-494.
[7] Brodersen, K. H., F. Gaullusser, J. Koehler, N. Remy, and S. L. Scott (2015). Inferring causal impact using Bayesian structural time-series models. The Annals of Applied Statistics9, 247-274. · Zbl 1454.62473
[8] de Jong, P. (1991). The diffuse Kalman filter. The Annals of Statistics19, 1073-1083. · Zbl 0742.62093
[9] de Jong, P. and S. Chu-Chun-Lin (1994). Stationary and non-stationary state space models. Journal of Time Series Analysis15, 151-166. · Zbl 0794.62056
[10] Ding, P. and F. Li (2017). Causal inference: A missing data perspective. Statistical Science (to appear). · Zbl 1397.62125
[11] Donald, S. G. and K. Lang (2007). Inference with difference-in-differences and other panel data. The Review of Economics and Statistics89, 221-233.
[12] Doudchenko, N. and G. W. Imbens (2016). Balancing, regression, difference-in-differences and synthetic control method: A synthesis. NBER Working Paper No. 22791.
[13] Durbin, J. and S. J. Koopman (2002). A simple and efficient simulation smoother for state space time series analysis. Biometrika89, 603-615. · Zbl 1036.62071
[14] Durbin, J. and S. J. Koopman (2012). Time Series Analysis by State Space Methods: Second Edition. Great Clarendon Street, Oxford OX2 6DP: Oxford University Press.
[15] Galindo-Garre, F. and J. K. Vermunt (2006). Avoiding boundary estimates in latent class analysis by Bayesian posterior mode estimation. Behaviormetrika33, 43-59. · Zbl 1102.62024
[16] Gelfand, A. E., A. F. M. Smith, and T.-M. Lee (1992). Bayesian analysis of constrained parameter and truncated data problems using Gibbs sampling. Journal of the American Statistics Association87, 523-532.
[17] George, E. I. and R. E. McCulloch (1993). Variable selection via Gibbs sampling. Journal of the American Statistical Association88(423), 881-889.
[18] Gunn, L. H. and D. B. Dunson (2005). A transformation approach for incorporating monotone or unimodal constraints. Biostatistics6, 434-449. · Zbl 1071.62101
[19] Harvey, A. C. and S. Peters (1990). Estimation procedures for structural time series models. Journal of Forecasting9, 89-108.
[20] Holland, P. W. (1986). Statistics and causal inference. Journal of the American Statistical Association81, 945-960. · Zbl 0607.62001
[21] Imbens, G. W. and D. B. Rubin (2015). Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. New York, NY, USA: Cambridge University Press. · Zbl 1355.62002
[22] Khare, K., B. Rajaratnam, and A. Saha (2015). Bayesian inference for Gaussian graphical models beyond decomposable graphs. Arxiv:1505.00703. · Zbl 1398.62063
[23] Koopman, S. J. (1997). Exact initial Kalman filtering and smoothing for nonstationary time series models. Journal of the American Statistical Association92, 1630-1638. · Zbl 0912.62103
[24] Lauritzen, S. L. (1996). Graphical Models. New York, USA: Oxford University Press Inc. · Zbl 0907.62001
[25] Li, F., T. Zhang, Q. Wang, M. Z. Gonzalez, E. L. Maresh, and J. A. Coan (2015). Spatial Bayesian variable selection and grouping for high-dimensional scalar-on-image regression. The Annals of Applied Statistics9, 687-713. · Zbl 1397.62458
[26] Mitsakakis, N., H. Massam, and M. D. Escobar (2011). A Metropolis-Hastings based method for sampling from the G-Wishart distribution in Gaussian graphical models. Electronic Journal of Statistics5, 18-30. · Zbl 1274.65009
[27] Mohammadi, A. and E. C. Wit (2015). Bayesian structure learning in sparse Gaussian graphical models. Bayesian Analysis10, 109-138. · Zbl 1335.62056
[28] Ning, B., S. Ghosal, and J. Thomas (2019a). Supplement to “Bayesian method for causal inference in spatially-correlated multivariate time series”. Bayesian Analysis. · Zbl 1409.62178
[29] Ning, B., S. Ghosal, and J. Thomas (2019b). R code for “Bayesian method for causal inference in spatially-correlated multivariate time series”. Bayesian Analysis. · Zbl 1409.62178
[30] Ročková, V. and E. I. George (2014). EMVS: The EM approach to Bayesian variable selection. Journal of the American Statistical Association109(506), 828-846. · Zbl 1367.62049
[31] Rosenbaum, P. R. (2007). Interference between units in randomized experiments. Journal of the American Statistical Association102, 191-200. · Zbl 1284.62494
[32] Roverato, A. (2000). Cholesky decomposition of a hyper inverse Wishart matrix. Biometrika87, 99-112. · Zbl 0974.62047
[33] Roverato, A. (2002). Hyper inverse Wishart distribution for non-decomposable graphs and its application to Bayesian inference for Gaussian graphical models. Scandinavian Journal of Statistics29, 391-411. · Zbl 1036.62027
[34] Roy, A., T. S. McElroy, and P. Linton (2016). Constrained estimation of causal invertible VARMA models. Statistica Sinica (to appear). · Zbl 1412.62123
[35] Rubin, D. B. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of Educational Psychology66, 688-701.
[36] Rubin, D. B. (1977). Assignment to treatment group on the basis of a covariate. Journal of Educational Statistics2, 1-26.
[37] Rubin, D. B. (2005). Causal inference using potential outcomes: design, modeling, decisions. Journal of the American Statistical Association100, 322-331. · Zbl 1117.62418
[38] Smith, M. and L. Fahrmeir (2007). Spatial Bayesian variable selection with application to functional magnetic resonance imaging. Journal of the American Statistical Association102, 417-431. · Zbl 1134.62322
[39] Stein, P. (1952). Some general theorems on iterants. Journal of Research of the National Bureau of Standards48, 82-83.
[40] Stuart, E. A. (2010). Matching methods for causal inference: A review and a look forward. Statistical Science25, 1-21. · Zbl 1328.62007
[41] Ueda, N. and R. Nakano (1998). Deterministic annealing EM algorithm. Neural Networks11, 271-282.
[42] Uhler, C., A. Lenkoskiy, and D. Richardsz (2017). Exact formulas for the normalizing constants of Wishart distributions for graphical models. The Annals of Statistics46, 90-118.
[43] Wang, H. and S. Z. Li (2012). Efficient Gaussian graphical model determination under G-Wishart prior distributions. Electronic Journal of Statistics6, 168-198. · Zbl 1335.62069
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.