×

zbMATH — the first resource for mathematics

Spatial models for point and areal data using Markov random fields on a fine grid. (English) Zbl 1337.62302
Summary: I consider the use of Markov random fields (MRFs) on a fine grid to represent latent spatial processes when modeling point-level and areal data, including situations with spatial misalignment. Point observations are related to the grid cell in which they reside, while areal observations are related to the (approximate) integral over the latent process within the area of interest. I review several approaches to specifying the neighborhood structure for constructing the MRF precision matrix, presenting results comparing these MRF representations analytically, in simulations, and in two examples. The results provide practical guidance for choosing a spatial process representation and highlight the importance of this choice. In particular, the results demonstrate that, and explain why, standard CAR models can behave strangely for point-level data. They show that various neighborhood weighting approaches based on higher-order neighbors that have been suggested for MRF models do not produce smooth fields, which raises doubts about their utility. Finally, they indicate that an MRF that approximates a thin plate spline compares favorably to standard CAR models and to kriging under many circumstances.

MSC:
62M30 Inference from spatial processes
62M40 Random fields; image analysis
60G15 Gaussian processes
62M10 Time series, auto-correlation, regression, etc. in statistics (GARCH)
62P12 Applications of statistics to environmental and related topics
Software:
GMRFLib; spectralGP
PDF BibTeX XML Cite
Full Text: DOI Euclid
References:
[1] Banerjee, S., Finley, A.O., Waldmann, P., and Ericsson, T., Hierarchical spatial process models for multiple traits in large genetic trials. Journal of the American Statistical Association , 105(490):506-521, 2010. · Zbl 1392.62316
[2] Banerjee, S., Gelfand, A.E., Finley, A.O., and Sang, H., Gaussian predictive process models for large spatial data sets. Journal of the Royal Statistical Society, Series B , 70(4):825-848, 2008. · Zbl 05563371
[3] Banerjee, S., Gelfand, A.E., and Sirmans, C.F., Directional rates of change under spatial process models. Journal of the American Statistical Association , 98(464):946-954, 2003. · Zbl 1045.62096
[4] Besag, J. and Mondal, D., First-order intrinsic autoregressions and the de Wijs process. Biometrika , 92(4):909-920, 2005. · Zbl 1151.62068
[5] Breslow, N.E. and Clayton, D.G., Approximate inference in generalized linear mixed models. Journal of the American Statistical Association , 88:9-25, 1993. · Zbl 0775.62195
[6] Christensen, O.F., Roberts, G.O., and Sköld, M., Robust Markov chain Monte Carlo methods for spatial generalized linear mixed models. Journal of Computational and Graphical Statistics , 15:1-17, 2006.
[7] Christensen, Ole F. and Waagepetersen, Rasmus, Bayesian prediction of spatial count data using generalized linear mixed models. Biometrics , 58:280-286, 2002. · Zbl 1209.62156
[8] Diggle, P.J., Menezes, R., and Su, T.-L., Geostatistical inference under preferential sampling. Journal of the Royal Statistical Society, Series C , 59:191-232, 2010.
[9] Fuentes, M. and Raftery, A.E., Model evaluation and spatial interpolation by Bayesian combination of observations with outputs from numerical models. Biometrics , 61(1):36-45, 2005. · Zbl 1077.62124
[10] Furrer, R., Genton, M.G., and Nychka, D., Covariance tapering for interpolation of large spatial datasets. Journal of Computational and Graphical Statistics , 15:502-523, 2006.
[11] Gamerman, D., Sampling from the posterior distribution in generalized linear mixed models. Statistics and Computing , 7:57-68, 1997.
[12] Hrafnkelsson, B. and Cressie, N., Hierarchical modeling of count data with application to nuclear fall-out. Environmental and Ecological Statistics , 10(2):179-200, 2003.
[13] Hund, L., Chen, J.T., Krieger, N., and Coull, B.A., A geostatistical approach to large-scale disease mapping with temporal misalignment. Biometrics , 68:849-858, 2012. · Zbl 1270.62140
[14] Kammann, E.E. and Wand, M.P., Geoadditive models. Journal of the Royal Statistical Society, Series C , 52:1-18, 2003. · Zbl 1111.62346
[15] Kaufman, C., Schervish, M., and Nychka, D., Covariance tapering for likelihood-based estimation in large spatial datasets. Journal of the American Statistical Association , 103:1556-1569, 2008. · Zbl 1286.62072
[16] Kelsall, J. and Wakefield, J., Modeling spatial variation in disease risk. Journal of the American Statistical Association , 97(459):692-701, 2002. · Zbl 1073.62580
[17] Krieger, N., Chen, J.T., Waterman, P.D., Rehkopf, D.H., Yin, R., and Coull, B.A., Race/ethnicity and changing US socioeconomic gradients in breast cancer incidence: California and Massachusetts, 1978-2002 (United States). Cancer Causes and Control , 17(2):217-226, 2006.
[18] Lindgren, F., Rue, H., and Lindström, J., An explicit link between Gaussian fields and Gaussian Markov random fields: the stochastic partial differential equation approach. Journal of the Royal Statistical Society, Series B , 73:423-498, 2011. · Zbl 1274.62360
[19] Paciorek, C.J., Bayesian smoothing with Gaussian processes using Fourier basis functions in the spectralGP package. Journal of Statistical Software , 19:2, 2007.
[20] Paciorek, C.J., Supplement to “Spatial models for point and areal data using Markov random fields on a fine grid.”, 2013. DOI: 10.1214/13-EJS791SUPP. · Zbl 1337.62302
[21] Paciorek, C.J. and Liu, Y., Assessment and statistical modeling of the relationship between remotely-sensed aerosol optical depth and \(\mathrm{PM}_{2.5}\). Technical Report 167, Health Effects Institute Research Report (peer-reviewed), 2012.
[22] Paciorek, C.J., Yanosky, J.D., Puett, R.C., Laden, F., and Suh, H.H., Practical large-scale spatio-temporal modeling of particulate matter concentrations. Annals of Applied Statistics , 3:369-396, 2009. · Zbl 1160.62093
[23] Pettitt, A.N., Weir, I.S., and Hart, A.G., A conditional autoregressive Gaussian process for irregularly spaced multivariate data with application to modelling large sets of binary data. Statistics and Computing , 12(4):353-367, 2002.
[24] Rue, H. and Held, L., Gaussian Markov Random Fields: Theory and Applications . Chapman & Hall, Boca Raton, 2005. · Zbl 1093.60003
[25] Rue, H., Martino, S., and Chopin, N., Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. Journal of the Royal Statistical Society, Series B , 71(2):319-392, 2009. · Zbl 1248.62156
[26] Sang, H. and Huang, J.Z., A full scale approximation of covariance functions for large spatial data sets. Journal of the Royal Statistical Society, Series B , 74:111-132, 2012.
[27] Silverman, B.W., Spline smoothing: the equivalent variable kernel method. Annals of Statistics , 12(3):898-916, 1984. · Zbl 0547.62024
[28] Sollich, P. and Williams, C.K.I., Using the equivalent kernel to understand Gaussian process regression. In Advances in Neural Information Processing Systems 17 , pages 1313-1320. MIT Press, 2005. · Zbl 1133.68410
[29] Song, H.R., Fuentes, M., and Ghosh, S., A comparative study of Gaussian geostatistical models and Gaussian Markov random field models. Journal of Multivariate Analysis , 99(8):1681-1697, 2008. · Zbl 1142.86309
[30] Stein, M.L., Interpolation of Spatial Data: Some Theory for Kriging . Springer, N.Y., 1999. · Zbl 0924.62100
[31] Stein, M.L., Chi, Z., and Welty, L.J., Approximating likelihoods for large spatial data sets. Journal of the Royal Statistical Society, Series B , 66(2):275-296, 2004. · Zbl 1062.62094
[32] Stein, M.L. and Fang, D., Discussion of ozone exposure and population density in Harris County, Texas, by R.J. Carroll et al. Journal of the American Statistical Association , 92:408-411, 1997.
[33] Wall, M.M., A close look at the spatial structure implied by the CAR and SAR models. Journal of Statistical Planning and Inference , 121(2):311-324, 2004. · Zbl 1036.62097
[34] White, G. and Ghosh, S.K., A stochastic neighborhood conditional autoregressive model for spatial data. Computational Statistics and Data Analysis , 53(8):3033-3046, 2009. · Zbl 1453.62242
[35] Wikle, C.K., Spatial modeling of count data: A case study in modelling breeding bird survey data on large spatial domains. In A.B. Lawson and D.G.T. Denison, editors, Spatial Cluster Modelling , pages 199-209. Chapman & Hall, 2002.
[36] Wolfinger, R. and O’Connell, M., Generalized linear mixed models: A pseudo-likelihood approach. Journal of Statistical Computation and Simulation , 48:233-243, 1993. · Zbl 0833.62067
[37] Yue, Y. and Speckman, P.L., Nonstationary spatial Gaussian Markov random fields. Journal of Computational and Graphical Statistics , 19:96-116, 2010.
[38] Zhu, J., Huang, H.C., and Reyes, P.E., On selection of spatial linear models for lattice data. Journal of the Royal Statistical Society, Series B , 72(3):389-402, 2010.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.