×

Bayesian predictive inference of a finite population proportion under selection bias. (English) Zbl 1365.62097

Summary: We show how to infer about a finite population proportion using data from a possibly biased sample. In the absence of any selection bias or survey weights, a simple ignorable selection model, which assumes that the binary responses are independent and identically distributed Bernoulli random variables, is not unreasonable. However, this ignorable selection model is inappropriate when there is a selection bias in the sample. We assume that the survey weights (or their reciprocals which we call ‘selection’ probabilities) are available, but there is no simple relation between the binary responses and the selection probabilities. To capture the selection bias, we assume that there is some correlation between the binary responses and the selection probabilities (e.g., there may be a somewhat higher/lower proportion of positive responses among the sampled units than among the nonsampled units). We use a Bayesian nonignorable selection model to accommodate the selection mechanism. We use Markov chain Monte Carlo methods to fit the nonignorable selection model. We illustrate our method using numerical examples obtained from NHIS 1995 data.

MSC:

62F15 Bayesian inference
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Chambers, R.; Dorfman, A.; Wang, S., Limited information likelihood analysis of survey data, Journal of the Royal Statistical Society. Series B (Statistical Methodology), 60, 397-411 (1998) · Zbl 0918.62006
[2] Chen, Q.; Elliott, M. R.; Little, R. J.A., Bayesian penalized spline model-based inference for finite population proportion in unequal probability sampling, Survey Methodology, 36, 23-24 (2010)
[3] Chen, M.-H.; Shao, Q.-M., Monte Carlo estimation of Bayesian credible and HPD intervals, Journal of Computational and Graphical Statistics, 8, 69-92 (1999)
[4] Gelfand, A. E.; Smith, A. F.M., Sampling-based approaches to calculating marginal densities, Journal of the American Statistical Association, 85, 398-409 (1990) · Zbl 0702.62020
[5] Gelman, A., Prior distribution for variance parameters in hierarchical models, Bayesian Analysis, 1, 515-533 (2006) · Zbl 1331.62139
[6] Gilks, W.; Wild, P., Adaptive rejection sampling for Gibbs sampling, Applied Statistics, 41, 2, 337-338 (1992) · Zbl 0825.62407
[7] Krieger, A.; Pfeffermann, D., Maximum likelihood estimation from complex sample surveys, Survey Methodology, 18, 225-239 (1992)
[8] Malec, D.; Davis, W. W.; Cao, X., Model-based small area estimates of overweight prevalence using sample selection adjustment, Statistics in Medicine, 18, 3189-3200 (1999)
[9] Nandram, B., Bayesian predictive inference under informative sampling via surrogate samples, (Upadhyay, S. K.; Singh, Umesh; Dey, Dipak K., Bayesian Statistics and Its Applications (2007), Anamaya: Anamaya New Delhi), 356-374, (Chapter 25)
[10] Nandram, B.; Bai, Y.; Choi, J. W., Hierarchical Bayesian models for assessing possible changes in prevalence of activity limitation, Advances and Applications in Statistical Sciences, 6, 5, 285-311 (2011)
[11] Nandram, B.; Chen, M.-H., Reparameterizing the generalized linear model to accelerate Gibbs sampler convergence, Journal of Statistical Computation and Simulation, 54, 129-144 (1996) · Zbl 0925.62309
[12] Nandram, B.; Choi, J. W., A Bayesian analysis of body mass index data from small domains under nonignorable nonresponse and selection, Journal of the American Statistical Association, 105, 120-135 (2010) · Zbl 1397.62463
[13] Nandram, B.; Choi, J. W.; Shen, G.; Burgos, C., Bayesian predictive inference under informative sampling and transformation, Applied Stochastic Models in Business and Industry, 22, 559-572 (2006) · Zbl 1164.62323
[14] Natarajan, R.; Kass, R. E., Reference Bayesian methods for generalized linear mixed models, Journal of the American Statistical Association, 95, 227-237 (2000) · Zbl 0996.62021
[15] Opsomer, J. D.; Glaeskens, G.; Ranalli, M. G.; Kauermann, G.; Breidt, F. J., Non-parametric small area estimation using penalized spline regression, Journal of the Royal Statistical Society. Series B (Statistical Methodology), 70, 265-286 (2008) · Zbl 05563354
[16] Patil, G. P.; Rao, C. R., Weighted distributions and size-biased sampling with applications to wildlife populations and human families, Biometrics, 34, 179-189 (1978) · Zbl 0384.62014
[17] Pfeffermann, D.; Krieger, A. M.; Rinott, Y., Parametric distributions of complex survey data under informative probability sampling, Statistica Sinica, 8, 1087-1114 (1998) · Zbl 0923.62019
[18] Pfeffermann, D.; Skinner, C. J.; Holmes, D. J.; Goldstein, H.; Rasbash, J., Weighting for unequal selection probabilities in multilevel models, Journal of the Royal Statistical Society. Series B (Statistical Methodology), 60, 23-40 (1998), (with discussion) · Zbl 0909.62006
[19] Pfeffermann, D.; Sverchkov, M., Parametric and semi-parametric estimation of regression models fitted to survey data, Sankhyā Series B, 61, 1-21 (1999) · Zbl 0985.62013
[20] Pfeffermann, D.; Sverchkov, M., Small-area estimation under informative probability sampling of areas and within selected areas, Journal of the American Statistical Association, 102, 1427-1439 (2007) · Zbl 1333.62023
[21] Sverchkov, M.; Pfeffermann, D., Prediction of finite population totals based on the sample distribution, Survey Methodology, 30, 79-92 (2004)
[22] Torabi, M.; Rao, J. N.K., Mean squared error estimators of small area means using survey weights, Canadian Journal of Statistics, 38, 598-608 (2010) · Zbl 1349.62032
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.