×

Sampling a two dimensional matrix. (English) Zbl 1510.62099

Summary: A new sampling design for populations whose units can be arranged as an \(N\times M\) matrix is proposed. The sample must satisfy some constraints: row and column sample sizes are set in advance. The proposed sampling method gives the same selection probability to all the sample matrices that satisfy the constraints. Three algorithms to select a sample uniformly in the feasible set are presented: an exact algorithm based on the multivariate hypergeometric distribution, an MCMC algorithm, and the cube method. Their performances are evaluated using Monte Carlo simulations. The designs for sampling elements in a given row or a given column are investigated and the single inclusion and joint selection probabilities under the proposed design are evaluated. Several variance estimators are proposed for the Horvitz-Thompson estimator of the population mean of the survey variable \(y\) and their performances are compared in a Monte Carlo study. A numerical example dealing with a creel survey of fishermen found at 9 sites over 36 days is presented.

MSC:

62D05 Sampling theory, sample surveys
62-08 Computational methods for problems pertaining to statistics
PDFBibTeX XMLCite
Full Text: DOI Link

References:

[1] Agresti, A.; Wackerly, D.; Boyett, J. M., Exact conditional tests for cross-classifications: Approximation of attained significance levels, Psychometrika, 44, 75-83 (1979) · Zbl 0419.62046
[2] Besag, J.; Clifford, P., Generalized monte carlo significance tests, Biometrika, 76, 4, 633-642 (1989) · Zbl 0679.62033
[3] Breidt, F. J.; Chauvet, G., Improved variance estimation for balanced samples drawn via the cube method, J. Statist. Plann. Inference, 141, 1, 479-487 (2011) · Zbl 1203.62010
[4] Daigle, G.; Crépeau, H.; Bujold, V.; Legault, M., Enquête de la Pêche Sportive au bar Rayé en Gaspésie en 2015Tech. rep. (2015), Service de consultation statistique de l’Université Laval, Ministère du Développement durable, de l’Environnement, de la Faune et des Parcs du Québec: Service de consultation statistique de l’Université Laval, Ministère du Développement durable, de l’Environnement, de la Faune et des Parcs du Québec Québec
[5] Deville, J.-C.; Tillé, Y., Efficient balanced sampling: the cube method, Biometrika, 91, 4, 893-912 (2004) · Zbl 1064.62015
[6] Deville, J.-C.; Tillé, Y., Variance approximation under balanced sampling, J. Statist. Plann. Inference, 128, 2, 569-591 (2005) · Zbl 1089.62005
[7] Diaconis, P.; Sturmfels, B., Algebraic algorithms for sampling from conditional distributions, Ann. Stat., 26, 1, 363-397 (1998) · Zbl 0952.62088
[8] Gotelli, N. J.; Graves, G. R., Null Models in Ecology (1996), Smithsonian Institution Press, University of Michigan USA
[9] Gotelli, N. J.; Ulrich, W., Statistical challenges in null model analysis, Oikos, 121, 171-180 (2012)
[10] Grafström, A.; Lisic, J., BalancedSampling: Balanced and spatially balanced sampling (2018), R package version 1.5.4. URL http://www.antongrafstrom.se/balancedsampling
[11] Ida, I. O.; Rivest, L.-P.; Daigle, G., Using balanced sampling in creel surveys, Surv. Methodol., 44, 239-252 (2018)
[12] John, P. W., Statistical Design and Analysis of Experiments (1998), SIAM
[13] Juillard, H.; Chauvet, G.; Ruiz-Gazen, A., Estimation under cross-classified sampling with application to a childhood survey, J. Amer. Statist. Assoc., 112, 850-858 (2017)
[14] Miklós, I.; Podani, J., Randomization of presence-absence matrices: comments and new algorithms, Ecology, 85, 1, 86-92 (2004)
[15] Ohlsson, E., Cross-classified sampling, J. Off. Stat., 12, 241-251 (1996)
[16] Oksanen, J.; Blanchet, F. G.; Kindt, R.; Legendre, P.; Minchin, P. R.; O’hara, R.; Simpson, G. L.; Solymos, P.; Stevens, M. H.H.; Wagner, H., Package vegan (2013), Community ecology package, version 2 (9)
[17] Skinner, C. J., Cross-classified sampling: some estimation theory, Statist. Probab. Lett., 104, 163-168 (2015) · Zbl 1396.62014
[18] Strona, G.; Nappo, D.; Boccacci, F.; Fattorini, S.; San-Miguel-Ayanz, J., A fast and unbiased procedure to randomize ecological binary matrices with fixed row and column totals, Nat. Commun., 5, 4114 (2014)
[19] Tillé, Y., Sampling Algorithms (2006), Springer Science & Business Media: Springer Science & Business Media New York · Zbl 1099.62009
[20] Tillé, Y.; Matei, A., Sampling: Survey sampling (2015), R package version 2.7. URL http://CRAN.R-project.org/package=sampling
[21] von Gagern, M.; von Gagern, M.; Schmitz Ornès, A., Problems with bins: A critical reassessment of gotelli and ulrich’s bayes approach using bird data, Acta Oecol., 69, 137-145 (2015)
[22] Vos, J. W.E., Sampling in space and time, Rev. Int. Stat. Inst., 32, 226-241 (1964) · Zbl 0173.46903
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.