×

Mixture models for capture-recapture count data. (English) Zbl 1089.62126

Summary: The contribution investigates the problem of estimating the size of a population, also known as the missing cases problem. Suppose a registration system is targeting to identify all cases having a certain characteristic such as a specific disease (cancer, heart disease,…), disease related conditions (HIV, heroin use,…) or a specific behavior (driving a car without license). Every case in such a registration system has a certain notification history in that it might have been identified several times (at least once) which can be understood as a particular capture-recapture situation. Typically, cases are left out which have never been listed at any occasion, and it is this frequency one wants to estimate.
In this paper, modelling is concentrating on the counting distribution, e.g., the distribution of the variable that counts how often a given case has been identified by the registration system. Besides very simple models like the binomial or Poisson distribution, finite (nonparametric) mixtures of these are considered providing rather flexible modelling tools. Estimation is done using maximum likelihood by means of the EM algorithm. A case study on heroin users in Bangkok in the year 2001 is completing the contribution.

MSC:

62P10 Applications of statistics to biology and medical sciences; meta analysis
62D05 Sampling theory, sample surveys

Software:

REGAL
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Bishop, YMM; Fienberg, SE; Holland, PW, Discrete Multivariate Analysis: Theory and Practice (1975), Cambridge: MIT Press, Cambridge · Zbl 0332.62039
[2] Blumenthal, S.; Dahiya, R.; Gross, A., Estimating complete sample-size from an incomplete Poisson sample, Journal of the American Statistical Association, 73, 182-187 (1978) · Zbl 0386.62029 · doi:10.2307/2286542
[3] Böhning Suppawattanabodee, B.; Kusolvisitkul, W.; Viwatwongkasem, C., Estimating the number of drug users in Bangkok 2001: A capture-recapture approach using repeated entries in one list, European Journal of Epidemiology, 19, 1075-1083 (2004) · doi:10.1007/s10654-004-3006-8
[4] Böhning, D., Computer-assisted analysis of mixtures and applications. Meta-analysis, disease mapping and others (2000), Boca Raton: Chapman & Hall/CRC, Boca Raton · Zbl 0951.62088
[5] Chao, A., An overview of closed capture-recapture models, Journal of Agricultural, Biological, and Environmental Statistics, 6, 158-175 (2001) · doi:10.1198/108571101750524670
[6] Chao A (1998) Capture-recapture. In: Armitage P, Colton T (eds) Encyclopedia of biostatistics, vol. 1. Wiley, pp 482-486
[7] Chao, A., Estimating population size for sparse data in capture-recapture experiments, Biometrics, 45, 427-438 (1989) · Zbl 0715.62285 · doi:10.2307/2531487
[8] Chao, A.; Bunge, J., Estimating the number of species in a stochastic abundance model, Biometrics, 58, 531-539 (2002) · Zbl 1210.62225 · doi:10.1111/j.0006-341X.2002.00531.x
[9] Comiskey, CM; Barry, JM, A capture-recapture study of the prevalence and implications of opiate use in Dublin, European Journal of Public Health, 11, 198-200 (2001) · doi:10.1093/eurpub/11.2.198
[10] Cormack, RM, Interval estimation for mark-recapture studies of closed populations, Biometrics, 48, 567-576 (1992) · doi:10.2307/2532310
[11] Dorazio, RM; Royle, JA, Mixture models for estimating the size of a closed population when capture rates vary among individuals, Biometrics, 59, 351-364 (2003) · Zbl 1210.62226 · doi:10.1111/1541-0420.00042
[12] Dempster, AP; Laird, NM; Rubin, DB, Maximum likelihood estimation from incomplete data via the EM algorithm (with discussion), Journal of the Royal Statistical Society B, 39, 1-38 (1977) · Zbl 0364.62022
[13] Dietz, E.; Böhning, D., On estimation of the Poisson parameter in zero-modified Poisson models, Computational Statistics & Data Analysis, 34, 441-459 (2000) · Zbl 1046.62085 · doi:10.1016/S0167-9473(99)00111-5
[14] Hook, EB; Regal, R., Capture-recapture methods in epidemiology: methods and limitations, Epidemiologic Reviews, 17, 243-264 (1995)
[15] International Working Group for Disease Monitoring and Forecasting, Capture-recapture and multiple record systems estimation I: history and theoretical development, American Journal of Epidemiology, 142, 1047-1058 (1995)
[16] International Working Group for Disease Monitoring and Forecasting, Capture-recapture and multiple record systems estimation II: Applications in human diseases, American Journal of Epidemiology, 142, 1059-1068 (1995)
[17] LaPorte, RE; McCarty, DJ; Tull, ES; Tajima, N., Counting birds, bees, and NCDs, Lancet, 339, 494-495 (1992) · doi:10.1016/0140-6736(92)91103-F
[18] Laird, NM, Nonparametric maximum likelihood estimation of a mixing distribution, Journal of the American Statistical Association, 73, 805-811 (1978) · Zbl 0391.62029 · doi:10.2307/2286284
[19] Lindsay, BG; Roeder, K., A unified treatment of integer parameter models, Journal of the American Statistical Association, 82, 758-764 (1987) · Zbl 0633.62026 · doi:10.2307/2288784
[20] Mao, CX; Lindsay, BG, Tests and diagnostics for heterogeneity in the species problem, Computational Statistics and Data Analysis, 41, 389-398 (2003) · Zbl 1256.62066 · doi:10.1016/S0167-9473(02)00164-0
[21] McKendrick, AG, Application of mathematics to medical problems, Proceedings of the Edinburgh Mathematical Society, 44, 98-130 (1926) · JFM 52.0542.04 · doi:10.1017/S0013091500034428
[22] McLachlan, G.; Krishnan, T., The EM algorithm and extensions (1997), New York: Wiley, New York · Zbl 0882.62012
[23] McLachlan, G.; Peel, D., Finite mixture models (2000), New York: Wiley, New York · Zbl 0963.62061 · doi:10.1002/0471721182
[24] Meng, X-L, The EM algorithm and medical studies: a historical link, Statistical Methods in Medical Research, 6, 3-23 (1997) · doi:10.1191/096228097675663031
[25] Miloslavsky, M.; van der Laan, MJ, Fitting of mixtures with unspecified number of components using cross validation distance estimate, Computational Statistics and Data Analysis, 41, 413-428 (2003) · Zbl 1256.62015 · doi:10.1016/S0167-9473(02)00166-4
[26] Nannan DJ, White F (1997) Capture-recapture: Reconnaissance of a demographic technique in epidemiology. Health Canada 18(4)
[27] Norris, JL III; Pollock, KH, Non-parametric MLE for Poisson species abundance models allowing for heterogeneity between species, Environmental and Ecological Statistics, 5, 391-402 (1998) · doi:10.1023/A:1009659922745
[28] Norris, JL III; Pollock, KH, Nonparametric MLE under two closed capture-recapture models with heterogeneity, Biometrics, 52, 639-649 (1996) · Zbl 0875.62536 · doi:10.2307/2532902
[29] Pledger, S., Unified maximum likelihood estimates for closed capture-recapture models using mixtures, Biometrics, 56, 434-442 (2000) · Zbl 1060.62652 · doi:10.1111/j.0006-341X.2000.00434.x
[30] Sanathanan, L., Estimating the size of a multinomial population, Annals of Mathematical Statistics, 42, 58-69 (1972)
[31] Sanathanan, L., Estimating the size of a truncated sample, Journal of the American Statistical Association, 72, 669-672 (1977) · Zbl 0444.62037 · doi:10.2307/2286238
[32] Schouten, LJ; Straatmann, H.; Kiemeney, LA; Gimbrere, CH; Verbeek, AL, The capture-recapture method for estimation of cancer registry completeness: a useful tool?, International Journal of Epidemiology, 23, 1111-1116 (1994)
[33] Scollnik, D., Inference concerning the size of the zero class from an incomplete Poisson sample, Communication in Statistics-Theory and Methods, 26, 221-236a (1997) · Zbl 0900.62154
[34] Sekar, C.; Deming, WE, On a method of estimating birth and death rates and the extent of registration, JASA, 44, 101-115 (1949) · Zbl 0033.29402
[35] Tilling, K., Capture-recapture methods-useful or misleading?, International Journal of Epidemiology, 30, 12-14 (2001) · doi:10.1093/ije/30.1.12
[36] van der Heijden, PGM; Bustami, R.; Cruyff, M.; Engbersen, G.; van Houwelingen, HC, Point and interval estimation of the population size using the truncated Poisson regression model, Statistical Modelling-An International Journal, 3, 305-322 (2003) · Zbl 1111.62010 · doi:10.1191/1471082X03st057oa
[37] van der Heijden, PGM; Cruyff, M.; van Houwelingen, H. C., Estimating the size of a criminal population from police records using the truncated Poisson regression model, Statistica Neerlandica, 57, 1-16 (2003) · doi:10.1111/1467-9574.00232
[38] Wilson, RM; Collins, MF, Capture-recapture estimation with samples of size one using frequency data, Biometrika, 79, 543-553 (1992) · Zbl 0775.62315 · doi:10.2307/2336785
[39] Wittes, JT; Sidel, VW, A generalization of the simple capture-recapture model with applications to epidemiological research, Journal of Chronic Diseases, 21, 287-301 (1968) · doi:10.1016/0021-9681(68)90038-6
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.