zbMATH — the first resource for mathematics

Estimating the number of classes. (English) Zbl 1117.62045
Summary: Estimating the unknown number of classes in a population has numerous important applications. In a Poisson mixture model, the problem is reduced to estimating the odds that a class is undetected in a sample. The discontinuity of the odds prevents the existence of locally unbiased and informative estimators and restricts confidence intervals to be one-sided. Confidence intervals for the number of classes are also necessarily one-sided. A sequence of lower bounds to the odds is developed and used to define pseudo maximum likelihood estimators for the number of classes.

62G15 Nonparametric tolerance and confidence regions
62G05 Nonparametric estimation
Full Text: DOI arXiv
[1] Bahadur, R. R. and Savage, L. J. (1956). The nonexistence of certain statistical procedures in nonparametric problems. Ann. Math. Statist. 27 1115–1122. · Zbl 0073.14302
[2] Blumenthal, S., Dahiya, R. C. and Gross, A. J. (1978). Estimating the complete sample size from an incomplete Poisson sample. J. Amer. Statist. Assoc. 73 182–187. JSTOR: · Zbl 0386.62029
[3] Bunge, J. and Fitzpatrick, M. (1993). Estimating the number of species: A review. J. Amer. Statist. Assoc. 88 364–373.
[4] Chao, A. (1984). Nonparametric estimation of the number of classes in a population. Scand. J. Statist. 11 265–270.
[5] Chao, A. (2001). An overview of closed capture–recapture models. J. Agric. Biol. Environ. Stat. 6 158–175.
[6] Chao, A. and Bunge, J. (2002). Estimating the number of species in a stochastic abundance model. Biometrics 58 531–539. JSTOR: · Zbl 1210.62225
[7] Chao, A. and Lee, S.-M. (1992). Estimating the number of classes via sample coverage. J. Amer. Statist. Assoc. 87 210–217. JSTOR: · Zbl 0850.62145
[8] Curto, R. E. and Fialkow, L. A. (1991). Recursiveness, positivity, and truncated moment problems. Houston J. Math. 17 603–635. · Zbl 0757.44006
[9] Darroch, J. N. and Ratcliff, D. (1980). A note on capture–recapture estimation. Biometrics 36 149–153. JSTOR: · Zbl 0437.62100
[10] Donoho, D. L. (1988). One-sided inference about functionals of a density. Ann. Statist. 16 1390–1420. · Zbl 0665.62040
[11] Gong, G. and Samaniego, F. J. (1981). Pseudomaximum likelihood estimation: Theory and applications. Ann. Statist. 9 861–869. · Zbl 0471.62032
[12] Lindsay, B. G. (1983). The geometry of mixture likelihoods: A general theory. Ann. Statist. 11 86–94. · Zbl 0512.62005
[13] Liu, R. C. and Brown, L. D. (1993). Nonexistence of informative unbiased estimators in singular problems. Ann. Statist. 21 1–13. · Zbl 0783.62026
[14] Mao, C. X. (2004). Prediction the conditional probability of discovering a new class. J. Amer. Statist. Assoc. 99 1108–1118. · Zbl 1055.62007
[15] Mao, C. X. and Lindsay, B. G. (2003). Tests and diagnostics for heterogeneity in the species problem. Comput. Statist. Data Anal. 41 389–398. · Zbl 1256.62066
[16] Mao, C. X. and Lindsay, B. G. (2004). Estimating the number of classes in multiple populations: A geometric analysis. Canad. J. Statist. 32 303–314. JSTOR: · Zbl 1061.62054
[17] Pfanzagl, J. (1998). The nonexistence of confidence sets for discontinuous functionals. J. Statist. Plann. Inference 75 9–20. · Zbl 1041.62508
[18] Romanowska, M. (1977). A note on the upper bound for the distance in total variation between the binomial and the Poisson distribution. Statist. Neerlandica 31 127–130. · Zbl 0369.60025
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.