×

Missing link survival analysis with applications to available pandemic data. (English) Zbl 07476341

Summary: It is shown how to overcome a new missing data problem in survival analysis. Iterative nonparametric techniques are utilized and the missing data information is both estimated and used for further estimation in each iterative step. Theory is developed and a good finite sample performance is illustrated by simulations. The main motivation is an application to French data on the temporal development of the number of hospitalized Covid-19 patients.

MSC:

62-XX Statistics
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Allassonniere, S.; Chevallier, J., A new class of stochastic EM algorithms. Escaping local maxima and handling intractable sampling, Comput. Stat. Data Anal., 159, Article 107159 pp. (2021) · Zbl 1510.62014
[2] Andersen, P.; Borgan, O.; Gill, R.; Keiding, N., Statistical Models Based on Counting Processes (1993), Springer: Springer New York · Zbl 0769.62061
[3] Bingham, N. H.; Pitts, S. M., Nonparametric estimation for the M/G/∞ queue, Ann. Inst. Stat. Math., 51, 71-97 (1999) · Zbl 0951.62024
[4] Blanche, P.; Proust-Lima, C.; Loubere, L.; Berr, C.; Dartigues, J. F.; Jacqmin-Gadda, H., Quantifying and comparing dynamic predictive accuracy of joint models for longitudinal marker and time-to-event in presence of censoring and competing risks, Biometrics, 71, 102-113 (2015) · Zbl 1419.62313
[5] Brookmeyer, R., AIDS, epidemics, and statistics, Biometrics, 52, 781-796 (1996) · Zbl 0875.62542
[6] Brookmeyer, R.; Gail, M. H., A method for obtaining short-term projections and lower bounds on the size of the AIDS epidemic, J. Am. Stat. Assoc., 83, 301-308 (1988) · Zbl 0644.62108
[7] Deimling, K., Nonlinear Functional Analysis (1985), Springer: Springer New York · Zbl 0559.47040
[8] Farebrother, R. W., Estimation with aggregated data, J. Econom., 10, 43-55 (1979) · Zbl 0405.62049
[9] Ferrer, L.; Putter, H.; Proust-Lima, C., Individual dynamic predictions using landmarking and joint modelling: validation of estimators and robustness assessment, Stat. Methods Med. Res., 28, 3649-3666 (2019)
[10] Gámiz, M. L.; Kulasekera, K. B.; Limnios, N.; Lindqvist, B. H., Applied Nonparametric Statistics in Reliability, Springer Series in Reliability Engineering. (2011), Springer-Verlag: Springer-Verlag London · Zbl 1292.62006
[11] Gámiz, M. L.; Mammen, E.; Martínez-Miranda, M. D.; Nielsen, J. P., Double one-sided cross-validation of local linear hazards, J. R. Stat. Soc. B, 78, 755-779 (2016) · Zbl 1414.62329
[12] Gámiz, M.L., Mammen, E., Martínez-Miranda, M.D., Nielsen, J.P., 2017. DOvalidation: Kernel Hazard Estimation with Best One-Sided and Double One-Sided Cross-Validation. R package version 1.1.0.
[13] Goldenshluger, A., Nonparametric estimation of service time distribution in the M/G/∞ queue and related estimation problems, Adv. Appl. Probab., 48, 1117-1138 (2016) · Zbl 1356.62125
[14] Goldenshluger, A.; Koops, D. T., Nonparametric estimation of service time characteristics in infinite-server queues with stationary Poisson input (2018), Preprint
[15] Heckman, J. J., Sample selection bias as a specification error, Econometrica, 47, 153-161 (1979) · Zbl 0392.62093
[16] Horwitz, L. I.; Jones, S. A.; Cerfolio, R. J.; Francois, F.; Greco, J.; Rudy, B.; Petrilli, C. A., Trends in Covid-19 risk-adjusted mortality rates in a single health system, J. Hosp. Med., 16, 2, 90-92 (2021)
[17] Isaric, International Severe Acute Respiratory and Emerging Infections Consortium, COVID-19 Report: 04 October 2020 (2020)
[18] Jewell, N. P., Some statistical issues in studies of the epidemiology of aids, Stat. Med., 9, 1387-1416 (1990)
[19] Jewell, N. P., Statistics for Epidemiology (2004), Chapman&Hall/CRC
[20] King, G., A Solution to the Ecological Inference Problem. Reconstructing Individual Behavior from Aggregate Data (1997), Princeton University Press
[21] Lawless, J. F.; McLeish, D. L., The information in aggregate data from Markov chains, Biometrika, 71, 419-430 (1984) · Zbl 0567.62072
[22] Liu, Y.; Hu, F., Balancing unobserved covariates with covariate-adaptive randomized experiments, J. Am. Stat. Assoc. (2020)
[23] Mammen, E.; Martínez-Miranda, M. D.; Nielsen, J. P.; Sperlich, S., Do-validation for kernel density estimation, J. Am. Stat. Assoc., 106, 651-660 (2011) · Zbl 05965223
[24] Martinussen, T.; Scheike, T. H., Dynamic Regression Models for Survival Data, Statistics for Biology and Health (2006), Springer: Springer New York · Zbl 1096.62119
[25] Nielsen, J. P.; Tanggaard, C., Boundary and bias correction in kernel hazard estimation, Scand. J. Stat., 28, 675-698 (2001) · Zbl 1010.62093
[26] Pickands, J.; Stine, R. A., Estimation for an M/G/∞ queue with incomplete information, Biometrika, 84, 295-308 (1997) · Zbl 0883.62092
[27] Proust-Lima, C.; Dartigues, J. F.; Jacqmin-Gadda, H., Joint modeling of repeated multivariate cognitive measures and competing risks of dementia and death: a latent process and latent class approach, Stat. Med., 35, 382-398 (2016)
[28] Proust-Lima, C.; Sene, M.; Taylor, M. G.; Jacqmin-Gadda, H., Joint latent class models for longitudinal and time-to-event data: a review, Stat. Methods Med. Res., 23, 70-90 (2014)
[29] Rubin, D. B., Multiple imputation after 18+ years, J. Am. Stat. Assoc., 91, 473-489 (1996) · Zbl 0869.62014
[30] Zhao, J.; Kim, H-J.; Kim, H-M., New EM-type algorithms for the Heckman selection model, Comput. Stat. Data Anal., 146, Article 106930 pp. (2020) · Zbl 1510.62088
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.