×

zbMATH — the first resource for mathematics

Robust Wald-type methods for testing equality between two populations regression parameters: a comparative study under the logistic model. (English) Zbl 07135542
Summary: Comparing the regression parameters between two populations is useful to understand the homogeneity of the process underlying the data. The problem of logistic regression is considered when the practitioner handles data from two populations and when the goal is to test the hypothesis that some regression parameters are equal in both populations. A classical testing procedure is to construct a Wald-type test from the maximum likelihood estimators obtained from each data set. However, as in the one-population setting, the presence of outliers in any of the two samples may distort both the level and/or the power of this procedure. Instead of the maximum likelihood procedure, reliable statistics are built using a class of robust estimators which bound large values of the deviance as well as the effect of high leverage points. The asymptotic behaviour of this family of test statistics is derived under the null and contiguous alternatives. Besides, the robustness of the tests is investigated through the influence function. A simulation study allows to compare, under different contamination schemes, the behaviour of the tests based on the maximum likelihood estimators and on their robust counterparts. The numerical study shows that the Wald tests based on the maximum likelihood estimators or on the unweighted robust ones break down when atypical data arise in the samples, while both the level and power of the Wald-type tests based on redescending weighted \(M\)-estimators are stable against the considered contaminations. The analysis of a real data set enables to investigate the \(p\)-value sensitivity to the presence of outliers.
MSC:
62 Statistics
PDF BibTeX XML Cite
Full Text: DOI
References:
[1] Basu, A.; Gosh, A.; Mandal, A.; Martin, N.; Pardo, L., A wald-type test statistic for testing linear hypothesis in logistic regression models based on minimum density power divergence estimator, Electron. J. Statist., 11, 2741-2772 (2017) · Zbl 1366.62052
[2] Basu, A.; Harris, I. R.; Hjort, N. L.; Jones, M. C., Robust and efficient estimation by minimizing a density power divergence, Biometrika, 85, 549-559 (1998) · Zbl 0926.62021
[3] Bianco, A.; Boente, G.; Rodrigues, I. M., Robust test in generalized linear models with missing responses, Comput. Statist. Data Anal., 65, 80-97 (2013) · Zbl 06958966
[4] Bianco, A.; Martínez, E., Robust testing in the logistic regression model, Comput. Statist. Data Anal., 53, 4095-4105 (2009) · Zbl 1454.62016
[5] Bianco, A.; Yohai, V., (Robust Estimation in the Logistic Regression Model. Robust Estimation in the Logistic Regression Model, Lecture Notes in Statistics, vol. 109 (1996), Springer-Verlag: Springer-Verlag New York), 17-34 · Zbl 0839.62030
[6] Bondell, H. D., Minimum distance estimation for the logistic regression model, Biometrika, 92, 724-731 (2005) · Zbl 1152.62324
[7] Bondell, H. D., A characteristic function approach to the biased sampling model, with application to robust logistic regression, J. Statist. Plann. Inference, 138, 742-755 (2008) · Zbl 1133.62020
[8] Cantoni, E.; Ronchetti, E., Robust inference for generalized linear models, J. Amer. Statist. Assoc., 96, 1022-1030 (2001) · Zbl 1072.62610
[9] Carroll, R. J.; Pederson, S., On robustness in the logistic regression model, J. Roy. Statist. Soc., Ser. B, 55, 693-706 (1993) · Zbl 0794.62021
[10] Croux, C.; Haesbroeck, G., Implementing the bianco and yohai estimator for logistic regression, Comput. Statist. Data Anal., 44, 273-295 (2003) · Zbl 1429.62317
[11] Gervini, D., Robust adaptive estimators for binary regression models, J. Statist. Plann. Inference, 131, 297-311 (2005) · Zbl 1061.62036
[12] Hampel, F. R.; Ronchetti, E. M.; Rousseeuw, P. J.; Stahel, W. A., Robust Statistics: The Approach Based on Influence Functions (1986), Wiley: Wiley New York · Zbl 0593.62027
[13] Hauck, W. W.; Donner, A., Wald’s test as applied to hypotheses in logit analysis, J. Amer. Statist. Assoc., 72, 851-853 (1977) · Zbl 0375.62022
[14] Heritier, S.; Cantoni, E.; Copt, S.; Victoria-Feser, M. P., Robust Methods in Biostatistics (2009), Wiley: Wiley New York · Zbl 1163.62085
[15] Hobza, T.; Pardo, L.; Vajda, I., Robust median estimator in logistic regression, J. Statist. Plann. Inference, 138, 3822-3840 (2008) · Zbl 1146.62015
[16] Künsch, H.; Stefanski, L.; Carroll, R., Conditionally unbiased bounded influence estimation in general regression models with applications to generalized linear models, J. Amer. Statist. Assoc., 84, 460-466 (1989) · Zbl 0679.62024
[17] Maronna, R.; Martin, R.; Yohai, V.; Salibián-Barrera, M., Robust Statistics: Theory and Methods (with R) (2019), Wiley: Wiley New York · Zbl 1409.62009
[18] Pires, A. M.; Branco, J., Partial influence functions, J. Multivariate Anal., 83, 451-468 (2002) · Zbl 1030.62024
[19] Rousseeuw, P.; Christmann, A., Measuring overlap in logistic regression, Comput. Statist. Data Anal., 37, 65-75 (2001) · Zbl 1051.62065
[20] Rousseeuw, P.; Christmann, A., Robustness against separation and outliers in logistic regression, Comput. Statist. Data Anal., 43, 315-332 (2003) · Zbl 1429.62325
[21] Wilcox, R., Introduction to Robust Estimation and Hypothesis Testing (2017), Academic Press · Zbl 1362.62003
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.