zbMATH — the first resource for mathematics

Oracle inequality for conditional density estimation and an actuarial example. (English) Zbl 1440.62118
Summary: Conditional density estimation in a parametric regression setting, where the problem is to estimate a parametric density of the response given the predictor, is a classical and prominent topic in regression analysis. This article explores this problem in a nonparametric setting where no assumption about shape of an underlying conditional density is made. For the first time in the literature, it is proved that there exists a nonparametric data-driven estimator that matches performance of an oracle which: (i) knows the underlying conditional density, (ii) adapts to an unknown design of predictors, (iii) performs a dimension reduction if the response does not depend on the predictor, (iv) is minimax over a vast set of anisotropic bivariate function classes. All these results are established via an oracle inequality which is on par with ones known in the univariate density estimation literature. Further, the asymptotically optimal estimator is tested on an interesting actuarial example which explores a relationship between credit scoring and premium for basic auto-insurance for 54 undergraduate college students.

62G07 Density estimation
62G08 Nonparametric regression and quantile regression
62G20 Asymptotic properties of nonparametric inference
62P05 Applications of statistics to actuarial sciences and financial mathematics
Full Text: DOI
[1] Abramovich F., Sapatinis T. (1999) Bayesian approach to wavelet decomposition and shrinkage. Springer, New York
[2] Arnold B.C., Castillo E., Sarabia J.M. (1999) Conditional specification of statistical models. Springer, New York · Zbl 0932.62001
[3] Efromovich S. (1985) Nonparametric estimation of a density with unknown smoothness. Theory of Probability and its Applications 30: 557–568 · Zbl 0593.62034
[4] Efromovich S. (1999) Nonparametric curve estimation: methods, theory and applications. Springer, New York · Zbl 0935.62039
[5] Efromovich S. (2000) On sharp adaptive estimation of multivariate curves. Mathematical Methods of Statistics 9: 117–139 · Zbl 1006.62033
[6] Efromovich S. (2005) Estimation of the density of regression errors. The Annals of Statistics 33: 2194–2227 · Zbl 1086.62053
[7] Efromovich S. (2007) Conditional density estimation in a regression setting. The Annals of Statistics 33: 2504–2535 · Zbl 1129.62025
[8] Eubank R.L. (1999) Nonparametric regression and spline smoothing. Marcel Dekker, New York · Zbl 0936.62044
[9] Fan J. (1992) Design-adaptive nonparametric regression. The Journal of American Statistical Association 87: 998–1004 · Zbl 0850.62354
[10] Fan J., Gijbels I. (1996) Local polynomial modeling and its applications. Chapman and Hall, New York · Zbl 0873.62037
[11] Koul H., Sakhanenko L. (2005) Goodness-of-fit testing in regression: A finite sample comparison of bootstrap methodology and Khmaladze transformation. Statistics and Probability Letters 74: 290–302 · Zbl 1070.62030
[12] Neter J., Kutner M., Nachtsheim C., Wasserman W. (1996) Applied linear models, 4th ed. McGraw-Hill, Boston
[13] Prakasa Rao B.L.S. (1983) Nonparametric functional estimation. Academic Press, New York · Zbl 0542.62025
[14] Samarov A.M. (1992) Lower bound for the integral risk of density function estimates. Topics in Nonparametric Estimation 8: 1–7 · Zbl 0773.62031
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.