zbMATH — the first resource for mathematics

Improved Stein-type shrinkage estimators for the high-dimensional multivariate normal covariance matrix. (English) Zbl 1328.62336
Summary: Many applications require an estimate for the covariance matrix that is non-singular and well-conditioned. As the dimensionality increases, the sample covariance matrix becomes ill-conditioned or even singular. A common approach to estimating the covariance matrix when the dimensionality is large is that of Stein-type shrinkage estimation. A convex combination of the sample covariance matrix and a well-conditioned target matrix is used to estimate the covariance matrix. Recent work in the literature has shown that an optimal combination exists under mean-squared loss, however it must be estimated from the data. In this paper, we introduce a new set of estimators for the optimal convex combination for three commonly used target matrices. A simulation study shows an improvement over those in the literature in cases of extreme high-dimensionality of the data. A data analysis shows the estimators are effective in a discriminant and classification analysis.

62H12 Estimation in multivariate analysis
62J07 Ridge regression; shrinkage estimators (Lasso)
Full Text: DOI
[1] Alon, U.; Barkai, N.; Notterman, D.A.; Gish, K.; Ybarra, S.; Mack, D.; Levine, A.J., Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays, Proc. natl. acad. sci. USA, 96, 12, 6745-6750, (1999)
[2] Bai, Z.D.; Yin, Y.Q., Limit of the smallest eigenvalue of a large-dimensional sample covariance matrix, Ann. probab., 21, 3, 1275-1294, (1993) · Zbl 0779.60026
[3] Bickel, P.J.; Levina, E., Regularized estimation of large covariance matrices, Ann. statist., 36, 1, 199-227, (2008) · Zbl 1132.62040
[4] Chen, Y., Wiesel, A., Hero, A.O., 2009. Shrinkage estimation of high dimensional covariance matrices. In: Acoustics, Speech, and Signal Processing, IEEE International Conference on 0, pp. 2937-2940.
[5] Dempster, A.P., Elements of continuous multivariate analysis, (1969), Addison-Wesley Reading, MA · Zbl 0197.44904
[6] Dettling, M.; Bühlmann, P., Boosting for tumor classification with gene expression data, Bioinformatics, 19, 9, 1061-1069, (2003)
[7] Dudoit, S.; Fridlyand, J.; Speed, T.P., Comparison of discrimination methods for the classification of tumors using gene expression data, J. amer. statist. assoc., 97, 457, 77-87, (2002) · Zbl 1073.62576
[8] Efron, B., Biased versus unbiased estimation, Advances in math., 16, 259-277, (1975) · Zbl 0306.62010
[9] Efron, B.; Morris, C., Data analysis using stein’s estimator and its generalizations, J. amer. statist. assoc., 70, 350, 311-319, (1975) · Zbl 0319.62018
[10] Efron, B.; Morris, C., Stein’s paradox in statistics, Sci. am., 236, 5, 119-127, (1977)
[11] Fan, J.; Fan, Y.; Lv, J., High dimensional covariance matrix estimation using a factor model, J. econometrics, 147, 1, 186-197, (2008) · Zbl 1429.62185
[12] Fisher, T.J., 2009. On the Testing and Estimation of High-Dimensional Covariance Matrices. Ph.D. dissertation, Clemson University.
[13] Golub, T.; Slonim, D.; Tamayo, P.; Huard, C.; Gaasenbeek, M.; Mesirov, J.; Coller, H.; Loh, M.; Downing, J.; Caligiuri, M.; Bloomfield, C.; Lander, E., Molecular classification of cancer: class discovery and class prediction by gene expression monitoring, Science, 286, 5439, 531-537, (1999)
[14] Huang, J.Z.; Liu, N.; Pourahmadi, M.; Liu, L., Covariance matrix selection and estimation via penalised normal likelihood, Biometrika, 93, 1, 85-98, (2006) · Zbl 1152.62346
[15] Ledoit, O.; Wolf, M., Improved estimation of the covariance matrix of stock returns with an application to portfolio selection, J. empir. finance, 10, 5, 603-621, (2003)
[16] Ledoit, O.; Wolf, M., Honey, i shrunk the sample covariance matrix, J. portfolio manage., 31, 1, 110-119, (2004)
[17] Ledoit, O.; Wolf, M., A well-conditioned estimator for large-dimensional covariance matrices, J. multivariate anal., 88, 2, 365-411, (2004) · Zbl 1032.62050
[18] Pourahmadi, M., Cholesky decompositions and estimation of a covariance matrix: orthogonality of variance-correlation parameters, Biometrika, 94, 4, 1006-1013, (2007) · Zbl 1156.62043
[19] Pourahmadi, M.; Daniels, M.J.; Park, T., Simultaneous modelling of the Cholesky decomposition of several covariance matrices, J. multivariate anal., 98, 3, 568-587, (2007) · Zbl 1107.62043
[20] Rothman, A.J.; Levina, E.; Zhu, J., Generalized thresholding of large covariance matrices, J. amer. statist. assoc., 104, 485, 177-186, (2009) · Zbl 1388.62170
[21] Schäfer, J.; Strimmer, K., A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics, Stat. appl. genet. mol. biol., 4, 28 pp., (2005), Art. 32
[22] Srivastava, M.S., Some tests concerning the covariance matrix in high dimensional data, J. Japan statist. soc., 35, 2, 251-272, (2005)
[23] Srivastava, M.S.; Kubokawa, T., Comparison of discrimination methods for high dimensional data, J. Japan statist. soc., 37, 1, 123-134, (2007) · Zbl 1138.62361
[24] Stein, C., Inadmissibility of the usual estimator for the Mean of a multivariate normal distribution, (), 197-206
[25] Stein, C., 1975. Estimation of a covariance matrix. In: Rietz Lecture, 39th Annual Meeting IMS, Atlanta, GA.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.