## A structural mixed model to shrink covariance matrices for time-course differential gene expression studies.(English)Zbl 1453.62151

Summary: Time-course microarray studies require a particular modelling of covariance matrices when measures are repeated on the same individuals. Taking into account the within-subject correlation in the test statistics for differential gene expression, however, requires a large number of parameters when a gene-specific approach is used, which often results in a lack of power due to the small number of individuals usually considered in microarray experiments. Shrinkage approaches can improve this detection power in differential gene expression studies by reducing the number of parameters, while offering a good flexibility and a small rate of false positives. A natural extension of the shrinkage approach based on a structural mixed model to variance-covariance matrices is proposed. The structural model was used in three configurations to shrink (i) the eigenvalues in an eigenvalue/eigenvector decomposition, (ii) the innovation variances in a Cholesky decomposition, (iii) both the variances and correlation parameters of a gene-by-gene covariance matrix using a Fisher transformation. The proposed methods were applied both to a publicly available data set and to simulated data. They were found to perform well, compared to previously proposed empirical Bayesian approaches, and outperformed the gene-specific or common-covariance methods in many cases.

### MSC:

 62-08 Computational methods for problems pertaining to statistics 62P10 Applications of statistics to biology and medical sciences; meta analysis 62H12 Estimation in multivariate analysis 92D20 Protein sequences, DNA sequences

### Software:

corpor; VarMixt; WinBUGS; maSigPro
Full Text:

### References:

 [1] Angelini, C.; De Canditiis, D.; Mutarelli, M.; Pensky, M., A Bayesian approach to estimation and testing in time-course microarray experiments, Statistical applications in genetics and molecular biology, 6, 1, 24, (2007) · Zbl 1166.62327 [2] Baldi, P.; Long, A., A Bayesian framework for the analysis of microarray expression data: regularized $$t$$-test and statistical inferences of gene changes, Bioinformatics, 17, 509-519, (2001) [3] Conesa, A.; Nueda, M.J.; Talon, M., Masigpro: A method to identify significantly differential expression profiles in time-course microarray experiments, Bioinformatics, 22, 9, 1096-1102, (2006) [4] Cui, X.; Gene Hwang, J.; Qiu, J.; Blades, N.; Churchill, G., Improved statistical tests for differential gene expression by shrinking variance components estimates, Biostatistics, 6, 59-75, (2005) · Zbl 1069.62090 [5] Daniels, M.J.; Kass, R.E., Shrinkage estimators for covariance matrices, Biometrics, 57, 4, 1173-1184, (2001) · Zbl 1209.62132 [6] Daniels, M.J.; Pourahmadi, M., Bayesian analysis of covariance matrices and dynamic models for longitudinal data, Biometrika, 89, 3, 553-566, (2002) · Zbl 1036.62019 [7] Delmar, P.; Robin, S.; Daudin, J.J., Varmixt: efficient variance modelling for the differential analysis of replicated gene expression data, Bioinformatics, 21, 4, 502-508, (2005) [8] Foulley, J.-L.; San Cristobal, M.; Gianola, D.; Im, S., Marginal likelihood and Bayesian approaches to the analysis of heterogeneous residual variances in mixed linear Gaussian models, Computational statistics and data analysis, 13, 291-305, (1992) · Zbl 0800.62386 [9] Foulley, J.-L.; Sorensen, D.; Robert-Granié, C.; Bonaiti, B., Heteroskedasticity and structural models for variances, Journal of Indian society of agricultural statistics, 57, 64-70, (2004) · Zbl 1188.62107 [10] Gabriel, K.R., Ante-dependence analysis of an ordered set of variables, The annals of mathematical statistics, 33, 1, 201-212, (1962) · Zbl 0111.15604 [11] Guo, X.; Qi, H.; Verfaillie, C.M.; Pan, W., Statistical significance analysis of longitudinal gene expression data, Bioinformatics, 19, 13, 1628-1635, (2003) [12] Jaffrézic, F.; Marot, G.; Degrelle, S.; Hue, I.; Foulley, J.-L., A structural mixed model for variances in differential gene expression studies, Genetical research, 89, 1, 19-25, (2007) [13] Kenward, M.G.; Roger, J.H., Small sample inference for fixed effects from restricted maximum likelihood, Biometrics, 53, 983-997, (1997) · Zbl 0890.62042 [14] Lewin, A.; Richardson, S.; Marshall, C.; Glazier, A.; Aitman, T., Bayesian modeling of differential gene expression, Biometrics, 62, 1-9, (2006) · Zbl 1099.62131 [15] Liang, K.; Zeger, S., Longitudinal data analysis using generalized linear models, Biometrika, 73, 13-22, (1986) · Zbl 0595.62110 [16] Nunez-Anton, V.; Zimmerman, D.L., Modeling nonstationary longitudinal data, Biometrics, 56, 3, 699-705, (2000) · Zbl 1060.62647 [17] Opgen-Rhein, R.; Strimmer, K., Accurate ranking of differentially expressed genes by a distribution-free shrinkage approach, Statistical applications in genetics and molecular biology, 6, 1, 9, (2007) · Zbl 1166.62361 [18] Rao, C., Linear statistical inference and its applications, (1973), Wiley New York · Zbl 0256.62002 [19] Smyth, G.K., Linear models and empirical Bayes methods for assessing differential expression in microarray experiments, Statistical applications in genetics and molecular biology, 3, 1, 3, (2004) · Zbl 1038.62110 [20] Spiegelhalter, D., Thomas, A., Best, N., 2004. WinBUGS Version 1.4 User Manual. Cambridge: Medical Research Council Biostatistics Unit. Available from http://www.mrc-bsu.cam.ac.uk/bugs [21] Storey, J.D.; Xiao, W.; Leek, J.T.; Tompkins, R.G.; Davis, R.W., Significance analysis of time course microarray experiments, Proceedings of the national Academy of sciences of the united states of America, 102, 36, 12837-12842, (2005) [22] Tusher, V.G.; Tibshirani, R.; Chu, G., Significance analysis of microarrays applied to the ionizing radiation response, Proceedings of the national Academy of sciences of the united states of America, 98, 9, 5116-5121, (2001) · Zbl 1012.92014
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.