Biclustering via sparse singular value decomposition. (English) Zbl 1233.62182

Summary: Sparse singular value decomposition (SSVD) is proposed as a new exploratory analysis tool for biclustering or identifying interpretable row-column associations within high-dimensional data matrices. SSVD seeks a low-rank, checkerboard structured matrix approximation to the data matrices. The desired checkerboard structure is achieved by forcing both the left- and right-singular vectors to be sparse, that is, having many zero entries. By interpreting singular vectors as regression coefficient vectors for certain linear regressions, sparsity-inducing regularization penalties are imposed to the least squares regression to produce sparse singular vectors. An efficient iterative algorithm is proposed for computing the sparse singular vectors, along with some discussion of penalty parameter selection. A lung cancer microarray and a food nutrition data set are used to illustrate SSVD as a biclustering method. SSVD is also compared with some existing biclustering methods using simulated data sets.


62P10 Applications of statistics to biology and medical sciences; meta analysis
62H30 Classification and discrimination; cluster analysis (statistical aspects)
65C60 Computational problems in statistics (MSC2010)
62J05 Linear regression; mixed models
92C50 Medical applications (general)


Full Text: DOI


[1] Asgarian, Using rank-1 biclusters to classify microarray data (2008)
[2] Bondell, Simultaneous regression shrinkage, variable selection, and supervised clustering of predictors with OSCAR, Biometrics 64 pp 115– (2008) · Zbl 1146.62051
[3] Breiman, Better subset regression using the nonnegative garotte, Technometrics 37 pp 373– (1995) · Zbl 0862.62059
[4] Busygin, Feature selection for consistent biclustering via fractional 0-1 programming, Journal of Combinatorial Optimization 10 pp 7– (2005) · Zbl 1123.90073
[5] Busygin, Biclutering in data mining, Computers and Operations Research 35 pp 2964– (2008) · Zbl 1144.68309
[6] Dhillon , I. S. Mallela , S. Modha , D. S. 2003 Information-theoretic co-clustering Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining 89 98 Springer
[7] Eckart, The approximation of one matrix by another of lower rank, Psychometrika 1 pp 211– (1936) · JFM 62.1075.02
[8] Fan, Variable selection via nonconcave penalized likelihood and its oracle properties, Journal of the American Statistical Association 96 pp 1348– (2001) · Zbl 1073.62547
[9] Jolliffe, Principal Component Analysis (2002)
[10] Jolliffe, A modified principal component technique based on the lasso, Journal of Computational and Graphical Statistics 12 pp 531– (2003)
[11] Kluger, Spectral biclustering of microarray data: Coclustering genes and conditions, Genome Research 13 pp 703– (2003)
[12] Lazzeroni, Plaid models for gene expression data, Statistica Sinica 12 pp 61– (2002) · Zbl 1004.62084
[13] Leng, On general adaptive sparse principal component analysis, Journal of Computational and Graphical Statistics 18 pp 201– (2009)
[14] Liu, Statistical significance of clustering for high dimension low sample size data, Journal of the American Statistical Association 103 pp 1281– (2008) · Zbl 1205.62079
[15] Madeira, Biclustering algorithms for biological data analysis: A survey, IEEE Transactions on Computational Biology and Bioinformatics 1 pp 24– (2004) · Zbl 05103330
[16] Schwarz, Estimating the dimension of a model, Annals of Statistics 6 pp 461– (1978) · Zbl 0379.62005
[17] Shabalin, Finding large average submatrices in high dimensional data, Annals of Applied Statistics 3 pp 985– (2009) · Zbl 1196.62087
[18] Shen, Sparse principal component analysis via regularized low rank matrix approximation, Journal of Multivariate Analysis 99 pp 1015– (2008) · Zbl 1141.62049
[19] Tibshirani, Regression shrinkage and selection via the lasso, Journal of the Royal Statistical Society, Series B 58 pp 267– (1996) · Zbl 0850.62538
[20] Wang, Regression coefficient and autoregressive order shrinkage and selection via lasso, Journal of the Royal Statistical Society, Series B 69 pp 63– (2007)
[21] Wang, Variable selection for model-based high-dimensional clustering and its application to microarray data, Biometrics 64 pp 440– (2008) · Zbl 1137.62041
[22] Yang, Biclustering of microarray data based on singular value decomposition, Emerging Technologies in Knowledge Discovery and Data Mining pp 194– (2007) · Zbl 05269648
[23] Zhang, Adaptive-lasso for Cox’s proportional hazard model, Biometrika 94 pp 691– (2007) · Zbl 1135.62083
[24] Zou, The adaptive lasso and its oracle properties, Journal of the American Statistical Association 101 pp 1418– (2006) · Zbl 1171.62326
[25] Zou, Regularization and variable selection via the elastic net, Journal of the Royal Statistical Society, Series B 67 pp 301– (2005) · Zbl 1069.62054
[26] Zou, Sparse principal component analysis, Journal of Computational and Graphical Statistics 15 pp 265– (2006)
[27] Zou, On the ”degrees of freedom” of the lasso, Annals of Statistics 35 pp 2173– (2007) · Zbl 1126.62061
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.