×

Multiview feature analysis via structured sparsity and shared subspace discovery. (English) Zbl 1456.68141

Summary: Since combining features from heterogeneous data sources can significantly boost classification performance in many applications, it has attracted much research attention over the past few years. Most of the existing multiview feature analysis approaches separately learn features in each view, ignoring knowledge shared by multiple views. Different views of features may have some intrinsic correlations that might be beneficial to feature learning. Therefore, it is assumed that multiviews share subspaces from which common knowledge can be discovered. In this letter, we propose a new multiview feature learning algorithm, aiming to exploit common features shared by different views. To achieve this goal, we propose a feature learning algorithm in a batch mode, by which the correlations among different views are taken into account. Multiple transformation matrices for different views are simultaneously learned in a joint framework. In this way, our algorithm can exploit potential correlations among views as supplementary information that further improves the performance result. Since the proposed objective function is nonsmooth and difficult to solve directly, we propose an iterative algorithm for effective optimization. Extensive experiments have been conducted on a number of real-world data sets. Experimental results demonstrate superior performance in terms of classification against all the compared approaches. Also, the convergence guarantee has been validated in the experiment.

MSC:

68T05 Learning and adaptive systems in artificial intelligence
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Bay, H., Tuytelaars, T., & Gool, L. V. (2006). Surf: Speeded up robust features. In Proceedings of the 9th European Conference on Computer Vision. Berlin: Springer.
[2] Cai, X., Nie, F., & Huang, H. (2013). Multi-view k-means clustering on big data. In Proceedings of the Conference on Artificial Intelligence. Cambridge, MA: AAAI Press.
[3] Chang, C.-C., & Lin, C.-J. (2011). LIBSVM: A library for support vector machines. ACM TIST, 2(3), 27:1-27:27. .
[4] Chang, X., Nie, F., Wang, S., Yang, Y., Zhou, X., & Zhang, C. (2016). Compound rank-k projections for bilinear analysis. IEEE Trans. Neural Netw. Learning Syst., 27(7), 1502-1513. ,
[5] Chang, X., Nie, F., Yang, Y., & Huang, H. (2014). A convex formulation for semisupervised multi-label feature selection. In Proceedings of the Twenty-Eighth AAAI Conference on Artificial Intelligence (pp. 1171-1177). Cambridge, MA: AAAI Press.
[6] Chang, X., Nie, F., Yang, Y., Zhang, C., & Huang, H. (2016). Convex sparse PCA for unsupervised feature learning. TKDD, 11(1), 3:1-3:16. ,
[7] Chang, X., Yu, Y., Yang, Y., & Xing, E. P. (2016a). They are not equally reliable: Semantic event search using differentiated concept classifiers. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (pp. 1884-1893). Piscataway, NJ: IEEE. ,
[8] Chang, X., Yu, Y.-L., Yang, Y., & Xing, E. P. (2016b). Semantic pooling for complex event analysis in untrimmed videos. IEEE Transactions on Pattern Analysis and Machine Intelligence. doi:10.1109/TPAMI.2016.2608901,
[9] Chen, H., Cai, X., Zhu, D., Nie, F., Liu, T., & Huang, H. (2012). Group-wise consistent parcellation of gyri via adaptive multi-view spectral clustering of fiber shapes. In Proceedings of the Conference on Medical Computing and Computer-Assisted Intervention. Berlin: Springer.
[10] Chua, T.-S., Tang, J., Hong, R., Li, H., Luo, Z., & Zheng, Y.-T. (2009). Nus-wide: A real-world web image database from national university of singapore. In Proceedings of the 8th International Conference on Image and Video Retrieval. New York: ACM.
[11] Conrad, C., & Mester, R. (2013). Learning multi-view correspondences via subspace-based temporal coincidences. In Proceedings of the 18th Scandinavian Conference on Image Analysis. Berlin: Springer. ,
[12] Dalal, N., & Triggs, B. (2005). Histograms of oriented gradients for human detection. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway, NJ: IEEE.
[13] Fan, M., Chang, X., & Tao, D. (2017). Structure regularized unsupervised discriminant feature analysis. In Proceedings of the AAAI Conference on Artificial Intelligence. Cambridge, MA: AAAI Press.
[14] Foster, D. P., Kakade, S. M., & Zhang, T. (2008). Multi-view dimensionality reduction via canonical correlation analysis (Technical Report). Chicago: Toyota Technological Institute.
[15] Gehler, P., & Nowozin, S. (2009). On feature combination for multiclass object classification. In Proceedings of the 12th International Conference on Computer Vision. Piscataway, NJ: IEEE.
[16] Grauman, K., & Darrell, T. (2006). Unsupervised learning of categories from sets of partially matching image features. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition.Piscataway, NJ: IEEE.
[17] Guo, H., & Viktor, H. L. (2008). Multirelational classification: A multiple view approach. Knowledge and Information Systems, 17(3), 287-312. ,
[18] Ham, J., Lee, D., & Saul., L. (2005). Semisupervised alignment of manifolds. In Proceedings of the 10th International Workshop on Artificial Intelligence and Statistics. N.p.: Society for Artificial Intelligence and Statistics.
[19] Jing, X., Hu, R., Zhu, Y., Wu, S., Liang, C., & Yang, J. (2014). Intra-view and inter-view supervised correlation analysis for multi-view feature learning. In Proceedings of the AAAI Conference on Artificial Analysis. Cambridge, MA: MIT Press.
[20] Kapoor, A., Grauman, K., Urtasun, R., & Darrell, T. (2010). Gaussian processes for object categorization. International Journal of Computer Vision, 88(2), 169-188. , · Zbl 1477.68384
[21] Kloft, M., Brefeld, U., Laskov, P., & Sonnenburg, S. (2008). Non-sparse multiple kernel learning. In Proceedings of the NIPS Workshop Kernel Learning: Automatic Selection of Kernels.
[22] Lanckriet, G. R., Cristianini, N., Bartlett, P., Ghaoui, L. E., & Jordan, M. I. (2004). Learning the kernel matrix with semidefinite programming. Journal of Machine Learning Research, 5, 27-72. · Zbl 1222.68241
[23] Lowe, D. G. (2004). Distinctive image features from scale-invariant keypoints. International Journal of Computer Vision, 60(2), 91-110. ,
[24] Luo, M., Nie, F., Chang, X., Yang, Y., Hauptmann, A. G., & Zhang, Q. (2017). Avoiding optimal mean 2,1-norm maximization-based robust PCA for reconstruction. Neural Computation, 29, 1124-1150. , · Zbl 1418.62245
[25] Monadjemi, A., Thomas, B. T., & Mirmehdi, M. (2002). Experiments on high resolution images towards outdoor scene classification (Technical Report). Bristol: University of Bristol, Department of Computer Science.
[26] Sonnenburg, S., Rätsch, G., Schäfer, C., & Schölkopf, B. (2006). Large scale multiple kernel learning. Journal of Machine Learning Research, 7, 1531-1565. · Zbl 1222.90072
[27] Suykens, J. A., & Vandewalle, J. (1999). Least squares support vector machine classifiers. In Neural Processing Letters, 9, 293-300. ,
[28] Wang, C., & Mahadevan, S. (2013). Manifold alignment preserving global geometry. In Proceedings of the 23rd International Joint Conference on Artificial Intelligence. Cambridge, MA.
[29] Wang, H., Nie, F., & Huang, H. (2013). Multi-view clustering and feature learning via structured sparsity. In Proceedings of the International Conference on Machine Learning.
[30] Wang, S., Chang, X., Li, X., Long, G., Yao, L., & Sheng, Q. Z. (2016). Diagnosis code assignment using sparsity-based disease correlation embedding. IEEE Trans. Knowl. Data Eng., 28(12), 3191-3202. ,
[31] Wang, S., Chang, X., Li, X., Sheng, Q. Z., & Chen, W. (2016). Multi-task support vector machines for feature selection with shared knowledge discovery. Signal Processing, 120, 746-753. ,
[32] Wang, S., Ma, Z., Yang, Y., Li, X., Pang, C., & Hauptmann, A. G. (2014). Semisupervised multiple feature analysis for action recognition. IEEE Transactions on Multimedia, 16(2), 289-298. ,
[33] Xue, X., Nie, F., Wang, S., Chang, X., Stantic, B., & Yao, M. (2017). Multi-view correlated feature learning by uncovering shared component. In Proceedings of the Conference on Artificial Analysis. Cambridge, MA: AAAI Press.
[34] Yang, Y., Song, J., Huang, Z., Ma, Z., Sebe, N., & Hauptmann, A. (2012). Multi-feature fusion via hierarchical regression for multimedia analysis. IEEE Transactions on Multimedia, 15, 572-581. ,
[35] Ye, J., Ji, S., & Chen, J. (2008). Multi-class discriminant kernel learning via convex programming. Journal of Machine Learning Research, 9, 719-758. · Zbl 1225.68223
[36] Yu, S., Falck, T., Daemen, A., Tranchevent, L.-C., Suykens, J. A., Moor, B. D., & Moreau, Y. (2010). L2-norm multiple kernel learning and its application to biomedical data fusion. BMC Bioinformatics, 11, 309.
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.