×

Kernel-improved support vector machine for semanteme data. (English) Zbl 1288.68196

Summary: The computation for the inner production of semanteme data and the Support Vector Machine (SVM) classification of semantic data are very difficult. In this paper, a novel kernel-based semanteme data classification method is proposed, and the (SVM) is extended to semantic SVM, called the Semanteme-based Support Vector Machine (SSVM). A new dissimilarity definition and a simple inner production computation method for semanteme data are presented, and the parameters for optimization selection in SSVM are also discussed. In the proposed algorithm, the solving support vector process based on the new inner production computation method remains a quadratic program problem but has a lower computation complexity. The SSVM is insensitive to outliers, and its classification capability for unbalanced data in actual datasets is analyzed. The experimental results demonstrate the average advantage of SSVM over algorithm C4.5, adaptive dissimilarity metric, and value difference metric in terms of classification and robust capability, indicating that the proposed method has promising performance.

MSC:

68T05 Learning and adaptive systems in artificial intelligence

Software:

UCI-ml; C4.5
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Minho, K.; Ramakrishna, R. S., Projected clustering for categorical datasets, Pattern Recognition Letters, 27, 1405-1417 (2009)
[2] Esposito, F.; Malerba, D.; Tamma, V.; Bock, H. H., Classical resemblance measure, (Bock, H. H.; Diday, E., Analysis of Symbolic Data (2000), Springer: Springer Berlin), 139-152 · Zbl 0977.62069
[3] Stanfill, C.; Waltz, D., Towards memory-based reasoning, Communications of the ACM, 29, 1213-1228 (1986)
[4] Cheng, V.; Li, C. H.; Kwokb, J. T.; Lic, C. K., Dissimilarity learning for semanteme data, Pattern Recognition, 37, 1471-1477 (2004)
[5] Gower, J. C.; Legendre, P., Metric and Euclidean properties of dissimilarity coefficients, Journal of Classification, 3, 5-48 (1986) · Zbl 0592.62048
[6] Zhang, X.-Y.; Zhou, J.-Z.; Wang, C.-Q.; Li, C.-S.; Song, L.-X., Multi-class support vector machine optimized by inter-cluster distance and self-adaptive deferential evolution, Applied Mathematics and Computation, 218, 4973-4987 (2012) · Zbl 1238.68137
[7] Burges, J. C., A tutorial on support vector machine for pattern recognition, Data Mining and Knowledge Discovery, 2, 121-167 (1998)
[8] Vapnik, V. N., Statistical Learning Theory (1998), John Wiley & Sons, Inc.: John Wiley & Sons, Inc. New York · Zbl 0935.62007
[9] Scholkopf, B.; Mika, S.; Burges, C. J.C.; Knirsch, P.; Muller, K.-R.; Ratsch, G.; Smola, A. J., Input space versus feature space in kernel-based methods, IEEE Transactions on Neural Networks, 10, 1000-1017 (2009)
[10] Wang, D.-F.; Daniel, S. Y.; Eric, C. C.T., Weighted mahalanobis distance kernels for support vector machines, IEEE Transaction on Neural Networks, 18, 1453-1462 (2007)
[11] Richard, O. D.; Peter, E. H.; David, G. S., Pattern Classification (2001), John Wiley · Zbl 0968.68140
[12] Han, X.-P.; Zhao, J., The creation of semantic metadata based on Wikipedia, Journal of Chinese Information Processing, 23, 108-114 (2009)
[13] Li, W.-F.; Duan, Y.; Liu, Y.; Sun, C.-L., A feature selection algorithm of orienting to the flow classification, Journal of Chinese Information Processing, 23, 51-57 (2009)
[14] Chen, C.-S., Quadratic optimal synchronization of uncertain chaotic systems, Applied Mathematics and Computation, 217, 10377-10386 (2011) · Zbl 1225.65119
[15] Li, Z.-H.; Wang, S.-T., Clustering with outliers-based anomalous intrusion detection, Systems Engineering and Electronics, 31, 1227-1230 (2009)
[17] Sever, A., A problem of data mining in E-commerce, Applied Mathematics and Computation, 217, 9966-9970 (2011) · Zbl 1221.68200
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.