×

A cluster validity index for fuzzy clustering. (English) Zbl 1204.62102

Summary: A cluster validity index proposed by Y.-I. Kim et al. [Inf. Sci. 168, No. 1–4, 225–242 (2004; Zbl 1075.62052)] is analyzed, and a problem is discussed that the validity index faces in situations when there are well-separated clusters that themselves include subclusters. Based on this analysis, a new validity index is proposed. The new validity index employs a compactness measure and a separation measure. The compactness measure combines the fuzziness in the membership matrix \((U)\) with the geometrical compactness of the representation of the data set \((X)\) via the prototypes \((V)\). The separation measure is defined as the average value of the degrees of overlap of all possible pairs of fuzzy clusters in the system. The proposed index is tested and validated using several data sets. The results of the comparison show the superior effectiveness and reliability of the proposed index in comparison to other indices.

MSC:

62H30 Classification and discrimination; cluster analysis (statistical aspects)
62H86 Multivariate analysis and fuzziness

Citations:

Zbl 1075.62052

Software:

J-MEANS
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Duda, R.; Hart, P., Pattern Classification and Scene Analysis (1973), Wiley Interscience: Wiley Interscience New York · Zbl 0277.68056
[2] Bezdek, J. C., Pattern Recognition with Fuzzy Objective Function Algorithms (1981), Plenum: Plenum New York · Zbl 0503.68069
[3] Hoppner, F.; Klawonn, F.; Kruse, R.; Runkler, T., Fuzzy Cluster Analysis: Methods for Classification, Data Analysis and Image Recognition (1999), John Wiley · Zbl 0944.65009
[4] Titterington, D.; Smith, A.; Makov, U., Statistical Analysis of Finite Mixture Distributions (1985), John Wiley and Sons · Zbl 0646.62013
[5] Krishnapuram, R.; Keller, J., A possibilistic approach to clustering, IEEE Trans. Fuzzy Systems, 1, 2, 98-110 (1993)
[6] D. Gustafson, W. Kessel, Fuzzy clustering with a fuzzy covariance matrix, in: Proceedings of the IEEE CDC, San Diego, CA, USA, 1979, pp. 761-766.; D. Gustafson, W. Kessel, Fuzzy clustering with a fuzzy covariance matrix, in: Proceedings of the IEEE CDC, San Diego, CA, USA, 1979, pp. 761-766. · Zbl 0448.62045
[7] Bezdek, J. C., Numerical taxonomy with fuzzy sets, J. Math. Biol., 1, 57-71 (1974) · Zbl 0403.62039
[8] Bezdek, J. C., Cluster validity with fuzzy sets, J. Cybernet., 3, 58-72 (1974) · Zbl 0294.68035
[9] J.C. Bezdek, Mathematical models for systematic and taxonomy, in: Eighth International Conference on Numerical Taxonomy, San Francisco, CA, 1975, pp. 143-165.; J.C. Bezdek, Mathematical models for systematic and taxonomy, in: Eighth International Conference on Numerical Taxonomy, San Francisco, CA, 1975, pp. 143-165.
[10] Y. Fukuyama, M. Sugeno, A new method of choosing the number of clusters for the fuzzy \(c\); Y. Fukuyama, M. Sugeno, A new method of choosing the number of clusters for the fuzzy \(c\)
[11] Xie, X. L.; Beni, G., A validity measure for fuzzy clustering, IEEE Trans. Pattern Anal. Mach. Intell., 13, 8, 841-847 (1991)
[12] Kwon, S. H., Cluster validity index for fuzzy clustering, Electron. Lett., 34, 22, 2176-2177 (1998)
[13] Kim, D.-W.; Lee, K.-H.; Lee, D., Fuzzy cluster validation index based on inter-cluster proximity, Pattern Recognition Lett., 24, 2561-2574 (2003)
[14] Kim, D. W.; Lee, K. H.; Lee, D., On cluster validity index for estimation of the optimal number of fuzzy clusters, Pattern Recognition, 37, 2009-2025 (2004)
[15] Kim, Y.; Kim, D.; Lee, D.; Lee, K. H., A cluster validation index for GK cluster analysis based on relative degree of sharing, Inform. Sci., 168, 225-242 (2004) · Zbl 1075.62052
[16] Pakhira, M. K.; Bandyopadhyay, S.; Maulik, U., Validity index for crisp and fuzzy clusters, Pattern Recognition, 37, 481-501 (2004) · Zbl 1068.68135
[17] Pakhira, M. K.; Bandyopadhyay, S.; Maulik, U., A study of some fuzzy cluster validity indices, genetic clustering and application to pixel classification, Fuzzy Set and Systems, 155, 2, 191-214 (2005)
[18] Kim, M.; Ramakrishna, R. S., New indices for cluster validity assessment, Pattern Recognition Lett., 26, 2353-2363 (2005)
[19] Wu, K. L.; Yang, M. S., A cluster validity index for fuzzy clustering, Pattern Recognition Lett., 26, 1275-1291 (2005)
[20] Bouguessa, M.; Wang, S.; Sun, H., An objective approach to cluster validation, Pattern Recognition Lett., 27, 1419-1430 (2006)
[21] Yu, J.; Li, C. X., Novel cluster validity index for FCM algorithm, J. Comput. Sci. Technol., 21, 1, 137-140 (2006)
[22] Ding, Y.; Harrison, R. F., Relational visual cluster validity (RVCV), Pattern Recognition Lett., 28, 2071-2079 (2007)
[23] Zhang, Y.; Wang, W.; Zhang, X.; Li, Y., A cluster validity index for fuzzy clustering, Inform. Sci., 178, 1205-1218 (2008) · Zbl 1134.68050
[24] Halkidi, M.; Batistakis, Y.; Vazirgiannis, M., Cluster validity methods: part I, SIGMOD Record, 31, 2, 40-45 (2002)
[25] Halkidi, M.; Batistakis, Y.; Vazirgiannis, M., Cluster validity methods: part II, SIGMOD Record, 31, 3, 19-27 (2002)
[26] Wang, W.; Zhang, Y., On fuzzy cluster validity indices, Fuzzy Sets and Systems, 158, 2095-2117 (2007) · Zbl 1123.62046
[27] Bezdek, J. C.; Pal, N. R., Some new indices of cluster validity, IEEE Trans. Systems Man Cybernet., 28, 301-315 (1998)
[28] Pal, N. R.; Bezdek, J. C., On cluster validity for the fuzzy \(c\)-means model, IEEE Trans. Fuzzy Systems, 3, 3, 370-379 (1995)
[29] Lee-Kwang, H.; Song, Y. S.; Lee, K. M., Similarity measure between fuzzy sets and between elements, Fuzzy Sets and Systems, 62, 291-293 (1994)
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.