×

A class of fuzzy clusterwise regression models. (English) Zbl 1204.62112

Summary: We introduce a class of fuzzy clusterwise regression models with linear regression (LR) fuzzy response variable and numeric explanatory variables, which embodies fuzzy clustering, into a fuzzy regression framework. The model bypasses the heterogeneity problem that could arise in fuzzy regression by subdividing the dataset into homogeneous clusters and performing separate fuzzy regression on each cluster. The integration of the clustering model into the regression framework allows us to simultaneously estimate the regression parameters and the membership degree of each observation to each cluster by optimizing a single objective function. The class of models proposed here includes, as special cases, the fuzzy clusterwise linear regression model and the fuzzy clusterwise polynomial regression model. We also introduce a set of goodness of fit indices to evaluate the fit of the regression model within each cluster as well as in the whole dataset. Finally, we consider some cluster validity criteria that are useful in identifying the “optimal” number of clusters. Several applications are provided in order to illustrate the approach.

MSC:

62J05 Linear regression; mixed models
62J86 Fuzziness, and linear inference and regression
62H30 Classification and discrimination; cluster analysis (statistical aspects)
62H12 Estimation in multivariate analysis
62H86 Multivariate analysis and fuzziness
65C60 Computational problems in statistics (MSC2010)
PDFBibTeX XMLCite
Full Text: DOI

References:

[1] Aitkin, M.; Wilson, G. T., Mixture models, outliers and the EM algorithm, Technometrics, 22, 325-331 (1980) · Zbl 0466.62034
[2] Aurifeille, J. M.; Quester, P. G., Predicting business ethical tolerance in international markets: a concomitant clusterwise regression analysis, International Business Review, 12, 253-272 (2003)
[3] Bezdek, J. C., Cluster validity with fuzzy sets, Journal of Cybernetics, 3, 58-73 (1974) · Zbl 0294.68035
[4] Bezdek, J. C., Pattern Recognition with Fuzzy Objective Function Algorithms (1981), Plenum Press: Plenum Press New York · Zbl 0503.68069
[5] Celmins, A., Least squares model fitting to fuzzy vector data, Fuzzy Sets and Systems, 22, 245-269 (1987) · Zbl 0636.62111
[6] Chan, K. Y.; Kwong, C. K.; Fogarty, T. C., Modeling manufacturing processes using a genetic programming-based fuzzy regression with detection of outliers, Information Sciences, 180, 506-518 (2010)
[7] Chang, P.-T.; Lee, E. S.; Konz, S. A., Applying fuzzy linear regression to VDT legibility, Fuzzy Sets and Systems, 80, 197-204 (1996)
[8] Chang, Y.-H. O.; Ayyub, B. M., Fuzzy regression methods - a comparative assessment, Fuzzy Sets and Systems, 119, 187-203 (2001)
[9] Chen, Y. S., Outliers detection and confidence interval modification in fuzzy regression, Fuzzy Sets and Systems, 119, 259-272 (2001) · Zbl 1044.62071
[10] Chen, S.-P.; Dang, F., A variable spread fuzzy linear regression model with higher explanatory power and forecasting accuracy, Information Sciences, 178, 3973-3988 (2008) · Zbl 1242.90305
[11] Chuang, C.-C., Extended support vector interval regression networks for interval input-output data, Information Sciences, 178, 871-891 (2008) · Zbl 1126.68525
[12] Coppi, R.; D’Urso, P., Fuzzy \(k\)-mean clustering models for triangular fuzzy time trajectories, Statistical Methods and Applications, 11, 21-24 (2002) · Zbl 1145.62347
[13] Coppi, R.; D’Urso, P., Regression analysis with fuzzy informational paradigm: a least-squares approach using membership function information, International Journal of Pure and Applied Mathematics, 8, 3, 279-306 (2003) · Zbl 1042.62069
[14] Coppi, R.; D’Urso, P.; Giordani, P.; Santoro, A., Least squares estimation of a linear regression model with LR fuzzy response, Computational Statistics and Data Analysis, 51, 267-286 (2006) · Zbl 1157.62460
[15] D’Urso, P.; Gastaldi, T., An “orderwise” polynomial regression procedure for fuzzy data, Fuzzy Sets and Systems, 130, 1, 1-19 (2002) · Zbl 1033.62070
[16] D’Urso, P.; Santoro, A., Goodness of fit and variable selection in the fuzzy multiple linear regression, Fuzzy Sets and Systems, 157, 2627-2647 (2006) · Zbl 1112.62064
[17] D’Urso, P.; Santoro, A., Fuzzy clusterwise linear regression analysis with symmetrical fuzzy output variable, Computational Statistics and Data Analysis, 51, 287-313 (2006) · Zbl 1157.62461
[18] D’Urso, P., Linear regression analysis for fuzzy/crisp input and fuzzy/crisp output data, Computational Statistics and Data Analysis, 42, 1-2, 47-72 (2003) · Zbl 1429.62337
[19] D’Urso, P.; Gastaldi, T., A least-squares approach to fuzzy linear regression analysis, Computational Statistics and Data Analysis, 34, 427-440 (2000) · Zbl 1046.62066
[20] De Sarbo, W. S.; Cron, W. L., A maximum likelihood methodology for clusterwise linear regression, Journal of Classification, 5, 2, 249-282 (1988) · Zbl 0692.62052
[21] De Sarbo, W. S.; Oliver, R. L.; Rangaswamy, A., A simulated annealing methodology for clusterwise linear regression, Psychometrika, 54, 4, 707-736 (1989)
[22] De Veaux, R. D., Mixtures of linear regressions, Computational Statistics and Data Analysis, 8, 227-245 (1989) · Zbl 0726.62109
[23] Diamond, P.; Tanaka, H., Fuzzy regression analysis, (Slowinski, A., Fuzzy Sets in Decision Analysis. Fuzzy Sets in Decision Analysis, Operations Research and Statistics (1998), Kluwer Academic Publishers: Kluwer Academic Publishers USA, Massachusetts), 349-387 · Zbl 0922.62058
[24] Dubois, D.; Prade, H., Possibility Theory (1988), Plenum Press: Plenum Press New York · Zbl 0645.68108
[25] Ferrari-Trecate, G.; Muselli, M.; Liberati, D.; Morari, M., A clustering technique for the identification of piecewise affine systems, Automatica, 39, 205-217 (2003) · Zbl 1011.93508
[26] Gordaliza Garcı́a-Escudero, L. A.; San Martı́n, R. A.; Van Aelst, S.; Zamar, R., Robust linear clustering, Journal of the Royal Statistical Society B, 71, 301-318 (2009) · Zbl 1231.62112
[27] Garcia-Escudero, L. A.; Gordaliza, A.; Mayo-Iscar, A.; San Martin, R., Robust clusterwise linear regression through trimming, Computational Statistics & Data Analysis, 54, 12, 3057-3069 (2010) · Zbl 1284.62198
[28] González-Rodrı́guez, G.; Blanco, G. A.; Colubi, A.; Lubiano, M. A., Estimation of a simple linear regression model for fuzzy random variables, Fuzzy Sets and Systems, 160, 357-370 (2009) · Zbl 1175.62073
[29] Hathaway, R. J.; Bezdek, J. C., Switching regression models and fuzzy clustering, IEEE Transactions on Fuzzy Systems, 1, 3, 195-204 (1993)
[30] Hathaway, R. J.; Bezdek, J. C.; Pedrycz, W., A parametric model for fusing heterogeneous fuzzy data, IEEE Transactions on Fuzzy Systems, 4, 3, 1277-1282 (1996)
[31] Hennig, C., Identifiability of models for clusterwise linear regression, Journal of Classification, 17, 273-296 (2000) · Zbl 1017.62058
[32] Hennig, C., Clusters, outliers, and regression: fixed point clusters, Journal of Multivariate Analysis, 86, 183-212 (2003) · Zbl 1020.62051
[33] Hennig, C., Cluster-wise assessment of cluster stability, Computational Statistics and Data Analysis, 52, 258-271 (2007) · Zbl 1452.62447
[34] Hong, D. H.; Hwang, C.; Ahn, C., Ridge estimation for regression models with crisp inputs and Gaussian fuzzy output, Fuzzy Sets and Systems, 142, 307-319 (2004) · Zbl 1041.62054
[35] Hong, Y.-Y.; Chao, Z.-T., Development of energy loss formula for distribution systems using FCN algorithm and cluster-wise fuzzy regression, IEEE Transactions on Power Delivery, 17, 3, 794-799 (2002)
[36] Hung, W.-L.; Yang, M.-S., Fuzzy clustering on LR-type fuzzy numbers with an application to Taiwanese tea evaluation, Fuzzy Sets and Systems, 150, 561-577 (2005) · Zbl 1065.68084
[37] Hung, W.-L.; Yang, M.-S., An omission approach for detecting outliers in fuzzy regression models, Fuzzy Sets and Systems, 157, 3109-3122 (2006) · Zbl 1106.62068
[38] Khashei, M.; Hejazi, S. R.; Bijari, M., A new hybrid artificial neural networks and fuzzy regression model for time series forecasting, Fuzzy Sets and Systems, 159, 769-786 (2008) · Zbl 1274.62599
[39] Kim, H. K.; Yoon, J. H.; Li, Y., Asymptotic properties of least squares estimation with fuzzy observations, Information Sciences, 178, 439-451 (2008) · Zbl 1128.62028
[40] Lau, K.; Leung, P.; Tse, K., A mathematical programming approach to clusterwise regression model and its extensions, European Journal of Operational Research, 111, 640-652 (1999) · Zbl 0993.62052
[41] Leşki, J. M., \(ε\)-insensive fuzzy c-regression models: introduction to \(ε\)-insensive fuzzy modelling, IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics, 34, 1, 4-15 (2004)
[42] Li, C.; Zhou, J.; Xiang, X.; Li, Q.; An, X., T-S fuzzy model identification based on a novel fuzzy c-regression model clustering algorithm, Engineering Applications of Artificial Intelligence, 22, 646-653 (2009)
[43] McCormick, R. E., Managerial Economics (1993), Prentice-Hall: Prentice-Hall Englewood Cliffs, NJ
[44] Papadakis, S. E.; Kaburlasos, V. G., Piecewise-linear approximation of nonlinear models based on probabilistically/possibilistically interpreted intervals’ numbers (INs), Information Sciences, 180, 5060-5076 (2010) · Zbl 1206.68302
[45] Preda, C.; Saporta, G., Clusterwise PLS regression on a stochastic process, Computational Statistics and Data Analysis, 49, 99-108 (2005) · Zbl 1429.62299
[46] Quandt, R. E., A new approach to estimating switching regressions, Journal of the American Statistical Association, 67, 306-310 (1972) · Zbl 0237.62047
[47] Quandt, R. E.; Ramsey, J. B., Estimating mixtures of normal distributions and switching regressions (with discussion), Journal of the American Statistical Association, 73, 730-752 (1978) · Zbl 0401.62024
[48] Raj, P. A.; Kumar, D. N., Ranking alternatives with fuzzy weights using maximizing set and minimizing set, Fuzzy Sets and Systems, 105, 365-375 (1999) · Zbl 0963.91024
[49] Roubens, M., Fuzzy clustering algorithm and their cluster validity, Eur. J. Oper. Res., 10, 294-331 (1982) · Zbl 0485.62055
[50] Shakouri, H.; Nadimi, G. R., A novel fuzzy linear regression model based on a non-equality possibility index and optimum uncertainty, Appl. Soft Comput., 9, 590-598 (2009)
[51] Shao, Q.; Wu, Y., A consistent procedure for determining the number of clusters in regression clustering, J. Stat. Planning Inference, 135, 461-476 (2005) · Zbl 1074.62042
[52] Spath, H., Clusterwise linear regression, Computing, 22, 4, 367-373 (1979) · Zbl 0387.65028
[53] Steinley, D.; Brusco, M. J., Initializing K-means batch clustering, a critical evaluation of several techniques, Journal of Classification, 24, 99-121 (2007) · Zbl 1144.62331
[54] Suk, H. W.; Hwang, H., Regularized fuzzy clusterwise ridge regression, Advances in Data Analysis and Classification, 4, 35-51 (2010) · Zbl 1306.62166
[55] Van Aelest, S.; Wang, X. S.; Zamar, R. H.; Zhu, R., Linear grouping using orthogonal regression, Computational Statistics and Data Analysis, 50, 5, 1287-1312 (2006) · Zbl 1431.62273
[56] Wang, N.; Zhang, W.-X.; Mei, C.-L., Fuzzy nonparametric regression based on local linear smoothing technique, Information Sciences, 177, 3882-3900 (2007) · Zbl 1114.62056
[57] Wedel, M.; Steenkamp, J. B.E. M., A clusterwise regression method for simultaneous fuzzy market structuring and benefit segmentation, Journal of Marketing Research, 28, 385-396 (1991)
[58] Wu, K.-L.; Yang, M.-S.; Hsieh, J.-N., Mountain c-regressions method, Pattern Recognition, 43, 86-98 (2010) · Zbl 1176.68190
[59] Xu, R.; Li, C., Multidimensional least-squares fitting with a fuzzy model, Fuzzy Sets and Systems, 119, 215-223 (2001) · Zbl 0964.62057
[60] Yang, M. S.; Ko, C. H., On cluster-wise fuzzy regression analysis, IEEE Transactions on Systems, Man, and Cybernetics, Part B, 27, 1, 1-13 (1997)
[61] Yang, M.-S.; Wu, K.-L.; Hsieh, J.-N.; Yu, J., Alpha-cut implemented fuzzy clustering algorithms and switching regressions, IEEE Transactions on Systems, Man, and Cybernetics-B, 38, 588-603 (2008)
[62] Yu, J.-R.; Lee, C.-W., Piecewise regression for fuzzy input-output data with automatic change-point detection by quadratic programming, Applied Soft Computing, 10, 111-118 (2010)
[63] Zimmermann, H. J., Fuzzy Set Theory and its Applications (2001), Kluwer Academic Press: Kluwer Academic Press Dordrecht
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. In some cases that data have been complemented/enhanced by data from zbMATH Open. This attempts to reflect the references listed in the original paper as accurately as possible without claiming completeness or a perfect matching.