×

Genetic learning of the membership functions for mining fuzzy association rules from low quality data. (English) Zbl 1360.68703

Summary: Many methods have been proposed to mine fuzzy association rules from databases with crisp values in order to help decision-makers make good decisions and tackle new types of problems. However, most real-world problems present a certain degree of imprecision. Various studies have been proposed to mine fuzzy association rules from imprecise data but they assume that the membership functions are known in advance and it is not an easy task to know a priori the most appropriate fuzzy sets to cover the domains of the variables. In this paper, we propose FARLAT-LQD, a new fuzzy data-mining algorithm to obtain both suitable membership functions and useful fuzzy association rules from databases with a wide range of types of uncertain data. To accomplish this, first we perform a genetic learning of the membership functions based on the 3-tuples linguistic representation model to reduce the search space and to learn the most adequate context for each fuzzy partition, maximizing the fuzzy supports and the interpretability measure GM3M in order to preserve the semantic interpretability of the obtained membership functions. Moreover, we propose a new algorithm based on the Fuzzy Frequent Pattern-growth algorithm, called FFP-growth-LQD, to efficiently mine the fuzzy association rules from inaccurate data considering the learned membership functions in the genetic process. The results obtained over 3 databases of different sizes and kinds of imprecisions demonstrate the effectiveness of the proposed algorithm.

MSC:

68T05 Learning and adaptive systems in artificial intelligence
68T20 Problem solving in the context of artificial intelligence (heuristics, search strategies, etc.)
68T37 Reasoning under uncertainty in the context of artificial intelligence
PDF BibTeX XML Cite
Full Text: DOI

References:

[1] Aggarwal, C.; Yu, S., A survey of uncertain data algorithms and applications, IEEE Trans. Knowl. Data Eng., 21, 5, 1-15, (2009)
[2] R. Agrawal, R. Srikant, Fast algorithms for mining association rules, in: International Conference on Very Large Data Bases, Santiago de Chile, 1994, pp. 487-499.
[3] Alcalá, R.; Alcalá-Fdez, J.; Gacto, M.; Herrera, F., Rule base reduction and genetic tuning of fuzzy systems based on the linguistic 3-tuples representation, Soft Comput., 11, 5, 401-419, (2007) · Zbl 1147.68063
[4] Alcalá, R.; Gacto, M.; Herrera, F., A fast and scalable multi-objective genetic fuzzy system for linguistic fuzzy modeling in high-dimensional regression problems, IEEE Trans. Fuzzy Syst., 19, 4, 666-681, (2011)
[5] Alcalá-Fdez, J.; Alcalá, R.; Herrera, F., A fuzzy association rule-based classification model for high-dimensional problems with genetic rule selection and lateral tuning, IEEE Trans. Fuzzy Syst., 19, 5, 857-872, (2011)
[6] Chen, C.-H.; He, J.-S.; Hong, T.-P., Moga-based fuzzy data mining with taxonomy, Knowl.-Based Syst., 54, 53-65, (2013)
[7] Chen, C.-H.; Hong, T.-P.; Tseng, V., Finding Pareto-front membership functions in fuzzy data mining, Int. J. Comput. Intell. Syst., 5, 2, 343-354, (2012)
[8] Cordón, O.; Herrera, F.; Hoffmanm, F.; Magdalena, L., Genetic fuzzy systems: evolutionary tuning and learning of fuzzy knowledge bases, (2001), World Scientific Singapore
[9] Cordón, O.; Herrera, F.; Hoffmann, F.; Magdalena, L., Genetic fuzzy systems: evolutionary tuning and learning of fuzzy knowledge bases, (Advances in Fuzzy Systems - Applications and Theory, vol. 19, (2001), World Scientific) · Zbl 1042.68098
[10] Couso, I.; Dubois, D.; Sánchez, L., Random sets and random fuzzy sets as ill-perceived random variables, springerbriefs in applied sciences and technology, (2014), Springer International Publishing
[11] Couso, I.; Sanchez, L., Higher order models for fuzzy random variables, Fuzzy Sets Syst., 159, 3, 237-258, (2008) · Zbl 1178.60004
[12] Delgado, M.; Marín, N.; Sánchez, D.; Vila, M., Fuzzy association rules: general model and applications, IEEE Trans. Fuzzy Syst., 11, 2, 214-225, (2003)
[13] Driankov, D.; Hellendoorn, H.; Reinfrank, M., An introduction to fuzzy control, (1993), Springer-Verlag
[14] Dubois, D.; Prade, H., The mean value of a fuzzy number, Fuzzy Sets Syst., 24, 3, 279-300, (1987) · Zbl 0634.94026
[15] Dubois, D.; Prade, H., Fuzzy sets - a convenient fiction for modeling vagueness and possibility, IEEE Trans. Fuzzy Syst., 2, 1, 16-21, (1994)
[16] Dubois, D.; Prade, H.; Sudkamp, T., On the representation, measurement, and discovery of fuzzy associations, IEEE Trans. Fuzzy Syst., 13, 2, 250-262, (2005)
[17] Eiben, A.; Smith, J., Introduction to evolutionary computing, (2003), Springer-Verlag · Zbl 1028.68022
[18] Eshelman, L., The chc adaptive search algorithm: how to have safe search when engaging in nontraditional genetic recombination, (Rawlin, G., Foundations of Genetic Algorithms, vol. 1, (1991), Morgan Kaufmann), 265-283
[19] Eshelman, L.; Schaffer, J., Real-coded genetic algorithms and interval schemata, (Whitley, D., Foundations of Genetic Algorithms, vol. 2, (1993), Morgan Kaufmann), 187-202
[20] Gacto, M.; Alcalá, R.; Herrera, F., Integration of an index to preserve the semantic interpretability in the multiobjective evolutionary rule selection and tuning of linguistic fuzzy systems, IEEE Trans. Fuzzy Syst., 18, 3, 515-531, (2010)
[21] Goldberg, D., Genetic algorithms in search, optimization, and machine learning, (1989), Addison-Wesley New York · Zbl 0721.68056
[22] González, A.; Pérez, R., A study about the inclusion of linguistic hedges in a fuzzy rule learning algorithm, Int. J. Uncertain. Fuzziness Knowl.-Based Syst., 7, 3, 257-266, (1999) · Zbl 1087.68615
[23] Han, J.; Kamber, M., Data mining: concepts and techniques, (2006), Morgan Kaufmann San Fransisco · Zbl 1445.68004
[24] Herrera, F., Genetic fuzzy systems: taxonomy, current search trends and prospects, Evol. Intell., 1, 1, 27-46, (2008)
[25] Hong, T.; Kuo, C.; Chi, S., Trade-off between time complexity and number of rules for fuzzy mining from quantitative data, Int. J. Uncertain Fuzziness Knowl.-Based Syst., 9, 5, 587-604, (2001) · Zbl 1113.68438
[26] Hong, T.-P.; Lee, Y.-C., An overview of mining fuzzy association rules, (Bustince, H.; Herrera, F.; Montero, J., Studies in Fuzziness and Soft Computing, vol. 220, (2008), Springer Berlin/Heidelberg), 397-410
[27] Ishibuchi, H.; Nakashima, T.; Yamamoto, T., Fuzzy Association Rules for Handling Continuous Attributes, vol. 1, (2001), Pusan Korea
[28] Kruse, R.; Schwecke, E.; Heinsohn, J., Uncertainty and vagueness in knowledge based systems, artificial intelligence, (1991), Springer Berlin, Heidelberg · Zbl 0755.68129
[29] Lin, C.-W.; Hong, T.-P.; Lu, W.-H., Linguistic data mining with fuzzy fp-trees, Expert Syst. Appl., 37, 6, 4560-4567, (2010)
[30] Liu, B.-D.; Chen, C.-Y.; Tsao, J.-Y., Design of adaptative fuzzy logic controller based on linguistic-hedge and genetic algorithms, IEEE Trans. Syst. Man Cybernet. Part B: Cybernet., 31, 1, 32-53, (2001)
[31] Lozano, M.; Herrera, F.; Krasnogor, N.; Molina, D., Real-coded memetic algorithms with crossover Hill-climbing, Evol. Comput., 12, 3, 273-302, (2004)
[32] E.H. Mamdani, Application of fuzzy algorithms for control of simple dynamic plant, in: Proc. IEEE, vol. 121, 1974, pp. 1585-1588.
[33] Martin, D.; Rosete, A.; Alcalá-Fdez, J.; Herrera, F., A new multiobjective evolutionary algorithm for mining a reduced set of interesting positive and negative quantitative association rules, IEEE Trans. Evol. Comput., 18, 1, 54-69, (2014)
[34] Martin, D.; Rosete, A.; Alcalá-Fdez, J.; Herrera, F., QAR-CIP-NSGA-II: a new multi-objective evolutionary algorithm to mine quantitative association rules, Inform. Sci., 258, 1-28, (2014)
[35] C. Moraga, R. Salas, A new aspect for the optimization of fuzzy if-then rules, in: Proc. of the 35th International Symposium on Multiple-Valued Logic (ISMVL’05), 2005, pp. 160-165.
[36] Palacios, A.; Gacto, M.; Alcalá-Fdez, J., Mining fuzzy association rules from low quality data, Soft Comput., 16, 5, 883-901, (2012)
[37] Peia, B.; Zhaoa, S.; Chena, H.; Zhoua, X.; Chena, D., Farp: mining fuzzy association rules from a probabilistic quantitative database, Inform. Sci., 237, 242-260, (2013)
[38] Sanchez, L.; Suarez, M.; Villar, J.; Couso, I., Mutual information-based feature selection and partition design in fuzzy rule-based classifiers from vague data, Int. J. Approx. Reason., 49, 3, 607-622, (2008)
[39] J. Shin, T. Bond, Results of an icing test on a naca 0012 airfoil in the nasa lewis icing research tunnel, Reno, NV, 1992, pp. 1-9.
[40] L.Sánchez, I. Couso, J. Casillas, Modeling vague data with genetic fuzzy systems under a combination of crisp and imprecise criteria, Honolulu, Hawaii, USA, 2007, pp. 30-37.
[41] Sun, K.; Fengshan, B., Mining weighted association rules without preassigned weights, IEEE Trans. Knowl. Data Eng., 20, 4, 489-495, (2008)
[42] Wang, C.; Lee, W.; Pang, C., Applying fuzzy fp-growth to mine fuzzy association rules, World Acad. Sci. Eng. Technol., 65, 956-962, (2010)
[43] Weng, C.-H.; Chen, Y.-L., Mining fuzzy association rules from uncertain data, Knowl. Inform. Syst., 23, 2, 129-152, (2010)
[44] Zhang, C.; Zhang, S., Association rule mining: models and algorithms series, (Lecture Notes in Computer Science, LNAI 2307, (2002), Springer-Verlag Berlin) · Zbl 0998.68047
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.