Bayesian network classifiers for identifying the slope of the customer lifecycle of long-life customers. (English) Zbl 1056.90019

Summary: Undoubtedly, customer relationship management has gained its importance through the statement that acquiring a new customer is several times more costly than retaining and selling additional products to existing customers. Consequently, marketing practitioners are currently often focusing on retaining customers for as long as possible. However, recent findings in relationship marketing literature have shown that large differences exist within the group of long-life customers in terms of spending and spending evolution. Therefore, this paper focuses on introducing a measure of a customer’s future spending evolution that might improve relationship marketing decision making. In this study, from a marketing point of view, we focus on predicting whether a newly acquired customer will increase or decrease his/her future spending from initial purchase information. This is essentially a classification task. The main contribution of this study lies in comparing and evaluating several Bayesian network classifiers with statistical and other artificial intelligence techniques for the purpose of classifying customers in the binary classification problem at hand. Certain Bayesian network classifiers have been recently proposed in the artificial intelligence literature as probabilistic white-box classifiers which allow to give a clear insight into the relationships between the variables of the domain under study. We discuss and evaluate several types of Bayesian network classifiers and their corresponding structure learning algorithms. We contribute to the literature by providing experimental evidence that: (1) Bayesian network classifiers offer an interesting and viable alternative for our customer lifecycle slope estimation problem; (2) the Markov Blanket concept allows for a natural form of attribute selection that was very effective for the application at hand; (3) the sign of the slope can be predicted with a powerful and parsimonious general, unrestricted Bayesian network classifier; (4) a set of three variables measuring the volume of initial purchases and the degree to which customers originally buy in different categories, are powerful predictors for estimating the sign of the slope, and might therefore provide desirable additional information for relationship marketing decision making.


90B15 Stochastic network models in operations research
91B26 Auctions, bargaining, bidding and selling, and other market models


C4.5; BNT
Full Text: DOI


[1] S. Acid, L.M. Campos, An algorithm for finding minimum d-separating sets in belief networks, in: Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI), Portland, Oregon, USA, 1996, pp. 3-10
[2] Baesens, B.; Viaene, S.; Van den Poel, D.; Vanthienen, J.; Dedene, G., Using Bayesian neural networks for repeat purchase modelling in direct marketing, European journal of operational research, 138, 1, 191-211, (2002) · Zbl 1008.90515
[3] Blattberg, R.C.; Deighton, J., Manage marketing by the customer equity test, Harvard business review, July-Aug, 136-144, (1996)
[4] Brodie, R.J.; Coviello, N.E.; Brookes, R.W.; Little, V., Towards a paradigm shift in marketing? an examination of current marketing practices, Journal of marketing management, 13, 383-406, (1997)
[5] Buntine, W., A guide to the literature on learning probabilistic networks from data, IEEE transactions on knowledge and data engineering, 8, 195-210, (1996)
[6] J. Cheng, Powerpredictor system, 2000. Available from <http://www.cs.ualberta.ca/jcheng/bnpp.htm>
[7] J. Cheng, D.A. Bell, W. Liu, An algorithm for Bayesian belief network construction from data, in: Proceedings of the Sixth International Workshop on Artificial Intelligence and Statistics (AI and STAT), Fort Lauderdale, Florida, USA, 1997, pp. 83-90
[8] J. Cheng, D.A. Bell, W. Liu, Learning belief networks from data: An information theory based approach, in: Proceedings of the Sixth ACM Conference on Information and Knowledge Management (CIKM), Las Vegas, Nevada, USA, 1997, pp. 325-331
[9] J. Cheng, R. Greiner, Comparing Bayesian network classifiers, in: Proceedings of the Fifteenth Conference on Uncertainty in Artificial Intelligence (UAI), Stockholm, Sweden, 1999, pp. 101-108
[10] J. Cheng, R. Greiner, Learning Bayesian belief network classifiers: Algorithms and system, in: Proceedings of the Fourteenth Canadian Conference on Artificial Intelligence (AI), 2001 · Zbl 0984.68191
[11] Chow, C.K.; Liu, C.N., Approximating discrete probability distributions with dependence trees, IEEE transactions on information theory, 14, 3, 462-467, (1968) · Zbl 0165.22305
[12] G.J. Cullinan, Picking them by their batting averages recency-frequency-monetary method of controlling circulation, Manual release 2103, Direct Mail/Marketing Association, NY, 1977
[13] De Long, E.R.; De Long, D.M.; Clarke-Pearson, D.L., Comparing the areas under two or more correlated receiver operating characteristic curves: A nonparametric approach, Biometrics, 44, 837-845, (1988) · Zbl 0715.62207
[14] Dietterich, T.G., Approximate statistical tests for comparing supervised classification learning algorithms, Neural computation, 10, 7, 1895-1924, (1998)
[15] Duda, R.O.; Hart, P.E., Pattern classification and scene analysis, (1973), John Wiley New York · Zbl 0277.68056
[16] Egan, J.P., Signal detection theory and ROC analysis. series in cognition and perception, (1975), Academic Press New York
[17] Everitt, B.S., The analysis of contingency tables, (1977), Chapman and Hall London · Zbl 0777.62060
[18] T. Fawcett, Using rule sets to maximize roc performance, in: Proceedings of the IEEE International Conference on Data Mining, San Jose, California, USA, 2001
[19] Fayyad, U.M.; Irani, K.B., Multi-interval discretization of continuous-valued attributes for classification learning, (), 1022-1029
[20] Fornell, C.; Wernerfelt, B., Defensive marketing strategy by customer complaint management: A theoretical analysis, Journal of marketing research, 24, 337-346, (1987)
[21] Friedman, N.; Geiger, D.; Goldszmidt, M., Bayesian network classifiers, Machine learning, 29, 131-163, (1997) · Zbl 0892.68077
[22] Ganesh, J.; Arnold, M.J.; Reynolds, K.E., Understanding the customer base of service providers: an examination of the differences between switchers and stayers, Journal of marketing, 64, 65-87, (2000)
[23] Geiger, D.; Heckerman, D., Knowledge representation and inference in similarity networks and Bayesian multinets, Artificial intelligence, 82, 45-74, (1996)
[24] Geiger, D.; Verma, T.S.; Pearl, J., Identifying independence in Bayesian networks, Networks, 20, 5, 507-534, (1990) · Zbl 0724.05066
[25] Grönroos, C., From marketing mix to relationship marketing–towards a paradigm shift in marketing, Management decision, 35, 4, 322-339, (1997)
[26] Hanley, J.A.; McNeil, B.J., The meaning and use of the area under a receiver operating characteristic (ROC) curve, Radiology, 148, 839-843, (1983)
[27] Hanley, J.A.; McNeil, B.J., A method of comparing the areas under receiver operating characteristic curves derived from the same cases, Radiology, 148, 839-843, (1983)
[28] Heckerman, D., Probabilistic similarity networks, (1991), MIT Press Cambridge, MA
[29] D. Heckerman, A tutorial on learning with Bayesian networks, Technical Report MSR-TR-95-06, Microsoft Research, 1995 · Zbl 0921.62029
[30] John, G.H.; Langley, P., Estimating continuous distributions in Bayesian classifiers, (), 338-345
[31] Knox, S., Loyalty-based segmentation and the customer development process, European management journal, 16, 6, 729-737, (1998)
[32] J.B. Kruskal Jr., On the shortest spanning subtree of a graph and the travelling salesman problem, in: Proceedings of the American Mathematics Society, vol. 7, 1956, pp. 48-50
[33] Langley, P.; Iba, W.; Thompson, K., An analysis of Bayesian classifiers, (), 223-228
[34] Lauritzen, S.L., Graphical models, (1996), Clarendon Press Oxford · Zbl 0907.62001
[35] K. Murphy, Bayes net matlab toolbox, 2001. Available from <http://www.cs.berkeley.edu/ murphyk/Bayes/bnt.html>
[36] Pearl, J., Probabilistic reasoning in intelligent systems: networks for plausible inference, (1988), Morgan Kaufmann San Francisco, CA
[37] Peppers, D.; Rogers, M., Enterprise one to one: tools for competing in the interactive age, (1997), Doubleday New York, USA
[38] Provost, F.; Fawcett, T.; Kohavi, R., The case against accuracy estimation for comparing classifiers, (), 445-453
[39] Quinlan, J.R., C4.5 programs for machine learning, (1993), Morgan Kaufmann San Francisco, CA
[40] Reichheld, F.F., The loyalty effect, (1996), Harvard Business School Press Cambridge, MA
[41] Reichheld, F.F., Lead for loyalty, Harvard business review, July, 76-84, (2001)
[42] Reichheld, F.F.; Kenny, D.W., The hidden advantages of customer retention, Journal of retail banking, 4, 19-23, (1990)
[43] Reichheld, F.F.; Sasser, W.E., Zero defections: quality comes to services, Harvard business review, Sept-Okt, 105-111, (1990)
[44] Reinartz, W.J.; Kumar, V., On the profitability of long-life customers in a noncontractual setting: an empirical investigation and implications for marketing, Journal of marketing, 64, 17-35, (2000)
[45] Reinartz, W.J.; Kumar, V., The mismanagement of customer loyalty, Harvard business review, July, 4-12, (2002)
[46] Rosenberg, L.J.; Czepiel, J.A., A marketing approach to customer retention, Journal of consumer marketing, 1, 45-51, (1984)
[47] Rust, R.T.; Zeithaml, V.A.; Lemon, K.N., Driving customer equity: how customer lifetime value is reshaping corporate strategy, (2000), Free Press New York
[48] Swets, J.A.; Pickett, R.M., Evaluation of diagnostic systems: methods from signal detection theory, (1982), Academic Press New York
[49] Thrasher, R.P., Cart: A recent advance in tree-structured List segmentation methodology, Journal of direct marketing, 5, 1, 35-47, (1991)
[50] D. Van den Poel, Response Modeling for Database Marketing using Binary Classification, Ph.D. Thesis, K.U. Leuven, 1999
[51] Van Der Gaag, L.C., Bayesian belief networks: odds and ends, The computer journal, 39, 2, 97-113, (1996) · Zbl 0852.68084
[52] T. Verma, J. Pearl, Causal networks: Semantics and expressiveness, in: Proceedings of the Fourth Workshop on Uncertainty in Artificial Intelligence, Mountain View, CA, USA, 1988, pp. 352-359
[53] Waller, W.G.; Jain, A.K., On the monotonicity of the performance of a Bayesian classifier, IEEE transactions on information theory, 24, 3, 392-394, (1978) · Zbl 0383.62035
This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.