##
**Carpal tunnel syndrome automatic classification: electromyography vs. ultrasound imaging.**
*(English)*
Zbl 1312.92027

Summary: We study automatic classification for the diagnosis of the carpal tunnel syndrome (CTS), a disease frequently observed in occupational medicine. We apply different classification techniques to two real-life medical data sets related to a group of patients reporting the typical symptoms of this syndrome. We are particularly interested in the performance of “box-clustering” (BC), a method that is able to favor readability and interpretation of the results by medical doctors, thanks to its “box-type” output which naturally configures as a medical report. Preliminary results of a basic implementation of BC applied to different data sets already exist in the literature, and here we add more. In particular, in this paper, we apply a recently developed (and specialized) implementation of BC, and we test it for the first time on real-life medical data related to the CTS. Our purpose is to evaluate the performance of BC for automatic diagnosis, as well as, gain in explanation capability and interpretability. This is, in fact, a crucial aspect in medical applications that generally represents a limit for other well-known and powerful classification techniques.

### MSC:

92C55 | Biomedical imaging and signal processing |

94A08 | Image processing (compression, reconstruction, etc.) in information and communication theory |

62H35 | Image analysis in multivariate analysis |

### Keywords:

automatic classification; box-clustering; carpal tunnel syndrome; decision support in medical diagnosis### Software:

WEKA
Full Text:
DOI

### References:

[1] | Alexe, G; Alexe, S; Axelrod, DE; Bonates, TO; Lozina, I; Reiss, M; Hammer, PL, Breast cancer prognosis by combinatorial analysis of gene expression data, Breast Cancer Res, 8, 1-20, (2006) |

[2] | Alexe, S; Blackstone, EH; Hammer, PL; Ishwaran, H; Lauer, MS; Snader, CEP, Coronary risk prediction by logical analysis of data, Ann Oper Res, 119, 15-42, (2003) · Zbl 1026.62119 |

[3] | Alexe G, Hammer PL, Kogan PL (2002) Comprehensive vs. comprehensible classifiers in logical analysis of data, RUTCOR Research Report, RRR 9/2002 · Zbl 1142.62365 |

[4] | Anthony M, Ratsaby J (2012) Using boxes and proximity to classify data into several categories. RUTCOR Research Reports, RRR 7/2012 |

[5] | Arlot, S; Celisse, A, A survey of cross-validation procedures for model selection, Stat Surv, 4, 40-79, (2010) · Zbl 1190.62080 |

[6] | Barakat, N; Diederich, J, Eclectic rule-extraction from support vector machines, Int J Comput Intell, 2, 59-62, (2005) |

[7] | Beekman, R; Visser, LH, Sonography in the diagnosis of carpal tunnel syndrome: a critical review of the literature, Muscle Nerve, 27, 26-33, (2003) |

[8] | Bonates, T; Hammer, PL, Logical analysis of data: from combinatorial optimization to medical applications, Ann Oper Res, 148, 203-225, (2006) · Zbl 1104.92034 |

[9] | Boros, E; Hammer, PL; Ibaraki, T; Kogan, A; Mayoraz, E; Muchnik, I, An implementation of logical analysis of data, IEEE Trans Knowl Data Eng, 12, 292-306, (2000) |

[10] | Boros E, Ibaraki T, Shi L, Yagiura M (2000) Generating all good patterns in polynomial expected time. In: Lecture at the 6th International Symposium on Artificial Intelligence and Mathematics. Ft. Lauderdale, Florida · Zbl 1117.68028 |

[11] | Boros, E; Hammer, PL; Ibaraki, T; Kogan, A, A logical analysis of numerical data, Math Program, 79, 163-190, (1997) · Zbl 0887.90179 |

[12] | Breiman L, Friedman J, Stone CJ, Olshen RA (1984) Classification and regression trees. Chapman & Hall, London |

[13] | Cantatore, FP; Dell’Accio, F; Lapadula, G, Carpal tunnel syndrome: a review, Clin Rheumatol, 16, 596-603, (1997) |

[14] | Carrizosa E, Martin-Barragan B, Romero Morales D (2010) Binarized support vector machines. INFORMS J Comput 22:154-167 · Zbl 1243.62088 |

[15] | Carrizosa, E; Martin-Barragan, B; Romero Morales, D, Detecting relevant variables and interactions in supervised classification, Eur J Oper Res, 213, 260-269, (2011) |

[16] | Crama, Y; Hammer, PL; Ibaraki, T, Cause-effect relationships and partially defined Boolean functions, Ann Oper Res, 16, 299-325, (1988) · Zbl 0709.03533 |

[17] | Du D, Pardalos PM, Wang J (2000) Discrete mathematical problems with medical applications, DIMACS Series, vol 55. American Mathematical Society · Zbl 1028.90039 |

[18] | Eckstein, J; Hammer, PL; Liu, Y; Nediak, M; Simeone, B, The maximum box problem and its application to data analysis, Comput Optim Appl, 23, 285-298, (2002) · Zbl 1028.90039 |

[19] | Ekin O, Hammer PL, Kogan A (1998) Convexity and logical analysis of data. RUTCOR Research Report, RRR 5/1998 · Zbl 0945.68082 |

[20] | Felici G, Spinelli V (2009) Genetic procedure for over-training control in logic mining. Istituto di Analisi dei Sistemi ed Informatica-IASI/CNR. Technical Report 19/2009 |

[21] | Felici G, Simeone B, Spinelli V (2008) Special issue on data mining. In: Sharda R, Voß S (eds) Classification techniques and error control in logic mining., Annals of Information Systems SeriesSpringer, New York |

[22] | Felici, G; Sun, F-S; Truemper, K; Felici, G (ed.); Trintaphyllou, E (ed.), Data mining and knowledge discovery approaches based on rule induction techniques, (2006), New York · Zbl 1117.68028 |

[23] | Felici, G; Truemper, K, A MINSAT approach for learning in logic domains, INFORMS J Comput, 13, 1-17, (2001) |

[24] | Hall, M; Eibe, F; Holmes, G; Pfahringer, B; Reutemann, P; Witten, IH, The WEKA data mining software: an update, SIGKDD Explor, 11, 10-18, (2009) |

[25] | Hammer PL, Kogan A, Simeone B, Szedmák S (2001) Pareto-optimal patterns in logical analysis of data. RUTCOR Research Report, RRR 7/2001 |

[26] | Hammer, PL; Liu, Y; Simeone, B; Szedmák, S, Saturated systems of homogeneous boxes and the logical analysis of numerical data, Discret Appl Math, 144, 103-109, (2004) · Zbl 1078.62503 |

[27] | Huang, J; Ling, CX, Using AUC and accuracy in evaluating learning algorithms, IEEE Trans Knowl Data Eng, 17, 299-310, (2005) |

[28] | Isolani, L; Bonfiglioli, R; Raffi, GB; Violante, FS, Different case definitions to describe the prevalence of occupational carpal tunnel syndrome in meat industry workers, Int Arch Occup Environ Health, 75, 229-234, (2002) |

[29] | Kohavi R (1995) A study of cross-validation and bootstrap for accuracy estimation and model selection. In: Proceedings of the Fourteenth International Joint Conference on artificial intelligence, IJCAI 95. Montrèal, (Quèbec, Canada |

[30] | Lee, D; Holsbeeck, MT; Janevski, PK; Ganos, DL; Ditmars, DM; Darian, VB, Diagnosis of carpal tunnel syndrome: ultrasound versus electromyography, Radiol Clin N Am, 37, 859-872, (1999) |

[31] | Leffler, CT; Gozani, SN; Nguyen, ZQ; Cros, D, An automated electrodiagnostic technique for detection of carpal tunnel syndrome, Neurol Clin Neurophysiol, 3, 2-10, (2000) |

[32] | Mitchell T (1997) Machine learning. McGraw-Hill, New York · Zbl 0913.68167 |

[33] | Mugan J, Truemper K (2008) Discretization of rational data in mathematical methods for knowledge discovery and data mining. In: Felici G, Vercellis C (eds) Chapter I, Hershey: information science reference · Zbl 0887.90179 |

[34] | Pacek, CA; Tang, J; Goitz, RJ; Kaufmann, RA; Li, ZM, Morphological analysis of the carpal tunnel, Hand, 5, 77-81, (2010) |

[35] | Papanicolau, GD; McCabe, SJ; Firrell, J, The prevalence and characteristics of nerve compression symptoms in the general population, J Hand Surg, 12, 712-717, (1987) |

[36] | Pardalos PM, Hansen P (2008) Data mining and mathematical programming. American Mathematical Society, Providence |

[37] | Pardalos PM, Romeijn E (2009) Handbook of optimization in medicine. Springer, New York · Zbl 1157.90004 |

[38] | Sakai, S; Togasaki, M; Yamazaki, K, A note on greedy algorithms for the maximum independent set problem, Discret Appl Math, 126, 313-322, (2003) · Zbl 1013.90108 |

[39] | Simeone B, Boros E, Ricca F, Spinelli V (2011) Incompatibility graphs in data mining, Department of Statistical Sciences, Sapienza, University of Rome, Technical Report 10/2011 (submitted to Journal of Graph Theory) |

[40] | Simeone B, Spinelli V (2007) The optimization problem framework for box clustering approach in logic mining. In: Proceedings of Euro XXII-22nd European Conference on Operational Research, Prague |

[41] | Simeone B, Felici G, Spinelli V (2007) A graph coloring approach for box clustering techniques in logic mining. In: Proceedings of Euro XXII-22nd European Conference on Operational Research, Prague · Zbl 0709.03533 |

[42] | Spinelli V (2009) Logic mining, box-clustering, and graphs. PhD Thesis, University of Rome, Sapienza |

[43] | Spinelli V (2014) Classification and pruning in logic mining. Adv Data Anal Classif (submitted) |

[44] | Spinelli V (2014) Problems and algorithms in Box-Clustering Adv Data Anal Classif (submitted) |

[45] | Wu S, Flach P (2005) A scored AUC metric for classifier evaluation and selection. In: Second Workshop on ROC Analysis in ML, Bonn, Germany |

[46] | Zweig, MH; Campbell, G, Receiver-operating characteristic (ROC) plots: a fundamental evaluation tool in clinical medicine, Clin Chem, 39, 561-577, (1993) |

This reference list is based on information provided by the publisher or from digital mathematics libraries. Its items are heuristically matched to zbMATH identifiers and may contain data conversion errors. It attempts to reflect the references listed in the original paper as accurately as possible without claiming the completeness or perfect precision of the matching.