ACOSampling swMATH ID: 41786 Software Authors: Yu, H.; Ni, J.; Zhao, J. Description: ACOSampling: An ant colony optimization-based undersampling method for classifying imbalanced DNA microarray data. In DNA microarray data, class imbalance problem occurs frequently, causing poor prediction performance for minority classes. Moreover, its other features, such as high-dimension, small sample, high noise etc., intensify this damage. In this study, we propose ACOSampling that is a novel undersampling method based on the idea of ant colony optimization (ACO) to address this problem. The algorithm starts with feature selection technology to eliminate noisy genes in data. Then we randomly and repeatedly divided the original training set into two groups: training set and validation set. In each division, one modified ACO algorithm as a variant of our previous work is conducted to filter less informative majority samples and search the corresponding optimal training sample subset. At last, the statistical results from all local optimal training sample subsets are given in the form of frequence list, where each frequence indicates the importance of the corresponding majority sample. We only extracted those high frequency ones and combined them with all minority samples to construct the final balanced training set. We evaluated the method on four benchmark skewed DNA microarray datasets by support vector machine (SVM) classifier, showing that the proposed method outperforms many other sampling approaches, which indicates its superiority. Homepage: https://dl.acm.org/doi/10.1016/j.neucom.2012.08.018 Source Code: https://github.com/Z1997H/ACOSampling Related Software: SMOTE; ADASYN; MWMOTE; JStatCom; GSVM; ECoL; OpenML; catdata; C4.5; ROSEFW-RF; Imbalanced-learn; SMOTEBoost; Scikit; Twister; Hadoop; KEEL; UCI-ml; MapReduce; PCP; HykGene Cited in: 4 Publications all top 5 Cited by 15 Authors 1 Ahn, Gilseung 1 Barella, Victor H. 1 de Souto, Marcílio C. P. 1 Garcia, Luís P. F. 1 Gu, Jason J. 1 Hur, Sun 1 Koziarski, Michał 1 Liu, Fei 1 Lorena, Ana Carolina 1 Park, You-Jin 1 Ponce de Leon Ferreira de Carvalho, André Carlos 1 Qi, Xiaomei 1 Wang, Ying 1 Wozniak, Michal 1 Zhang, Chengjin Cited in 4 Serials 1 Information Sciences 1 Journal of Classification 1 International Journal of Applied Mathematics and Computer Science 1 Journal of Control Science and Engineering Cited in 4 Fields 3 Statistics (62-XX) 2 Computer science (68-XX) 1 General and overarching topics; collections (00-XX) 1 Systems theory; control (93-XX) Citations by Year