International Journal of Intelligent Information Systems
Volume 4, Issue 6, December 2015, Pages: 101-105
Received: Nov. 5, 2015;
Accepted: Nov. 22, 2015;
Published: Dec. 14, 2015
Views 3470 Downloads 79
Hooman Sanatkar, Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran
Saman Haratizadeh, Faculty of New Sciences and Technologies, University of Tehran, Tehran, Iran
Imbalanced datasets are datasets with different samples distribution in which the distribution of samples in one class is scientifically more than other class samples. Learning a classification model for such imbalanced data has been shown to be a tricky task. In this paper we will focus on learning classifier systems, and will suggest a new XCS-based approach for learning classification models from imbalanced data sets. The main idea behind the suggested approach is to update the important parameters of the learning method based on the information gathered in each step of learning, in order to provide a fair situation for the minor class, to contribute in building the final model. We have also evaluated our approach by testing it with real-world known imbalanced datasets. The results show that our new algorithm has a high detection rate and a low false positive rate.
An XCS-Based Algorithm for Classifying Imbalanced Datasets, International Journal of Intelligent Information Systems.
Vol. 4, No. 6,
2015, pp. 101-105.
Copyright © 2015 Authors retain the copyright of this article.
This article is an open access article distributed under the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/
) which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
A. Bria, N. karssemeijer and F. Tortorella, "Learning from unbalanced data: A cascade-based approach for detecting clustered microcalcifications," Medical image analysis, 2014.
Xue, Jing-Hao and P. Hall, "Why Does Rebalancing Class-unbalanced Data Improve AUC for Linear Discriminant Analysis," Pattern Analysis and Machine Intelligence,IEEE Transactions on , pp. 1109-1112, 2015.
M. Butz, "Learning classifier systems," in Springer Handbook of Computational Intelligence, Berlin , Springer , 2015, pp. 961-981.
P. L. Lanzi, "Learning classifier systems: a gentle introduction," in companion on Genetic and evolutionary computation companion, 2014.
L. Bull, "A brief history of learning classifier systems: from CS-1 to XCS and its variants," Evolutionary Intelligence, 2015.
N. CHAWLA, K. Bowyer, L. Hall and W. Kegelmeye, "SMOTE: Synthetic minority over-sampling technique," Journal of Artificial Intelligence Research, vol. 15, pp. 321-357, 2002.
N. Japcowicz and S. Stephen, "The Class Imbalance Problem:A Systematic Study," Intelligent Data Analisis, 2002.
Wang, Shuo, L. Leandro, Minku and Xin Ya, "Resampling-Based Ensemble Methods for Online Class Imbalance Learning," Knowledge and Data Engineering, IEEE Transactions on, pp. 1356-1368, 2015.
A. Orriols-Puig, "Facetwise Analysis of Learning Classifier Systems in Imbalanced Domains," Ramon Liull University, 2006.
A. Orriols-Puig and E. Bernad´o-Mansilla, "Evolutionary rule-based systems for imbalanced datasets," Soft Computing Journal, vol. 13, no. 3, pp. 213-225, 2009.
A. Orriols-Puig and E. Bernad´o-Mansilla, "Bounding XCS parameters for unbalanced datasets," in Genetic and Evolutionary Computation Conference, 2006.
A. Orriols-Puig and E. Bernad´o-Mansilla, "The Class Imbalance Problem in Learning Classiﬁer Systems:A Preliminary Study," in Genetic and Evolutionary Computation Conference, 2005.
K. Deb, A. Pratap , S. Agarwal and T. Meyarivan, "A fast and elitist multiobjective genetic algorithm: NSGA-II.," Evolutionary Computation, vol. 6, no. 2, pp. 182-197, 2002.
M. Erickson, A. Mayer and J. Horn, "Multi-objective optimal design of groundwater remediation systems: application of the niched Pareto genetic algorithm (NPGA)," Advances in Water Resources, vol. 25, no. 1, pp. 51-65, 2002.
D. Corne, N. R. Jerram, J. D. Knowles and M. J. Oates, "Corne, David W., et al. "PESA-II: Region-based selection in evolutionary multiobjective optimization," in Genetic and Evolutionary Computation Conference, 2001.