Home > Archive > 2020 > Volume 10 Number 3 (May 2020) >
IJMLC 2020 Vol.10(3): 444-451 ISSN: 2010-3700
DOI: 10.18178/ijmlc.2020.10.3.955

Decision Tree Algorithm with Class Overlapping-Balancing Entropy for Class Imbalanced Problem

Artit Sagoolmuang and Krung Sinapiromsaran

Abstract—The problem of handling a class imbalanced problem by modifying decision tree algorithm has received widespread attention in recent years. A new splitting measure, called class overlapping-balancing entropy (OBE), is introduced in this paper that essentially pay attentions to all classes equally. Each step, the proportion of each class is balanced via the assigned weighted values. They not only depend on equalizing each class, but they also take into account the overlapping region between classes. The proportion of weighted values corresponding to each class is used as the component of Shannon's entropy for splitting the current dataset. From the experimental results, OBE significantly outperforms the conventional splitting measures like Gini index, gain ratio and DCSM, which are used in the well-known decision tree algorithms. It also exhibits superior performance compared to AE and ME that are designed for handling the class imbalanced problem specifically.

Index Terms—Classification problem, class imbalanced learning, class overlapping-balancing entropy, decision tree algorithm.

The authors are with the Department of Mathematics and Computer Science, Faculty of Science, Chulalongkorn University, Bangkok 10330, Thailand (e-mail: a.sagoolmuang@gmail.com, krung.s@chula.ac.th).


Cite: Artit Sagoolmuang and Krung Sinapiromsaran, "Decision Tree Algorithm with Class Overlapping-Balancing Entropy for Class Imbalanced Problem," International Journal of Machine Learning and Computing vol. 10, no. 3, pp. 444-451, 2020.

Copyright © 2020 by the authors. This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

General Information

  • ISSN: 2010-3700 (Online)
  • Abbreviated Title: Int. J. Mach. Learn. Comput.
  • Frequency: Bimonthly
  • DOI: 10.18178/IJMLC
  • Editor-in-Chief: Dr. Lin Huang
  • Executive Editor:  Ms. Cherry L. Chen
  • Abstracing/Indexing: Inspec (IET), Google Scholar, Crossref, ProQuest, Electronic Journals LibraryCNKI.
  • E-mail: ijmlc@ejournal.net

Article Metrics in Dimensions