CORE: core-based synthetic minority over-sampling and borderline majority under-sampling technique

Chumphol Bunkhumpornpat; Krung Sinapiromsaran

doi:10.1504/ijdmb.2015.068952

CORE: core-based synthetic minority over-sampling and borderline majority under-sampling technique

Int J Data Min Bioinform. 2015;12(1):44-58. doi: 10.1504/ijdmb.2015.068952.

Authors

Chumphol Bunkhumpornpat, Krung Sinapiromsaran

PMID: 26489141
DOI: 10.1504/ijdmb.2015.068952

Abstract

Class imbalance learning has recently drawn considerable attention among researchers. In this area, a rare class is the class of primary interest from the aim of classification. Unfortunately, traditional machine learning algorithms fail to detect this class because a huge majority class overwhelms a tiny minority class. In this paper, we propose a new technique called CORE to handle the class imbalance problem. The objective of CORE is to strengthen the core of a minority class and weaken the risk of misclassified minority instances nearby the borderline of a majority class. These core and borderline regions are defined by the applicability of a safe level. As a result, a minority class is more crowed and dominant. The experiment shows that CORE can significantly improve the predictive performance of a minority class when its dataset is imbalance.

Publication types

Research Support, Non-U.S. Gov't

MeSH terms

Artificial Intelligence*
Models, Theoretical*