Journal of Systems Engineering and Electronics ›› 2011, Vol. 33 ›› Issue (1): 196-0201.doi: 10.3969/j.issn.1001 506X.2011.01.40

Previous Articles     Next Articles

Study on source of classification in imbalanced datasets based on new ensemble classifier

ZHAI Yun1,2,YANG Bing-ru1,QU Wu1,SUI Hai-feng1   

  1. 1. School of Information Engineering, University of Science and Technology Beijing, Beijing 100083, China; 
    2. College of Computer Science, Liaocheng University, Liaocheng 252059, China
  • Online:2011-01-20 Published:2010-01-03

Abstract:

For the issue of classification in imbalanced datasets, this paper presents a new differentiated sampling rate algorithm (DSRA), on this basis, a SVM-Ripper ensemble classifier (SREC) is proposed. SREC employs an unique classifier selection strategy, a novel classifier integration approach and an original classification decision-making method, so that it receives a higher classification accuracy. At the same time, the source of classification in an imbalanced dataset is studied by use of SREC. The simulation results prove that the source of classification in an imbalanced dataset is the aggregation of imbalance between classes, imbalance within a class, sample size as well as the imbalance degree, and only a comprehensive consideration of these factors can better address the issue of classification in imbalanced dataset. 

 

[an error occurred while processing this directive]