Journal of Systems Engineering and Electronics, 2009, Vol. 31, Issue (01): 195-199.

Efficient discretization algorithm for continuous attributes

ZHAO Jing-xian1,2, NI Chun-peng1, ZHAN Yuan-rui1, DU Zi-ping2   

  1. School of Management, Tianjin Univ., Tianjin 300072, China;
  2. School of Economics and Management, Tianjin Univ. of Science & Technology, Tianjin 300222, China
  • Received: 2007-10-14  Revised: 2008-05-21  Online: 2009-01-25  Published: 2010-01-03

Abstract: Based on an analysis of the characteristics of cut points in entropy-based discretization, an attribute discretization algorithm is presented that merges the attribute values at boundary points and applies an inconsistency check. Compared with traditional discretization algorithms, the proposed method merges only the attribute values at boundary points, determines the number of cut points automatically instead of requiring it to be set in advance, applies simple rules to merge intervals, and greatly reduces the computational cost. It is suitable for discretizing large-scale, high-dimensional databases. Because the chosen set of cut points is checked for inconsistency, the algorithm has a global property. Experiments show that the method can improve the simplicity and the prediction precision of the classification rules.
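
The abstract does not spell out the procedure, so the following Python sketch only illustrates two ingredients it names: boundary points as candidate cut points, and an inconsistency check on a discretized data set. It is not the authors' algorithm, and the interval-merging rules are not reproduced. The function names, the use of midpoints as cut values, and the toy data are all assumptions made for illustration.

```python
from bisect import bisect_right
from collections import Counter, defaultdict


def boundary_cut_points(values, labels):
    """Candidate cuts between adjacent distinct values whose examples
    do not all share one class (assumed definition of a boundary point)."""
    classes_at = defaultdict(set)
    for v, y in zip(values, labels):
        classes_at[v].add(y)
    distinct = sorted(classes_at)
    cuts = []
    for lo, hi in zip(distinct, distinct[1:]):
        # More than one class across the two neighbouring values: boundary.
        if len(classes_at[lo] | classes_at[hi]) > 1:
            cuts.append((lo + hi) / 2)  # midpoint is an illustrative choice
    return cuts


def to_interval(value, cuts):
    """Index of the interval that `value` falls into, given sorted cuts."""
    return bisect_right(cuts, value)


def inconsistency_rate(rows, labels):
    """Fraction of examples that disagree with the majority class among
    examples sharing the same discretized attribute tuple."""
    groups = defaultdict(Counter)
    for row, y in zip(rows, labels):
        groups[tuple(row)][y] += 1
    clashes = sum(sum(c.values()) - max(c.values()) for c in groups.values())
    return clashes / len(labels)


# Toy data (hypothetical): one continuous attribute, two classes.
values = [1, 2, 3, 4, 5, 6]
labels = ["a", "a", "a", "b", "b", "a"]

cuts = boundary_cut_points(values, labels)
rows_full = [(to_interval(v, cuts),) for v in values]
rows_merged = [(to_interval(v, cuts[:1]),) for v in values]  # drop last cut

print(cuts)                                     # [3.5, 5.5]
print(inconsistency_rate(rows_full, labels))    # 0.0
print(inconsistency_rate(rows_merged, labels))  # 1/6: one example now clashes
```

In this toy case, dropping the second cut merges two intervals and makes one of the six examples inconsistent, which is the kind of signal an inconsistency check could use to reject a merge.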

