Journal of Systems Engineering and Electronics ›› 2009, Vol. 31 ›› Issue (10): 2497-2500.

• 软件、算法与仿真 • 上一篇    下一篇

建立频繁项目集向量的极大频繁项目集挖掘

周海岩   

  1. 淮阴工学院计算机工程系, 江苏, 淮安, 223003
  • 收稿日期:2008-05-26 修回日期:2008-10-29 出版日期:2009-10-20 发布日期:2010-01-03
  • 作者简介:周海岩(1957- ),男,教授,主要研究方向为数据挖掘、智能决策、算法设计与分析、数据库理论.E-mail:zhy_5703@163.com
  • 基金资助:
    江苏省科技攻关项目(BE2006357)资助课题

Maximal frequent item set mining with establishment of frequent item set vectors

ZHOU Hai-yan   

  1. Dept. of Computer Engineering, Huaiyin Inst. of Technology, Huai'an 223003, China
  • Received:2008-05-26 Revised:2008-10-29 Online:2009-10-20 Published:2010-01-03

摘要: 在分析和研究诸多经典关联规则挖掘算法或最大频繁项目集挖掘算法的基础上,提出了一种新的极大频繁项目集挖掘算法BOFPV_MMFIA算法.该算法引入频繁项目集向量FP-V,将极大频繁项目集的挖掘过程转化为频繁项目集向量FP-V的与运算过程.算法只需扫描数据库一次,克服了Apriori及其相关算法产生大量候选集和需多次扫描数据库的缺点.又不同于BOM算法,挖掘频繁k_项目集时,需要进行 次k个向量的与运算.因此,BOFPV_MMFIA算法的效率明显高于Apriori、DMFIA及BOM算法.

Abstract: After analyzing many typical association rule mining algorithms and the maximal frequent item set mining algorithm,a new algorithm of maximal frequent item set mining,named as BOFPV_MMFIA,is proposed.A frequent item set vector FP-V is introduced so as to convert the course of maximal frequent item set mining into the course of AND operation of the FP_V vector.The existing Apriori and its related algorithms produce a lot of candidacy sets and need scanning database many times,and the BOM algorithm entails the AND operation of k vectors with times.Overcoming these drawbacks,the proposed BOFPV_MFIA algorithm needs scanning database only once.Therefore,the proposed algorithm is obviously superior to Apriori and BOM algorithm in efficiency.

中图分类号: