Journal of Systems Engineering and Electronics ›› 2009, Vol. 31 ›› Issue (3): 671-676.

• 软件、算法与仿真 • 上一篇    下一篇

一种基于滑动窗口的多关系模式频度更新算法

侯伟1, 杨炳儒1, 吴晨生2, 周谆1   

  1. 1. 北京科技大学信息工程学院, 北京, 100083;
    2. 北京市科学技术情报研究所, 北京, 100037
  • 收稿日期:2008-02-01 修回日期:2008-07-05 出版日期:2009-03-20 发布日期:2010-01-03
  • 作者简介:侯伟(1981- ),男,博士研究生,主要研究方向为数据挖掘.E-mail:dr.houwei@gmail.com
  • 基金资助:
    国家自然科学基金资助课题(60675030)

Multi-relational pattern frequency update algorithm based on sliding window

HOU Wei1, YANG Bing-ru1, WU Chen-sheng2, ZHOU Zhun1   

  1. 1. School of Information Engineering, Univ. of Science and Technology Beijing, Beijing 100083, China;
    2. Beijing Municipal Inst. of Science and Technology Information, Beijing 100037, China
  • Received:2008-02-01 Revised:2008-07-05 Online:2009-03-20 Published:2010-01-03

摘要: 面向多个相关数据流的挖掘算法研究尚处于起步阶段。作为多数据流挖掘算法的基础,模式频度更新算法仍然存在计数不准确、性能较低等问题,难以以此构造有效的挖掘算法。通过引入多关系挖掘概念以及目标关系定义,进而限定计数对象,提出了一种基于滑动窗口的多关系模式频度更新算法MRPFU。该算法监视各数据流窗口的更新情况,采用计数传播策略,减少了时间与空间复杂度。理论分析及实验结果证明了所提算法的有效性且具有较高性能。

Abstract: Presently,the study of mining algorithms for multiple correlated data streams is still at a primitive stage.As the basis of mining multiple data streams,the methods of updating the frequencies of patterns,are bearing problems of count deviation,low performances etc.Consequently,efficient mining algorithms are difficult to be built either.The concepts of multi-relational data mining and target relation are introduced firstly,and the count object is defined accordingly.Then an algorithm based on sliding windows for updating frequencies of multi-relational patterns is proposed,which monitors the updates of streams,adopts the strategy of count propagation,and relieves the complexity of runtime and space.The theoretical analysis and experiments prove its effectiveness and performance.

中图分类号: