Journal of Systems Engineering and Electronics ›› 2013, Vol. 35 ›› Issue (4): 864-869.doi: 10.3969/j.issn.1001-506X.2013.04.31

• 通信与网络 • 上一篇    下一篇

具有概念漂移的P2P网络流量识别研究

刘三民1,2,孙知信1,3,4   

  1. 1. 南京航空航天大学计算机科学与技术学院,江苏 南京 210016;
    2. 安徽工程大学计算机与信息学院,安徽 芜湖241000;
    3. 南京邮电大学宽带无线通信与传感网技术教育部重点实验室,江苏 南京 210003;
    4. 南京大学计算机软件新技术国家重点实验室,江苏 南京 210093
  • 出版日期:2013-04-17 发布日期:2010-01-03

Research of traffic identification in P2P network with concept drift

LIU San-min1,2, SUN Zhi-xin1,3,4   

  1. 1.College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China;  2.College of Computer and Information, Anhui Polytechnic University, Wuhu 241000, China;  3.Key Laboratory of Broadband Wireless Communication and Sensor Network Technology, Ministry of Education, Nanjing University of Posts and Telecommunications, Nanjing 210003, China;  4.State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210093,China
  • Online:2013-04-17 Published:2010-01-03

摘要:

针对P2P网络流量产生过程中存在概念漂移现象,提出具有概念漂移检测功能的多分类器动态集成流量识别方案。该方案包括概念漂移检测和分类器动态集成两大模块,由卡方统计推断原理实现概念漂移检测模块功能,采用基分类器的性能优先淘汰策略进行动态集成解决流量概念漂移发生后的识别问题。在以贝叶斯分类器、支持向量机、决策树作为基分类器,针对不同集成规模、数据块大小进行仿真实验,结果证明方案是可行的,模型的识别准确率达到82%以上。

Abstract:

To deal with concept drift in P2P traffic identification, a new traffic identification scheme is presented. The scheme consists of two components: concept drift detection module and ensemble module. Based on the chi-square statistic principle the concept drift detection module is implemented, and the ensemble module is established dynamically by base-classifier’s performance. For different ensemble scale and data block, simulation experiments built on the three kinds of base-classifier(Bayes classifier, support vector machine, and decision tree) are done, experiment results show that the scheme is feasible and the accuracy of the scheme is more than 82%.