系统工程与电子技术 ›› 2021, Vol. 43 ›› Issue (4): 861-867.doi: 10.12305/j.issn.1001-506X.2021.04.01

• 电子技术 •    下一篇

轻量化的增量式集成学习算法设计

丁嘉辉(), 汤建龙*(), 于正洋()   

  1. 西安电子科技大学电子工程学院, 陕西 西安 710071
  • 收稿日期:2020-08-10 出版日期:2021-03-25 发布日期:2021-03-31
  • 通讯作者: 汤建龙 E-mail:dingjiahuiee@qq.com;jltang@xidian.edu.cn;yzyang_2@stu.xidian.edu.cn
  • 作者简介:丁嘉辉(1995-), 男, 硕士研究生, 主要研究方向为机器学习、电子信息对抗与仿真技术。E-mail: dingjiahuiee@qq.com|汤建龙(1978-), 男, 副教授, 硕士研究生导师, 博士, 主要研究方向为电子信息对抗与仿真技术。E-mail: jltang@xidian.edu.cn|于正洋(1995-), 男, 硕士研究生, 主要研究方向为神经网络、电子信息对抗与仿真技术。E-mail: yzyang_2@stu.xidian.edu.cn
  • 基金资助:
    中央高校基本科研业务费专项资金;西安电子科技大学研究生创新基金资助课题

Design of lightweight incremental ensemble learning algorithm

Jiahui DING(), Jianlong TANG*(), Zhengyang YU()   

  1. School of Electronic Engineering, Xidian University, Xi'an 710071, China
  • Received:2020-08-10 Online:2021-03-25 Published:2021-03-31
  • Contact: Jianlong TANG E-mail:dingjiahuiee@qq.com;jltang@xidian.edu.cn;yzyang_2@stu.xidian.edu.cn

摘要:

常规的分类与回归树算法(classification and regression tree, CART)只能通过重新训练来增加对新类别的认知, 导致样本类别数量较多时训练成本大幅增加。针对这一问题, 提出一种轻量化的增量式集成学习算法: 当新的类别进入到训练集中, 只需在原有集成学习算法中添加具有开集识别能力的CART基分类器, 就可以实现对新类别样本的分类, 而不需要重新训练, 从而降低计算复杂度, 简化学习过程。以辐射源分类为背景的仿真实验表明, 该算法在信噪比大于等于-4 dB的环境中, 可以保持90%以上的分类准确率; 在类别数量较多的情况下, 相比常规CART, 该算法可以大幅度降低新增分类类别所需的训练成本。

关键词: 分类与回归树, 计算复杂度, 开集识别, 集成学习, 辐射源分类

Abstract:

Conventional classification and regression tree (CART) can only increase the cognition of new categories by retraining the entire model, causing a great increase in training costs when the number of sample categories is large. To solve this problem, a lightweight incremental ensemble learning algorithm is proposed. When new categories enter the training set, we can classify those new categories by only adding CART base classifiers with the ability of open set recognition into the original ensemble learning algorithm. No retraining is required, so the computational complexity is reduced and the learning process is simplified. In the simulation experiments with the background of emitter classification, the results show that this algorithm can maintain the classification accuracy of more than 90% when the signal noise ratio equal to or larger than -4 dB. In the case of a large number of categories to be classified, this algorithm can significantly reduce the training cost compared with conventional CART.

Key words: classification and regression tree (CART), computational complexity, open set recognition, ensemble learning, emitter classification

中图分类号: