系统工程与电子技术 ›› 2019, Vol. 41 ›› Issue (6): 1210-1217.doi: 10.3969/j.issn.1001-506X.2019.06.06

• 电子技术 • 上一篇    下一篇

基于改进匹配网络的单样本学习

蒋留兵1,3, 周小龙2,3, 姜风伟2,3, 车俐1,3   

  1. 1. 桂林电子科技大学计算机与信息安全学院, 广西 桂林 541004;
    2. 桂林电子科技大学信息与通信学院, 广西 桂林 541004;
    3. 桂林电子科技大学无线宽带通信与信号处理重点实验室, 广西 桂林 541004
  • 出版日期:2019-05-27 发布日期:2019-05-27

Oneshot learning based on improved matching network

JIANG Liubing1,3, ZHOU Xiaolong2,3, JIANG Fengwei2,3, CHE Li1,3   

  1. 1.School of Computer and Information Security, Guilin University of Electronic Technology, Guilin 541004, China;
    2. School of Information and Communication, Guilin University of Electronic Technology, Guilin 541004, China;
    3. Key Laboratory of Wireless Broadband Communication and Signal Processing in
    Guangxi, Guilin University of Electronic Technology, Guilin 541004, China
  • Online:2019-05-27 Published:2019-05-27

摘要: 当前深度学习是基于大量标注数据样本通过多层网络实现模型自动识别。然而,在很多特殊场景下,难以获取大量标注样本数据,小样本物体识别仍是深度学习下关键性的难题。针对这一问题,首先利用4层深度卷积神经网络(deep convolution neural network,DCNN)提取训练样本和测试样本的高层语义特征,然后基于改进的匹配网络分别采用双向LSTM和attLSTM算法对训练样本和测试样本深入提取更加关键和有用特征并进行编码,最后在平方欧氏距离上利用softmax非线性分类器对测试样本进行分类识别。实验通过Omniglot数据集对提出的改进模型进行测试,取得了非常好的效果。改进的模型即使在最复杂的20 way 1 shot情况下,依然能够达到93.2%的识别率,Vinyals的原创匹配网络模型在20 way 1 shot的情况下只能达到88.2%的识别率,与原创匹配网络模型相比,改进的模型在类别数更多而样本数较少的复杂场景下具有更好的识别效果。

关键词: 深度学习, 小样本, 改进匹配网络, 平方欧氏距离, LSTM

Abstract: The current deep learning is based on a large number of labeled data samples to automatically identify the model through a multilayer network. However, in many special scenarios, it is difficult to obtain a large amount of sample data, and the identification of fewshot learning is still a key problem in deep learning. To solve this problem, the fourlayer deep convolutional neural network (DCNN) is first used to extract the highlevel semantic features of the training samples and the test samples. Then use the bidirectional LSTM and attLSTM algorithms for further extraction and code of more critical and useful features of training samples and test samples based on the improved matching network. Finally,  the softmax nonlinear classifier is used to classify the test samples on the squared euclidean distance. The experiment tests on the proposed improved model with the Omniglot data set and achieves very good results. The improved model can achieve a 93.2% recognition rate even in the most complicated 20way 1shot case, and the original matching network model of Vinyals only achieve 88.2% recognition in the case of 20way 1shot. Compared with the original matching network model, the improved model has a better recognition effect in a complex scenario with more categories and fewer samples.

Key words: deep learning, fewshot, improved matching network, squared Euclidean distance, LSTM