Systems Engineering and Electronics ›› 2021, Vol. 43 ›› Issue (12): 3462-3469.doi: 10.12305/j.issn.1001-506X.2021.12.06

• Electronic Technology • Previous Articles     Next Articles

Spectrogram extraction method for acoustic scene data based on neural network

Juan WEI1,*, Zhikai DING1, Fangli NING2   

  1. 1. School of Telecommunication Engineering, Xidian University, Xi'an 710071, China
    2. School of Mechanical Engineering, Northwestern Polytechnical University, Xi'an 710072, China
  • Received:2020-10-06 Online:2021-11-24 Published:2021-11-30
  • Contact: Juan WEI

Abstract:

In complex acoustic scene classification (ASC) tasks, the deep convolution neural network with Mel spectrum as input has good recognition ability. However, the Mel filter bank is designed based on the physiological characteristics of human ears and is not the optimal filter bank for ASC. To solve this problem, spectrogram extraction neural network (SENN) is proposed to replace the traditional Mel-spectrum extraction process, and by training this model, the spectrogram is automatically adapted to the acoustic scene data set. SENN is connected to ResNet50 as the ASC architecture, and the DCASE2019 acoustic scene data set is used for training and testing. The experimental results show that this architecture has higher recognition rate than traditional models and can effectively adjust the frequency curve, amplitude of filters and filter shape.

Key words: acoustic scene classificationcan (ASC), deep convolutional neural network (DCNN), spectrogram extraction neural network (SENN), Mel-spectrum

CLC Number: 

[an error occurred while processing this directive]