基于NMF与CNN联合优化的声学场景分类
|
|
韦娟, 杨皇卫, 宁方立
|
Acoustic scene classification based on joint optimization of NMF and CNN
|
|
Juan WEI, Huangwei YANG, Fangli NING
|
|
表4 不同特征的识别准确率对比
|
Table 4 Comparison of recognition accuracy of different features
|
|
| 场景 | 基线系统 | NMF | TNMF | SNMF | CQT | LM | | 沙滩 | 0.753 | 0.751 | 0.747 | 0.835 | 0.895 | 0.887 | | 公交 | 0.718 | 0.893 | 0.813 | 0.928 | 0.930 | 0.922 | | 饭馆 | 0.577 | 0.618 | 0.544 | 0.793 | 0.611 | 0.628 | | 汽车 | 0.971 | 0.962 | 0.945 | 0.942 | 0.978 | 0.941 | | 市中心 | 0.907 | 0.943 | 0.867 | 0.893 | 0.778 | 0.920 | | 林荫道 | 0.795 | 0.769 | 0.892 | 0.925 | 0.881 | 0.855 | | 杂货店 | 0.587 | 0.801 | 0.828 | 0.920 | 0.883 | 0.929 | | 家 | 0.686 | 0.702 | 0.662 | 0.792 | 0.820 | 0.663 | | 图书馆 | 0.571 | 0.725 | 0.691 | 0.658 | 0.783 | 0.685 | | 地铁站 | 0.917 | 0.742 | 0.826 | 0.815 | 0.852 | 0.747 | | 办公室 | 0.998 | 0.965 | 0.950 | 0.941 | 0.875 | 0.942 | | 公园 | 0.702 | 0.695 | 0.712 | 0.705 | 0.545 | 0.723 | | 居民区 | 0.641 | 0.874 | 0.774 | 0.738 | 0.691 | 0.764 | | 火车 | 0.580 | 0.657 | 0.768 | 0.802 | 0.685 | 0.712 | | 电车 | 0.817 | 0.852 | 0.847 | 0.851 | 0.864 | 0.876 | | 总体 | 0.748 | 0.797 | 0.791 | 0.836 | 0.805 | 0.813 | | 预测时间/s | - | 2.6 | 1.1 | 2.7 | 3.1 | 3.3 |
|
|
|