基于多尺度时频特征感知建模的工业机器异常声检测

张瑞; 周浩; 赵宇; 柳银; 周翊

doi:10.12395/0371-0025.2025241

基于多尺度时频特征感知建模的工业机器异常声检测

Anomalous sound detection for industrial machines using multi-scale time-frequency perceptual modeling

摘要

摘要: 工业机器异常声检测可在非接触、非侵入的条件下实现设备状态的快速判别。然而, 复杂工业噪声环境以及机器不同状态之间的声学相似性会干扰模型对异常声的准确检测。为此, 本文提出一种基于多尺度时频特征感知的异常声检测方法, 以提升模型对不同尺度异常模式的建模能力并兼顾计算效率。该方法采用稠密连接卷积编码器对输入信号进行压缩建模, 并引入多尺度感受野卷积模块, 分别从时间与频率维度提取谱图特征, 增强对异常声分布特性的感知能力。同时, 引入混合数据增强策略以提升模型泛化能力, 并引入加性噪声角度边际损失函数以优化类别间的区分度。在DCASE 2020和2021 Task 2数据集上的实验结果表明, 所提方法在多项指标上均优于主流模型, 验证了其有效性。

Abstract: Anomalous sound detection (ASD) enables rapid, non-invasive machine condition monitoring. However, complex industrial noise environments and the acoustic similarity among different machine operating states can interfere with the accurate detection of anomalous sounds by the model. This paper proposes a multi-scale time-frequency feature modeling approach for ASD, enhancing the perception of anomalous patterns while maintaining computational efficiency. A densely connected convolutional encoder compresses input signals, while a multi-scale receptive field module captures spectrogram features along temporal and frequency dimensions. To further boost generalization and class separability, a hybrid data augmentation strategy and an additive angular margin loss are introduced. Experiments on the DCASE 2020 and 2021 Task 2 datasets show that the proposed method consistently outperforms state-of-the-art models, demonstrating its effectiveness.

HTML全文

参考文献(55)

施引文献

资源附件(0)