基于特征似然度加权和维数缩减的Robust语音端点检测
Robust endpoint detection based on feature weighted likelihood and dimension reduction
-
摘要: 传统的语音端点检测方法在低信噪比环境下可靠性会急剧下降。本文提出了两种特征处理方法:特征的似然度加权和基于散度的维数缩减,来提高噪声下端点检测的性能。通过加权增加动态特征在似然度计算中的比重,可以提高端点检测的噪声Robustness。缩减散度值较小的特征维,对检测精度只有很小的影响,但可以提高检测效率。似然度加权对维数缩减之后的特征同样有效。在Aurora2数据库上的实验结果显示,在干净数据训练的检测模型下,似然度加权可以显著提高噪声下的端点检测性能。对维数缩减后的特征进行似然度加权,获得了与原始特征似然度加权相当的检测性能。这说明本文提出的方法是有效的。Abstract: The performances of the traditional speech endpoint detection algorithms will decline sharply in lower SNR environments. This paper proposes two new methods: feature weighted likelihood and divergence based dimension reduction to improve detecting performance in noise. The weighted likelihood method can increase the proportion of dynamic feature in likelihood score, consequently improves noise robustness of endpoint detection. Reducing these feature dimensions with smaller divergence will degrade the performance little, but can decrease the computation a lot. Weighted likelihood is also effective for reduced feature. The experimental results on Aurora2 show feature weighted likelihood can remarkably improve the detection performance when the model trained on clean data is used to detect speech endpoint in noise. The performance using weighted likelihood on reduced feature is comparable to that on original feature. This proves the method is robust for noise.