基于谐波和能量特征的单声道浊语音分离方法
Monaural voiced speech separation based on harmonic and energy features
-
摘要: 提出了一种基于谐波和能量特征的单声道浊语音分离方法。该方法将浊语音分离问题转化为声音在时频域的分类问题。首先,在已有谐波特征的基础上,引入能量特征。然后,对于谐波特征明显且能量大的时频单元,在分类器训练阶段复制它们的特征。实验结果表明该方法相比之前的方法有更好的信噪比增益。通过引入能量特征和特征复制,改善了浊语音的分离效果。Abstract: A monaural voiced speech separation approach based on harmonic and energy features is proposed. The method casts voiced speech separation as sound classification problem in time-frequency domain. First, the energy feature is employed to assist harmonic features. Then, the harmonic and energy features of time-frequency units with obvious harmonicity and large energy are replicated in the process of classifier training. Experimental results show that the proposed method obtains better signal-to-noise ratio improvement compared with previous approaches. The voiced speech separation is improved by introducing energy feature and feature replication.