Indexed in EI / SCOPUS / CSCD

Chinese Core Journal

Initial/final segmentation using a loss function and acoustic features

  • Abstract: To achieve robust initial/final segmentation that meets the requirements of large-vocabulary continuous speech recognition, this paper proposes a segmentation method that builds a loss function and exploits the quasi-periodicity of voiced sounds together with the duration of initials. The method first computes the autocorrelation function of the speech signal, then builds a cost (loss) function and applies dynamic programming to the autocorrelation results to detect voiced regions. Next, it determines the detection range for the initial from the distribution of initial durations. Finally, within that range, it applies auditory event detection to the signal before and after the onset of the voiced region to separate the initial from the final. Experimental results show that dynamic programming improves voiced-region detection performance relative to a threshold method, that segmenting on the basis of voiced regions improves segmentation accuracy and reduces the influence of noise and of sound-change phenomena in Chinese, and that segmentation performance is only slightly affected by the manner of articulation of the initial.
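The first two steps of the abstract (autocorrelation, then a cost function minimized by dynamic programming) can be illustrated with a minimal sketch. The abstract does not give the paper's actual loss function, so everything specific here is an assumption made for illustration: the per-frame costs (`score` for unvoiced, `1 - score` for voiced), the constant `switch_cost` penalty, and the function names `periodicity` and `detect_voiced`. It shows only why a path-based decision can be more stable than a per-frame threshold, not the authors' method.

```python
import math


def periodicity(frame, min_lag, max_lag):
    """Peak of the energy-normalized autocorrelation over a lag range.

    Quasi-periodic (voiced-like) frames score close to 1; silence scores 0.
    """
    energy = sum(x * x for x in frame)
    if energy == 0.0:
        return 0.0
    best = 0.0
    for lag in range(min_lag, min(max_lag, len(frame) - 1) + 1):
        r = sum(frame[i] * frame[i + lag] for i in range(len(frame) - lag))
        best = max(best, r / energy)
    return best


def detect_voiced(scores, switch_cost=0.5):
    """Label frames 0 (unvoiced) or 1 (voiced) by minimizing a total cost.

    Assumed cost (not from the paper): labeling a frame voiced costs
    1 - score, unvoiced costs score, and each label change between
    neighbouring frames costs switch_cost.
    """
    if not scores:
        return []
    # cost[s] = lowest total cost of any labeling that ends in state s
    cost = [scores[0], 1.0 - scores[0]]
    back = []  # back[t][s] = best previous state when frame t + 1 is in state s
    for score in scores[1:]:
        local = [score, 1.0 - score]
        new_cost, choice = [], []
        for state in (0, 1):
            stay = cost[state]
            switch = cost[1 - state] + switch_cost
            if stay <= switch:
                new_cost.append(stay + local[state])
                choice.append(state)
            else:
                new_cost.append(switch + local[state])
                choice.append(1 - state)
        cost = new_cost
        back.append(choice)
    # Trace the cheapest path back from the last frame to the first.
    state = 0 if cost[0] <= cost[1] else 1
    labels = [state]
    for choice in reversed(back):
        state = choice[state]
        labels.append(state)
    labels.reverse()
    return labels


if __name__ == "__main__":
    # A synthetic periodic frame (period of 20 samples) stands in for voiced speech.
    sine = [math.sin(2 * math.pi * i / 20) for i in range(200)]
    print(periodicity(sine, 10, 40))
    # Hypothetical per-frame scores with a one-frame dip inside a voiced run.
    scores = [0.1, 0.2, 0.9, 0.3, 0.9, 0.8, 0.1]
    print([1 if s > 0.5 else 0 for s in scores])  # per-frame threshold
    print(detect_voiced(scores))                  # dynamic programming
```

With these example scores, a fixed threshold of 0.5 splits the voiced run at the dipped frame, while the dynamic-programming path keeps it as one region because two extra label changes cost more than accepting one poorly scoring frame. The onset of such a region is the point around which the abstract's later steps (duration-based detection range, auditory event detection) would operate.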

     
