一种基于音素模型感知度的发音质量评价方法

A pronunciation quality evaluation method based on the phoneme model perception

摘要: 为了提高发音质量判别精度,提出了一种基于音素模型感知度的发音质量评价方法。它采用不同语音样本集合下样本声学特征的对数后验概率期望差作为音素模型对变异发音的感知度,并以此为基础,生成各音素对应的识别模型候选集。实验表明,所提出的方法使语音识别网络候选音素模型集合尺寸减少约95%;在非母语语音数据库上,该方法评分与人工专家打分相关性为0.828,基于该方法得到的声韵母错误检出率为70.8%,声调错误检出率为42.5%,均优于其它方法。

Abstract: In order to improve the accuracy rating,a pronunciation evaluation method based on the phoneme model perception was proposed.The phoneme model perception of variation pronunciation was defined by the expectation of the difference between the log-posterior probability which is computed by the different corpus sets,and it was the basis of generating the corresponding speech recognition candidate model set.The results of experiments shows that the proposed method reduced the size of speech recognition model set about 95%;In the non-native speech database,the proposed method could achieve a correlation of 0.828 with the expert scoring,with the errors detection rate of 70.8% for Chinese initials and vowels of 42.5%for tone,which were both better than other methods.