Automatic detection and evaluation of Erhua in the Putonghua proficiency test
Abstract: This paper proposes a method for automatically detecting and evaluating Erhua (also called "r-retroflexion" or "retroflex suffixation") in the Putonghua proficiency test (PSC). Within the framework of an existing computer-aided pronunciation evaluation system, we analyze the phonological rules and acoustic characteristics of Erhua in depth and treat its detection and evaluation as a typical classification problem. We select several representative acoustic features and compare a variety of classification algorithms. The results show that a boosted ensemble of classification and regression trees (Boosting CART) makes full use of the acoustic features of Erhua and reaches a classification accuracy of 92.41%. Further analysis of the acoustic feature groups shows that formants, pronunciation confidence, and duration are the most important cues to Erhua, and that using these cues enables effective automatic detection and evaluation of Erhua.
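To make the classification framing concrete, the following is a minimal sketch of boosting over tree-based weak learners. It is not the authors' implementation: the abstract does not give the feature definitions, data, or tree settings, so everything here is an assumption. The weak learners are depth-1 trees (decision stumps) combined with AdaBoost as a stand-in for Boosting CART, and the three features (a third-formant value in Hz, a pronunciation confidence score, and a relative duration) are hypothetical, filled with synthetic values.

```python
import math
import random


def train_stump(X, y, w):
    """Return the threshold split (err, feature, threshold, sign) with lowest weighted error."""
    best = None
    for j in range(len(X[0])):
        for thr in sorted({row[j] for row in X}):
            for sign in (1, -1):
                err = sum(wi for row, yi, wi in zip(X, y, w)
                          if (sign if row[j] <= thr else -sign) != yi)
                if best is None or err < best[0]:
                    best = (err, j, thr, sign)
    return best


def stump_predict(stump, row):
    _, j, thr, sign = stump
    return sign if row[j] <= thr else -sign


def adaboost(X, y, rounds=10):
    """Fit `rounds` stumps, reweighting misclassified samples after each round."""
    n = len(X)
    w = [1.0 / n] * n
    model = []
    for _ in range(rounds):
        stump = train_stump(X, y, w)
        err = min(max(stump[0], 1e-10), 1 - 1e-10)  # clamp to keep the log finite
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((alpha, stump))
        w = [wi * math.exp(-alpha * yi * stump_predict(stump, row))
             for row, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return model


def predict(model, row):
    """Weighted vote of all stumps: +1 = Erhua realized, -1 = Erhua missing."""
    score = sum(alpha * stump_predict(stump, row) for alpha, stump in model)
    return 1 if score >= 0 else -1


def sample(label, n):
    """Synthetic [F3 in Hz, confidence, relative duration] rows (made-up values).

    Assumption for illustration only: syllables with Erhua have a lower F3,
    a higher confidence score, and a slightly longer duration.
    """
    f3, conf, dur = (2000.0, 0.8, 1.2) if label == 1 else (2700.0, 0.5, 1.0)
    return [[random.gauss(f3, 150.0), random.gauss(conf, 0.1), random.gauss(dur, 0.15)]
            for _ in range(n)]


random.seed(0)
X = sample(1, 50) + sample(-1, 50)
y = [1] * 50 + [-1] * 50

model = adaboost(X, y, rounds=10)
accuracy = sum(predict(model, row) == yi for row, yi in zip(X, y)) / len(X)
print(f"training accuracy on synthetic data: {accuracy:.2f}")
```

The accuracy printed here describes only the synthetic toy data and has no relation to the 92.41% reported in the abstract. In practice, a library implementation of boosted trees with deeper CART learners and held-out evaluation data would be the more natural choice.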