A Pattern Recognition Approach to Four-Tone Classification of Isolated Words in Standard Colloquial Chinese (Putonghua)
Abstract: Tone recognition of isolated words is an important task in Putonghua speech recognition. This paper describes a new pattern recognition approach to classifying the four tones of Putonghua. On the basis of a large number of statistical experiments, four parameters are defined to describe the pitch frequency track. Under the assumption that the measured parameters follow a multidimensional Gaussian distribution (an assumption the statistical experiments show to be reasonable), a distance function between the parameter vector and each tone type is derived according to the minimum-probability-of-error criterion, so that the decision is optimal in the statistical sense. Speaker-independent four-tone classification experiments show that the proposed method gives very satisfactory results.
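The decision rule summarized in the abstract can be illustrated with a minimal sketch. It assumes the textbook minimum-error (Bayes) rule for Gaussian classes, in which the distance of a parameter vector x to class i is (x - m_i)^T C_i^{-1} (x - m_i) + ln|C_i| - 2 ln P_i; the abstract does not give the paper's own formula or its four pitch-track parameters, so the function names, the priors, and the toy two-dimensional data below are all hypothetical.

```python
import math


def fit_gaussian(samples):
    """Maximum-likelihood mean vector and covariance matrix of feature vectors."""
    n, d = len(samples), len(samples[0])
    mean = [sum(x[j] for x in samples) / n for j in range(d)]
    cov = [[sum((x[a] - mean[a]) * (x[b] - mean[b]) for x in samples) / n
            for b in range(d)] for a in range(d)]
    return mean, cov


def solve_and_logdet(cov, v):
    """Return (cov^-1 v, ln|det cov|) via Gaussian elimination with pivoting."""
    d = len(v)
    a = [row[:] + [v[i]] for i, row in enumerate(cov)]  # augmented matrix
    logdet = 0.0
    for k in range(d):
        p = max(range(k, d), key=lambda r: abs(a[r][k]))
        if a[p][k] == 0.0:
            raise ValueError("singular covariance matrix")
        a[k], a[p] = a[p], a[k]
        logdet += math.log(abs(a[k][k]))
        for r in range(k + 1, d):
            f = a[r][k] / a[k][k]
            for c in range(k, d + 1):
                a[r][c] -= f * a[k][c]
    y = [0.0] * d
    for k in reversed(range(d)):  # back substitution
        s = a[k][d] - sum(a[k][c] * y[c] for c in range(k + 1, d))
        y[k] = s / a[k][k]
    return y, logdet


def distance(x, mean, cov, prior):
    """Gaussian minimum-error distance: Mahalanobis term + ln|C| - 2 ln P."""
    diff = [xi - mi for xi, mi in zip(x, mean)]
    y, logdet = solve_and_logdet(cov, diff)
    mahalanobis = sum(di * yi for di, yi in zip(diff, y))
    return mahalanobis + logdet - 2.0 * math.log(prior)


def classify(x, models):
    """Pick the class label whose distance to x is smallest."""
    return min(models, key=lambda label: distance(x, *models[label]))


if __name__ == "__main__":
    # Toy 2-D samples for two hypothetical classes (not data from the paper).
    training = {
        "tone_a": [[0, 0], [2, 0], [0, 2], [2, 2]],
        "tone_b": [[5, 5], [7, 5], [5, 7], [7, 7]],
    }
    # Equal priors are an assumption made only for this illustration.
    models = {k: (*fit_gaussian(s), 0.5) for k, s in training.items()}
    print(classify([1.5, 0.5], models))
```

In the setting the abstract describes, each feature vector would hold the four pitch-track parameters and `models` would hold one (mean, covariance, prior) triple per tone; minimizing this distance is equivalent to choosing the class with the largest Gaussian posterior probability.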