EI / SCOPUS / CSCD 收录

中文核心期刊

音节混淆字典及在汉语口音自适应中的应用研究

Study on syllable confusion dictionary and putonghua accent adaptation

  • 摘要: 提出了一种基于对识别器识别错误的统计分析建立音节混淆字典,并应用该音节混淆字典作为识别器的先验知识来校正结果的识别方法。这种近似的音节混淆字典可以分与语境无关和相关两种情况讨论,作为一种先验知识,它综合地刻画了说话人的发音特征、识别器的识别特征以及二者之间的相对关系。实验中,我们应用语境无关/相关的音节混淆字典于识别带上海口音的普通话识别器,使其音节误识率分别下降15%及20%。在此基础上,本论文结合常规的声学模型参数重估自适应MAP (最大后验概率)算法进一步研究了这种方法应用于语音识别系统的实际效果。结果证明,二者互为补充,更大地提高了识别器的精度。

     

    Abstract: This paper studies a novel approach to emend the result of the recognizer by applying the prior information on syllable confusion, named as Syllable Confusion Dictionary(SCD), which is estimated by the error of the recognizer. The approximated SCD is divided into two categories as context-dependent (CD) and context-independent (CI). As prior information, it characterizes the quality of pronunciation, recognition and their relation synthetically. In our experiments, we apply such a CD/CI SCD to a recognizer, which is used to recognize the putonghua with Shanghai accent, and get a relative improvement of 15%/20%. What is more, our study show that, combined with the MAP(Maximum a Posteriori) algorithm, this method can get better efficacy.

     

/

返回文章
返回