汉语连续语音识别中关键词可信度的贝叶斯估计
Bayesian estimation of keyword confidence in Chinese continuous speech recognition
-
摘要: 在一个基于经典隐马尔可夫模型(Hidden Markov Model,HMM)的汉语全音节、非特定人、连续语音识别系统中,利用声学层分数和基于拼音的统计语言模型分数,对关键词的可信度进行贝叶斯估计。本文提出了最大后验(Maximum APosteriori,MAP)可信测度,给出了计算MAP可信度分数的前向后向算法。并且在关键词捕捉应用中评价了MAP可信测度的性能,实验表明MAP可信度分数对关键词候选具有很强的鉴别能力。此外,MAP可信测度可以广泛地应用于各种语音识别应用中。Abstract: In a syllable-based speaker-independent Chinese continuous speech recognition system based on classical Hidden Markov Model (HMM), a Bayesian approach of keyword confidence estimation is studied, which utilizes both acoustic layer scores and syllable-based statistical language model (LM) scores. The Maximum A Posteriori (MAP) confidence measure is proposed, and the forward-backward algorithm calculating the MAP confidence scores is deduced. The performance of the MAP confidence measure is evaluated in keyword spotting application and the experiment results show that the MAP confidence scores provide high discriminability for keyword candidates. Furthermore, the MAP confidence measure can be applied to various speech recognition applications.