Indexed in EI / SCOPUS / CSCD

Chinese Core Journal

Integrating induced probability into decoding for large vocabulary continuous speech recognition

YANG Zhanlei, LIU Wenju, CHAO Hao

Citation: YANG Zhanlei, LIU Wenju, CHAO Hao. Integrating induced probability into decoding for large vocabulary continuous speech recognition[J]. ACTA ACUSTICA, 2012, 37(2): 209-217. DOI: 10.15949/j.cnki.0371-0025.2012.02.010

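Funding: 

Supported by the National Basic Research Program of China (973 Program) (2004CB318105), the National High Technology Research and Development Program of China (863 Program) (20060101Z4073, 2006AA01Z194), and the National Natural Science Foundation of China (90820011, 60675026, 90820303).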

Details
  • PACS: 
      43.72

Integrating induced probability into decoding for large vocabulary continuous speech recognition

  • Abstract: The location of a speech frame in the acoustic feature space can help the decoder discriminate among candidate paths, but conventional speech recognition systems do not exploit this information. This paper proposes an induced probability model that describes the probability that a frame belongs to each local region of the acoustic feature space, and integrates it with the conventional acoustic model (AM) and language model (LM) likelihoods during decoding. With the induced probability, the decoder concentrates its search on the most promising regions of the acoustic space: paths passing through those regions are retained and extended, while paths that do not are weakened. Experiments on Chinese Putonghua show a relative reduction of 10.95% in character error rate without a significant increase in decoding complexity. A pruning analysis indicates that integrating frame location information into the traditional decoding framework discriminates candidate paths more effectively and thereby lowers the error rate.
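The abstract does not give the form of the induced probability or how it is weighted, so the following is only a toy sketch of the general idea: add a region-membership log-probability to the usual AM and LM log-scores before beam pruning. The region model (fixed centroids with a softmax over negative squared distances), the weights, the beam width, and every function and variable name are assumptions made for illustration, not the authors' formulation.

```python
import math

def induced_log_probs(frame, centroids):
    """Log-probability that `frame` belongs to each region.

    Assumption: regions are represented by centroids and membership is a
    softmax over negative squared Euclidean distance.
    """
    neg_d2 = [-sum((x - c) ** 2 for x, c in zip(frame, cen)) for cen in centroids]
    m = max(neg_d2)  # subtract the max for numerical stability
    log_z = m + math.log(sum(math.exp(v - m) for v in neg_d2))
    return [v - log_z for v in neg_d2]

def combined_score(am_logp, lm_logp, induced_logp, lm_weight=1.0, induced_weight=1.0):
    """Log-linear combination of AM, LM, and induced scores (hypothetical weights)."""
    return am_logp + lm_weight * lm_logp + induced_weight * induced_logp

def beam_prune(hyps, beam):
    """Keep hypotheses whose score is within `beam` of the best score."""
    best = max(score for _, score in hyps)
    return [(name, score) for name, score in hyps if score >= best - beam]

# Toy data: two regions, one frame that lies close to region 0.
centroids = [(0.0, 0.0), (4.0, 4.0)]
frame = (0.5, 0.2)
log_ind = induced_log_probs(frame, centroids)

# Each candidate: (label, AM log-prob, LM log-prob, region its state lies in).
candidates = [("path_a", -5.0, -2.0, 0), ("path_b", -4.8, -2.0, 1)]

baseline = [(n, combined_score(am, lm, 0.0)) for n, am, lm, _ in candidates]
guided = [(n, combined_score(am, lm, log_ind[r])) for n, am, lm, r in candidates]

survivors_baseline = beam_prune(baseline, beam=3.0)
survivors_guided = beam_prune(guided, beam=3.0)
print([n for n, _ in survivors_baseline])
print([n for n, _ in survivors_guided])
```

In this toy setup both candidates survive pruning on AM and LM scores alone, while the added region term pushes the path through the distant region outside the beam. That mirrors the behaviour the abstract describes (promising paths retained, unlikely paths weakened) without claiming to reproduce the paper's model.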
Publication history
  • Received:  2011-03-22
  • Revised:  2011-06-08
  • Published online:  2022-06-22
