Integrating induced probability into decoding for large vocabulary continuous speech recognition

YANG Zhanlei; LIU Wenju; CHAO Hao

doi:10.15949/j.cnki.0371-0025.2012.02.010

YANG Zhanlei, LIU Wenju, CHAO Hao. Integrating induced probability into decoding for large vocabulary continuous speech recognition[J]. ACTA ACUSTICA, 2012, 37(2): 209-217. DOI: 10.15949/j.cnki.0371-0025.2012.02.010

Abstract

This paper integrates frame location information into the conventional acoustic model (AM) and language model (LM) likelihoods so that candidate paths can be distinguished more precisely at the decoding stage. It proposes an induced probability that represents the location of a frame within the whole acoustic space. With the induced probability integrated, the decoder is directed to search the most promising regions of the acoustic space: promising paths are enhanced and unlikely paths are weakened. Experiments on Chinese Putonghua show a relative reduction of 10.95% in character error rate without a significant increase in decoding complexity. Finally, a pruning analysis shows that integrating frame location information into the traditional decoding framework helps improve system performance.
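
The abstract does not state how the induced probability enters the path score, so the following is only an illustrative sketch. It assumes a log-linear combination of the AM, LM, and induced log-scores followed by ordinary beam pruning. The function names, the weights, and all numeric values are hypothetical and are not taken from the paper.

```python
def combined_score(am_logp, lm_logp, induced_logp,
                   lm_weight=10.0, induced_weight=1.0):
    """Log-linear path score (assumed form, not the paper's formula).

    am_logp      : acoustic model log-likelihood of the path
    lm_logp      : language model log-probability of the path
    induced_logp : hypothetical log of the induced probability
    """
    return am_logp + lm_weight * lm_logp + induced_weight * induced_logp


def beam_prune(hypotheses, beam):
    """Keep the (name, score) pairs whose score is within `beam` of the best."""
    best = max(score for _, score in hypotheses)
    return [(name, score) for name, score in hypotheses if score >= best - beam]


# Two made-up path candidates: (name, am_logp, lm_logp, induced_logp).
paths = [
    ("A", -100.0, -2.0, -1.0),
    ("B", -98.0, -2.5, -8.0),
]

# Baseline: AM and LM only (induced weight set to zero).
baseline = [(n, combined_score(am, lm, ind, induced_weight=0.0))
            for n, am, lm, ind in paths]

# With the induced term included.
with_induced = [(n, combined_score(am, lm, ind))
                for n, am, lm, ind in paths]

survivors_baseline = [n for n, _ in beam_prune(baseline, beam=5.0)]
survivors_induced = [n for n, _ in beam_prune(with_induced, beam=5.0)]
```

With these made-up numbers, both paths survive the beam when only the AM and LM scores are used, while path B falls outside the beam once the induced term is added. This is the kind of effect the abstract describes as weakening unlikely paths, shown here under the stated assumptions only.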
