Tonal articulatory feature-based acoustic modeling for Chinese Putonghua speech recognition

ZHANG Qingqing; PAN Jielin; YAN Yonghong

doi:10.15949/j.cnki.0371-0025.2010.02.026

ZHANG Qingqing, PAN Jielin, YAN Yonghong. Tonal articulatory feature-based acoustic modeling for Chinese Putonghua speech recognition[J]. ACTA ACUSTICA, 2010, 35(2): 254-260. DOI: 10.15949/j.cnki.0371-0025.2010.02.026

Citation:

Tonal articulatory feature-based acoustic modeling for Chinese Putonghua speech recognition

Graphical Abstract

Graphical Abstract

Abstract

Abstract

The development of a Chinese Putonghua conversational Large Vocabulary Continues Speech Recognition (LVCSR) system using tonal articulatory tandem features is presented.A set of nine Articulatory Features (AF) that are used for classifying sounds and tones of Chinese Putonghua is given,and the posteriors of these nine AF classifiers are used as features in the Automatic Speech Recognition (ASR).In the experiment on Chinese Putonghua conversational LVCSR,compared with baseline ASR using standard acoustic features,the tonal AF-based ASR has a 6.8%decrease on Character Error Rate (CER).When the AF combinations with standard acoustic features at feature-level and word-level, the CER achieves 10.1%relative reduction.These results prove that the AF is effective to capture the characteristics of the speech pronunciations,and with the complementary information provided by standard acoustic features and AF, the combination system achieves better performances further.

FullText(HTML)

References (0)

Cited By

Tonal articulatory feature-based acoustic modeling for Chinese Putonghua speech recognition

Graphical Abstract

Abstract

Catalog

Export File

Citation

Format

Content