EI / SCOPUS / CSCD 收录

中文核心期刊

HUANG Hao, WANG Jianming, Abudureyimu Halidan, Silamu Wushour. Maximum F1-score acoustic model training for automatic mispronunciation detection[J]. ACTA ACUSTICA, 2013, 38(6): 751-758. DOI: 10.15949/j.cnki.0371-0025.2013.06.010
Citation: HUANG Hao, WANG Jianming, Abudureyimu Halidan, Silamu Wushour. Maximum F1-score acoustic model training for automatic mispronunciation detection[J]. ACTA ACUSTICA, 2013, 38(6): 751-758. DOI: 10.15949/j.cnki.0371-0025.2013.06.010

Maximum F1-score acoustic model training for automatic mispronunciation detection

  • To improve the performance of automatic mispronunciation detection in computer-assisted language learning, a discriminative acoustic model training method is proposed. The method aims at maximizing the F1-score of mispronunciation detection results on the annotated non-native speech database. The training objective function is formulated as a smooth form of the F1-score by using the sigmoid function, and is optimized by using the extended Baum-Welch form like updating equations based on the weak-sense auxiliary function method. Simultaneous updating strategy of acoustic models and phone threshold parameters is proposed to ensure monotonicity of the objective function improvement. Mispronunciation detection experiments show that the method is effective in increasing the F1-score,precision, recall and detection accuracy on both the training and evaluation data set.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return