EI / SCOPUS / CSCD 收录

中文核心期刊

LI Xiaoxue, XU Wen, JIN Liling, LI Jianlong. Microphone array processing via joint wideband angle-of-arrival estimation and speech feature enhancement[J]. ACTA ACUSTICA, 2011, 36(4): 444-450. DOI: 10.15949/j.cnki.0371-0025.2011.04.001
Citation: LI Xiaoxue, XU Wen, JIN Liling, LI Jianlong. Microphone array processing via joint wideband angle-of-arrival estimation and speech feature enhancement[J]. ACTA ACUSTICA, 2011, 36(4): 444-450. DOI: 10.15949/j.cnki.0371-0025.2011.04.001

Microphone array processing via joint wideband angle-of-arrival estimation and speech feature enhancement

  • This paper concerns techniques of speech processing using a small-size microphone array to improve automatic speech recognition(ASR)performance in an indoor reverberant environment.The method first applies Estimation of Signal Parameters via Rotational Invariance Techniques(ESPRIT)to compute directions-of-arrivals of wideband speech signals and implements time-delay compensation accordingly;array signal filtering is then considered jointly with the HMM-based speech recognition procedure,whose outputs are fed back to the front end to optimize array filtering design.Different from conventional array processing aiming to enhancing signal waveform,the approach here adjusts array filtering coefficients to reduce mismatch between the features to be recognized and the training model,thus directly maximizing the likelihood of the right transcription for a selected vocabulary.Experimental data processing shows that the above approach can effectively reduce ASR error rate for an isolated word vocabulary of finite size in a meeting-room environment,superior to conventional beamforming processing;compared to a local optimization scheme,applying global optimization in array filtering design further improves the performance.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return