EI / SCOPUS / CSCD 收录

中文核心期刊

WANG Yingxue, ZHAO Shenghui, KUANG Jingming. Speech bandwidth extension supported by temporal information[J]. ACTA ACUSTICA, 2017, 42(3): 370-376. DOI: 10.15949/j.cnki.0371-0025.2017.03.015
Citation: WANG Yingxue, ZHAO Shenghui, KUANG Jingming. Speech bandwidth extension supported by temporal information[J]. ACTA ACUSTICA, 2017, 42(3): 370-376. DOI: 10.15949/j.cnki.0371-0025.2017.03.015

Speech bandwidth extension supported by temporal information

  • Speech Bandwidth Extension (BWE) aims to improve the quality of speech by reconstructing the missing High Frequency (HF) components using the correlation that exists between the Low Frequency (LF) and HF of speech. The Gaussian Mixture Model (GMM) based methods are widely used. However, the derived mapping function by GMM is a piece-wise linear transformation and ignores the temporal information of speech. Thus, a novel BWE method is proposed for estimation of the HF parts of speech by exploiting Conditional Restricted Boltzmann Machines (CRBM). The proposed method introduces CRBM to obtain time information and model deep non-linear relationships between the spectral envelope features of LF and HF by building high-order eigen spaces between the LF and HF of the speech signal. The objective and subjective test results show that the proposed method outperforms the conventional GMM based method.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return