EI / SCOPUS / CSCD 收录

中文核心期刊

WANG Min, ZHAO Heming. Whispered speaker identification based on multiband demodulation analysis and instantaneous frequency estimation[J]. ACTA ACUSTICA, 2010, 35(4): 471-476. DOI: 10.15949/j.cnki.0371-0025.2010.04.014
Citation: WANG Min, ZHAO Heming. Whispered speaker identification based on multiband demodulation analysis and instantaneous frequency estimation[J]. ACTA ACUSTICA, 2010, 35(4): 471-476. DOI: 10.15949/j.cnki.0371-0025.2010.04.014

Whispered speaker identification based on multiband demodulation analysis and instantaneous frequency estimation

  • In order to improve the robust performance of whispered speaker indentification,a kind of whispered speech parameter called instantaneous frequency estimation (IFE) is proposed based on the AM-FM representation of speech signal.According to the formant modulation theory of speech production,the instantaneous envelope and frequency of speech are extracted by multiband demodulation analysis (MDA).IFE is then obtained by the weighted estimation both on envelope amplitude and frequency to represent the accurate frequency structure of speech.The proposed speech parameters have been applied for whispered speaker indentification and compared with conventional MFCC.The experiment results show that,as the test objectives increase,the IFE parameters perform as well as MFCC,even a little better.When the test channels are changed,comparing with MFCC,IFE effectively improves the robust performance of system.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return