EI / SCOPUS / CSCD 收录

中文核心期刊

YU Yibiao, YUAN Dongmei, XUE Feng. A non-linear frequency transform for speaker recognition[J]. ACTA ACUSTICA, 2008, 33(5): 450-455. DOI: 10.15949/j.cnki.0371-0025.2008.05.014
Citation: YU Yibiao, YUAN Dongmei, XUE Feng. A non-linear frequency transform for speaker recognition[J]. ACTA ACUSTICA, 2008, 33(5): 450-455. DOI: 10.15949/j.cnki.0371-0025.2008.05.014

A non-linear frequency transform for speaker recognition

  • The classical frequency transform can describe perception characteristics of human auditory system, but can not relatively enhance speaker's individuality in short-time spectrum of speech. A non-linear frequency transform and feature detection algorithm are proposed based on analyzing contribution of short-time spectrum in different frequency sub-bands and using of polynomial curve fitting. The experimental results show that the proposed non-linear frequency transform can improve the performance effectively in comparison with classical non-linear frequency transform such as Mel, Bark and ERB. In the same condition, the average error rate falls about 70.5%, 60.8% and 70.5% respectively. The proposed frequency scale and feature detection algorithm can strengthen the individual personality and improve the recognition performance.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return