A non-linear frequency transform for speaker recognition

YU Yibiao; YUAN Dongmei; XUE Feng

doi:10.15949/j.cnki.0371-0025.2008.05.014

YU Yibiao, YUAN Dongmei, XUE Feng. A non-linear frequency transform for speaker recognition[J]. ACTA ACUSTICA, 2008, 33(5): 450-455. DOI: 10.15949/j.cnki.0371-0025.2008.05.014

Citation:

YU Yibiao, YUAN Dongmei, XUE Feng. A non-linear frequency transform for speaker recognition[J]. ACTA ACUSTICA, 2008, 33(5): 450-455. DOI: 10.15949/j.cnki.0371-0025.2008.05.014

Citation:

YU Yibiao, YUAN Dongmei, XUE Feng. A non-linear frequency transform for speaker recognition[J]. ACTA ACUSTICA, 2008, 33(5): 450-455. DOI: 10.15949/j.cnki.0371-0025.2008.05.014

A non-linear frequency transform for speaker recognition

Graphical Abstract

Graphical Abstract

Abstract

Abstract

The classical frequency transform can describe perception characteristics of human auditory system, but can not relatively enhance speaker's individuality in short-time spectrum of speech. A non-linear frequency transform and feature detection algorithm are proposed based on analyzing contribution of short-time spectrum in different frequency sub-bands and using of polynomial curve fitting. The experimental results show that the proposed non-linear frequency transform can improve the performance effectively in comparison with classical non-linear frequency transform such as Mel, Bark and ERB. In the same condition, the average error rate falls about 70.5%, 60.8% and 70.5% respectively. The proposed frequency scale and feature detection algorithm can strengthen the individual personality and improve the recognition performance.

FullText(HTML)

References (0)

Cited By

A non-linear frequency transform for speaker recognition

Graphical Abstract

Abstract

Catalog

Export File

Citation

Format

Content