EI / SCOPUS / CSCD 收录

中文核心期刊

嗓音多频带非线性分析的声带病变识别

Vocal cords diseases detection by multi-band nonlinear analysis of voice

  • 摘要: 提出了一种嗓音多频带非线性分析的声带病变识别方法,以提高声带病变嗓音的识别率。首先采用Gammatone听觉滤波器组对嗓音信号进行滤波,求取每个频带下的最大李雅普诺夫指数;对映射到核空间的数据采用高斯最大似然度准则优化核函数,然后采用优化核主成分分析算法实现特征抽取。识别实验表明,多频带最大李雅普诺夫指数的识别率比传统的MFCC和最大李雅普诺夫指数分别有6.52%和8.45%的提高,且采用优化核主成分分析算法比传统核主成分分析算法有更好的抽取效果.将多频带非线性分析和优化核主成分分析算法结合,识别率提升至97.82%。

     

    Abstract: In order to improve the recognition rate of pathological voices caused by disease of vocal cords, multi-band nonlinear analysis is proposed. Gammatone filter bank is applied to voice signal for front-end time-domain filtering, and then calculate the largest Lyapunov exponent of every band. Data is first mapped into kernel space and use Gaussian maximum likelihood rule to get the best parameter for kernel, which is used for kernel principal component analysis to extract feature. The proposed feature achieves higher recognition rate of 6.25% and 8.45% than MFCC and the largest Lyapunov exponent respectively. When the proposed kernel function is used for kernel principal component analysis, it achieves better performance than traditional function. Ultimately, we get recognition rate of 97.82% by combing them.

     

/

返回文章
返回