仿选择性注意机制的语音情感识别算法

梁瑞宇; 赵力; 陶华伟; 王青云; 邹采荣

doi:10.15949/j.cnki.0371-0025.2016.04.012

仿选择性注意机制的语音情感识别算法

Speech emotion recognition algorithm based on the selective attention mechanism

摘要

摘要: 有效特征的选取一直都是语音情感识别算法的关键。为此,针对语音情感特征选择与构建的问题,一种仿选择性注意机制的语音情感识别算法被提出。考虑到语音信号的时频特性,算法首先计算语音信号的语谱图;其次,模仿选择性注意机制,计算语谱图的颜色、方向和亮度特征图,归一化后形成特征矩阵;然后,将特征矩阵重排列并进行PCA降维,形成情感识别特征向量;最后,利用改进的支持向量机分类方法进行语音情感识别。对愤怒、恐惧、高兴、悲伤和惊奇5种情感的识别实验显示,基于选择性注意的方法能够获得较好的识别效果,平均识别率为85.44%。相比于韵律特征和音质特征,语音情感识别率至少提高10%;相比于其它语谱特征,识别率提高7%左右。

Abstract: The selection of effective features is always the key issue of speech emotion recognition algorithm. For the problem of the speech emotion features selection and construction, one speech emotion recognition algorithm based on the selective attention mechanism is proposed. Firstly, considering the time-frequency of speech, the spectrogram is calculated. Secondly, imitating the selective attention mechanism, the color, direction and brightness map of the spectrogram is computed. Based on these maps, the feature matrix is formed after normalization. Then, the feature matrix is rearranged and reduced the dimensionality by the principal component analysis (PCA) to form the emotion recognition feature vector. Finally, the speech emotion is classified with the improved support vector machine (SVM). Experiments on emotion (anger, fear, joy, sadness and surprise) recognition show the proposed method has an improved recognition rate 85.44%. Compared with the prosody and voice quality features, the speech emotion recognition rate has been improved at least by 10%. And compared with others spectrogram features, the improved recognition rate reaches 7%.

HTML全文

参考文献(0)

施引文献

资源附件(0)