融合改进梅尔谱特征和深信念网络的语音测谎算法
Deception detection with spectral features based on deep belief network
-
摘要: 为实现非接触式谎言检测,特提出了以语谱特征为线索,结合深度学习的谎言检测方法。为提取谎言中微颤抖所引起的语谱局部能量变化,算法先对梅尔频谱进行了Hu矩处理,然后进行离散余弦变换去除相关性。该特征利用了Hu矩的正交不变性和平移不变性,能较好的体现出语谱中局部能量的集中方式。然后将所提取的特征作为改进深信念网络输入进行谎言识别。为提高受限玻尔兹曼机的并行回火训练算法中相邻温度链之间的交换率,训练算法先对Markov链的状态能量进行等能量的划分,使得每个能量环内的状态具有相似的能量,然后再进行交换以提高交换率从而优化整个网络的训练。在Columbia-SRI-Colorado数据库上的实验表明,谎言识别率达到了71.47%,比梅尔倒谱系数特征的识别率提高了3%,比传统的BayesNet分类算法提高了7%。Abstract: In order to solve the problems of traditional deception detection using physiological indicators which only detected deception of present person under contacting condition, a new method combining phonetic features with deep learning is proposed. In the term of extracting features, the Hu moment of Mel-Frequency Cepstrum is calculated firstly,then discrete cosine transform is performed to remove correlation. This spectral feature makes use of the orthogonal and translational invariance of Hu moment, which could evaluate the degree how the local energy is concentrated to the center of energy gravity that has a close relationship with the energy changing caused by trembling in deception.Furthermore, a modified training algorithm for Restricted Boltzmann Machines is proposed to improve the exchange rate of adjacent temperature chains in the traditional parallel tempering, which partitions the state energy of Markov into several energy rings. In each ring, the states have similar energies. The performance of network will increase with the exchange rate. Experiments on Columbia-SRI-Colorado dataset show that the recognition rate is 71.47%, 7% higher than the experiments of Columbia University.