基于互信息匹配模型的说话人识别
Text-dependent speaker identification based on mutual information matching model
-
摘要: 依据互信息理论提出的互信息匹配识别模型MIM(Mutual Information Matching),能够有效地综合处理语音信号的统计分布特征与时变分布特征,并具有较强的鲁棒性。介绍了运用互信息进行说话人模式匹配的原理,探讨了基于文本的说话人识别中MIM模型的应用,通过说话人辨别实验对MIM模型的性能进行了实验分析,并与其它识别模型DTW和GMM进行了比较。对18名男性和12名女性组成的30名说话人进行的识别实验表明, MIM模型的说话人识别性能较好,在采用LPCC特征参数的情况下,平均错误识别率为1.33%。Abstract: The Mutual Information Matching model (MIM) was proposed for speaker recognition based on the mutual information theory. Both of statistical and time-variant features of speech signal can be processed effectively, robustly and synchronously in MIM. It is presented a description of MIM principle and then evaluated its application to text-dependent speaker identification with comparison to other two typical models, DTW and GMM. The identification experiments on 30 speakers including 18 males and 12 females show that MIM model has better performance with the identification error rate of 1.33% if LPCC was used as feature parameters.