EI / SCOPUS / CSCD 收录

中文核心期刊

一种改进的基于层次聚类的说话人自动聚类算法

An improved hierarchical speaker clustering

  • 摘要: 说话人聚类是语音识别以及音频检索等众多语音应用的重要组成部分。提出一种改进的基于层次聚类的说话人聚类算法,对层次聚类法做出了进一步改进:(1)改进误差平方和准则以提高聚类速度;(2)引入假设检验方法确定类别数目;(3)提出一种稳健的在线聚类方法以解决对新到来的语音段进行聚类的问题。在聚类实验中,算法的平均类纯度和说话人纯度分别为96.7%和96.6%。实验结果还表明,相比手工标注说话人信息,将该算法的聚类结果应用于说话人自适应可降低系统的误识率。

     

    Abstract: The speaker clustering is a key component in many speech applications. An improved speaker clustering algorithm based on hierarchical clustering was presented. Compared with typical hierarchical clustering approaches, the proposed method improves the performance of the clustering from three aspects. First, a modified sum-of-squared-error criterion is implemented to ensure better efficiency of the clustering. Then hypothesis test is introduced to select the best clusters automatically. Additionally, a robust online speaker clustering algorithm based on hypothesis test is described. The clustering decision can be made whenever a new audio segment is received. Experiments indicate that the improved hierarchical speaker clustering achieves good performance on both precision and speed. The method obtains average speaker purity of 96.7% and average cluster purity of 96.6%. Experiments also show that this method is effective at improving the performance of the unsupervised adaptation even comparing with the true speaker-condition.

     

/

返回文章
返回