Indexed in EI / SCOPUS / CSCD

Chinese Core Journal

Long span prosodic features for speaker recognition

ZHANG Jianping, LI Ming, SUO Hongbin, YANG Lin, FU Qiang, YAN Yonghong

Citation: ZHANG Jianping, LI Ming, SUO Hongbin, YANG Lin, FU Qiang, YAN Yonghong. Long span prosodic features for speaker recognition[J]. ACTA ACUSTICA, 2010, 35(2): 267-269. DOI: 10.15949/j.cnki.0371-0025.2010.02.019


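Funding: 

Supported by the National Key Technology R&D Program of China (2008BAI50B00) and the National Natural Science Foundation of China (10925419, 90920302, 10874203, 60875014).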

Details
  • PACS: 
      43.72; 43.60

Long span prosodic features for speaker recognition

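  • Abstract (translated from the Chinese): Besides introducing commonly used speaker recognition techniques, this paper mainly presents a speaker recognition method based on long-span time-frequency features. The input speech first passes through voice activity detection (VAD); once clean speech is obtained, basic time-frequency features are extracted from it. Within each speech unit, the trajectories of time-frequency features such as pitch, formants, and harmonics are fitted with Legendre polynomials, and the leading fitting coefficients are kept. HLDA is then used to reduce the feature dimension, and the mean supervector of a Gaussian mixture model represents the statistics of the time-frequency features of each utterance. On the NIST06 speaker recognition 1side-1side test set, the method achieves an equal error rate (EER) of 18.7%. When fused with a conventional MFCC-based speaker system, the EER drops from 4.9% to 4.6%, a relative EER reduction of 6%.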
    Abstract: In this paper, we first give an introduction to speaker recognition techniques. We then propose a novel speaker verification method based on long span prosodic features. Speech is first pre-processed by a voice activity detection module, and basic prosodic features are extracted for each speech unit. The pitch, formant, time domain energy and harmonic energy contours are then approximated by the leading terms of a Legendre polynomial expansion. HLDA is used to reduce the feature dimension, and the supervector of Gaussian mixture model means is used to represent the distribution of the time-frequency features. Experiments on NIST06 show that the proposed method reduces the EER from 4.9% to 4.6% when fused with the traditional MFCC-based system.
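
The contour approximation step described in the abstract can be illustrated with a minimal sketch. It assumes a contour sampled at evenly spaced frames within one speech unit and uses NumPy's least-squares Legendre fit. The function name `legendre_contour_features`, the polynomial degree of 5, and the synthetic pitch contour are illustrative assumptions, not values taken from the paper, and unvoiced-frame handling is left out.

```python
import numpy as np
from numpy.polynomial import legendre


def legendre_contour_features(contour, degree=5):
    """Fit a Legendre expansion to one contour and return its coefficients.

    The degree is an assumed value; the abstract does not state how many
    leading terms the authors keep.
    """
    contour = np.asarray(contour, dtype=float)
    if contour.size <= degree:
        raise ValueError("contour needs more samples than the polynomial degree")
    # Map frame indices onto [-1, 1], the interval on which Legendre
    # polynomials are orthogonal.
    x = np.linspace(-1.0, 1.0, contour.size)
    # Least-squares fit; coefficient k multiplies the Legendre polynomial P_k.
    return legendre.legfit(x, contour, degree)


# Synthetic pitch contour (Hz) for one hypothetical 40-frame speech unit,
# built as 120*P0 + 15*P1 - 10*P2 so the expected coefficients are known.
x = np.linspace(-1.0, 1.0, 40)
pitch = 120.0 + 15.0 * x - 10.0 * (1.5 * x**2 - 0.5)
coeffs = legendre_contour_features(pitch, degree=5)
```

Under these assumptions, the first three coefficients summarize the mean level, slope, and curvature of the contour. Coefficient vectors from the pitch, formant, and energy contours of a unit could then be concatenated before dimension reduction.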
Publication history
  • Received:  2010-02-01
  • Revised:  2010-02-10
  • Published online:  2022-06-27
  • Issue date:  2022-06-27
