噪声环境下基于最大后验非线性变换的隐马尔可夫模型自适应算法
Hidden Markov model adaptation algorithm using maximum a posteriori nonlinear transformation in noisy environments
-
摘要: 由于训练环境和识别环境的失配,识别系统的性能会严重下降。为此,提出了基于最大后验概率非线性变换的环境自适应算法,可以减小由于环境的失配所引起的系统性能的下降。在本算法中,利用分段线性回归近似非线性变换将训练环境下隐马尔可夫模型(HMM)的均值向量变换到识别环境,减小环境的失配,变换参数的估计采用了最大后验概率估计(MAP)。数字语音识别实验证明:该环境自适应算法的识别性能优于MLST,MAPLR和MLLR等算法。Abstract: The performance of speech recognition system will be significantly deteriorated because of the mismatches between training and testing conditions. This paper addresses the problem and proposes an environment adaptation algorithm to adapt the mean vectors of HMM using nonlinear transformation, which is approximated by piecewise linear regression. The algorithm can reduce the performance deterioration of the speech recognition system caused by the mismatches. Rather than estimating the transformation parameters using maximum likelihood estimation (MLE), we proposed to use maximum a posteriori (MAP) as the estimation criterion. The proposed algorithm, called MAPNT, has been evaluated on a Chinese digit recognition experiment based on continuous density HMM. The test shows that the proposed algorithm is efficient and outgoing other algorithms, such as maximum a posteriori linear regression (MAPLR) algorithm and maximum likelihood linear regression (MLLR) algorithm etc..