基于共享状态空间旋转变换的相关特征建模
Tying state-specified rotation using semi-tied covariance transform to model correlations between feature vectors
-
摘要: 提出了一种共享空间旋转变换的声学建模方法。该方法结合状态空间旋转变换和方差部分共享的优点,克服了空间旋转变换方法由于每个输出都有一个变换矩阵而带来的计算量和存储量增加的缺点。在空间旋转变换方法得到比较精确的初始模型的基础上,通过共享的方差变换方法实现了不同状态的空间旋转矩阵的共享,解决了状态空间旋转变换后参数过多的缺点并提高了系统的识别率。试验结果表明,在汉语大词汇量连续语音识别系统中,同传统的对角方差建模技术相比,这种方法在计算量增加很小的情况下,系统字的误识率降低了18.8%。Abstract: An acoustic modeling method called the tying State-Specified Rotation is proposed. The method incorporates the merits of State-specified Rotation (SSR) and Semi-Tied Covariance Transform (STC), and overcomes computation and memory problems which are incurred because each state has one full feature-space transform matrix. Based on more precision initial model of SSR, STC is used for tying the feature-space transform matrices among different states. The technique solved the problem that the parameters are overload after SSR, and decreased the number of transform matrices without reducing recognition accuracy. Experimental results on a large vocabulary continues speech recognition task of mandarin show that in comparison to the traditional diagonal modeling technique, the proposed method can get nearly 18.8% word error rate reduction without incurring much computation load during decoding.