近似窄带假设下的最小方差无失真响应波束形成
Narrowband approximation based minimum variance distortionless response beamforming
-
摘要: 最小方差无失真响应波束形成算法在应用于语音等宽带信号时,依赖窄带假设可以在频域各个子带分别进行滤波。窄带假设下语音信号协方差矩阵是秩-1矩阵,而实际中窄带信号模型只是实际信号模型的一种近似,同时由于存在统计量估计误差,估计的语音信号协方差矩阵的秩一般大于1。提出利用语音协方差矩阵和噪声协方差矩阵的广义主特征向量来估计相对传递函数,用于重构语音信号协方差矩阵为秩-1矩阵。在REVERB数据集以及CHiME-4数据集上进行实验验证,最小方差无失真响应波束形成算法经过语音协方差矩阵低秩近似后,对估计误差的鲁棒性提高,输出信噪比分别提升平均0.8 dB和1.4 dB,同时提升了语音识别准确率。Abstract: Narrowband assumption is a general underlying assumption when the Minimum Variance Distortionless Response (MVDR) beamformer is applied to broadband signals such as speech.Under the narrowband assumption,the speech covariance matrix is a rank-1 matrix.In practice,however,the rank of the speech covariance matrix is usually greater than one because of the simplified narrowband signal model and unavoidable errors in the estimation process.We propose to reconstruct a rank-1 speech covariance matrix using the relative transfer function which is estimated from the generalized eigenvector of the speech covariance matrix and noise covariance matrix.Compared with the plain MVDR,the low rank approximation based algorithm is shown to be more robust to estimation errors.Experiments are conducted on the REVERB and CHiME-4 datasets,and the output SNRs are on average improved by 0.8 dB and 1.4 dB respectively,and the speech recognition accuracies are also improved.