EI / SCOPUS / CSCD 收录

中文核心期刊

扩散噪声环境下的多通道盲语音分离方法

Multi-channel blind speech separation method for diffuse noise environments

  • 摘要: 适用于超定系统的处理扩散噪声的独立向量提取方法受限于高斯噪声假设, 无法消除与目标声源同方向的扩散噪声。为此, 提出了一种适用于扩散噪声环境的多通道盲源分离方法。该方法假设扩散噪声在各个方向上的能量分布均匀且具有时变特性。采用低秩声源模型和秩1空间模型构建适用于扩散噪声环境下混合信号的概率模型。在此基础上, 通过最大似然准则推导出分离矩阵的更新公式, 并估计语音和噪声的功率谱密度。最后, 利用维纳滤波抑制与目标声源相同方向的扩散噪声。仿真实验结果表明, 所提方法的声源分离性能和抑制噪声能力比现有算法取得了显著提高, 验证了其在复杂噪声环境下的有效性。

     

    Abstract: Independent vector extraction is an advanced method for blind source separation (BSS) in diffuse noise environments, suitable for overdetermined systems and characterized by high computational efficiency. However, it is limited by the assumption of stationary Gaussian noise, which makes it ineffective at eliminating diffuse noise that is aligned with the target source. To tackle this issue, a multi-channel blind source separation method is proposed for diffuse noise environments. This method assumes that the energy distribution of diffuse noise is uniform in all directions and exhibits time-varying characteristics. A low-rank source model and a rank-1 spatial model are utilized to construct a probabilistic model for the noisy mixtures. Using this probabilistic model, the update formula for the separation matrix is derived based on the maximum likelihood criterion, and the power spectral densities of the speech and noise components are estimated. Subsequently, Wiener filtering is employed to suppress the noise components that are aligned with the target source direction. Ultimately, experimental results demonstrate that the proposed method significantly outperforms existing BSS algorithms in terms of source separation performance and noise suppression capability, thus validating its effectiveness in complex acoustic environments.

     

/

返回文章
返回