EI / SCOPUS / CSCD 收录

中文核心期刊

张聪, 杨飞然, 陈先梅, 杨军. 广义高斯分布的卷积传递函数多通道非负矩阵分解[J]. 声学学报, 2024, 49(3): 598-610. DOI: 10.12395/0371-0025.2023009
引用本文: 张聪, 杨飞然, 陈先梅, 杨军. 广义高斯分布的卷积传递函数多通道非负矩阵分解[J]. 声学学报, 2024, 49(3): 598-610. DOI: 10.12395/0371-0025.2023009
ZHANG Cong, YANG Feiran, CHEN Xianmei, YANG Jun. Convolution transfer function-based multi-channel non-negative matrix factorization using generalized Gaussian distributions[J]. ACTA ACUSTICA, 2024, 49(3): 598-610. DOI: 10.12395/0371-0025.2023009
Citation: ZHANG Cong, YANG Feiran, CHEN Xianmei, YANG Jun. Convolution transfer function-based multi-channel non-negative matrix factorization using generalized Gaussian distributions[J]. ACTA ACUSTICA, 2024, 49(3): 598-610. DOI: 10.12395/0371-0025.2023009

广义高斯分布的卷积传递函数多通道非负矩阵分解

Convolution transfer function-based multi-channel non-negative matrix factorization using generalized Gaussian distributions

  • 摘要: 基于卷积传递函数的多通道非负矩阵分解(CTF-MNMF)在长混响环境的盲源分离中取得了较好的性能, 但该算法的分离性能依然受到声源模型的限制。因此提出了基于广义高斯分布(GGD)的CTF-MNMF算法, 通过将域参数引入NMF中并利用广义非负矩阵分解(GNMF)建模GGD的非负尺度因子, 提高了声源模型捕捉信号离群值的鲁棒性, 进而提高了声源估计的准确性。采用基于辅助函数的优化策略给出分离矩阵和非负矩阵参数的更新公式。仿真结果表明所提算法在语音和音乐两种信号的分离实验中均取得了比GGD-ILRMA、WPE-ILRMA和CTF-MNMF更好的分离性能。

     

    Abstract: The convolution transfer function-based multi-channel non-negative matrix factorization (CTF-MNMF) has been shown to perform well in blind source separation in highly reverberant environments, but its effectiveness may be limited by the source model. An improved version of the CTF-MNMF is proposed, where the generalized Gaussian distribution (GGD) is used as the source model. The domain parameter is introduced into the NMF and the generalized NMF (GNMF) is utilized to model the non-negative scale factors of the GGD, which enhances the robustness of the source model in capturing signal outliers, and thus improves the accuracy of source estimation. An auxiliary function-based method is used to derive an improved formula for updating the separated matrix and non-negative matrix parameters. Simulation results shows that the proposed algorithm achieves better separation performance than the GGD-ILRMA, WPE-ILRMA, CTF-MNMF algorithms for both speech and music input signals.

     

/

返回文章
返回