Unsupervised speech denoising via perceptually motivated robust principal component analysis

MIN Gang; ZOU Xia; HAN Wei; ZHANG Xiongwei; TAN Wei

doi:10.15949/j.cnki.0371-0025.2017.02.015

MIN Gang, ZOU Xia, HAN Wei, ZHANG Xiongwei, TAN Wei. Unsupervised speech denoising via perceptually motivated robust principal component analysis[J]. ACTA ACUSTICA, 2017, 42(2): 246-256. DOI: 10.15949/j.cnki.0371-0025.2017.02.015

Citation:

Unsupervised speech denoising via perceptually motivated robust principal component analysis

Graphical Abstract

Graphical Abstract

Abstract

Abstract

To overcome the shortcomings in the existing sparse and low-rank speech denoising method that the auditory perceptual properties are not fully exploited and the speech distortion is easily perceived,a perceptually motivated robust principal component analysis(ISNRPCA) method is presented.To reflect the nonlinear property for frequency perception of the basilar membrane,cochleagram is utilized as inputs of ISNRPCA.ISNRPCA uses the perceptually meaningful Itakura-Saito measure as its optimization objective function.Moreover,nonnegative constraints are also imposed to regularize the decomposed terms with respect to their physical meaning.We propose an alternating direction method of multipliers(ADMM) to solve the optimization problem of ISNRPCA.ISNRPCA is totally unsupervised,neither the speech nor the noise model needs to be trained beforehand.Experimental results under various noise types and different SNRs demonstrate that ISNRPCA shows promising results for speech denoising.Compared to the state of the art baselines,this method achieves better performance on noise suppression and demonstrates at least comparable intelligibility and overall speech quality.

FullText(HTML)

References (0)

Cited By

Unsupervised speech denoising via perceptually motivated robust principal component analysis

Graphical Abstract

Abstract

Catalog

Export File

Citation

Format

Content