Speech blind extraction algorithm based on sound source azimuth information and nonlinear time-frequency masking
-
-
Abstract
For the underdetermined convolution mixture model,a new speech blind extraction algorithm based on sound source azimuth information and nonlinear time-frequency masking was proposed.At first,instantaneous ITDs were calculated through time-frequency analysis in lower frequency domain,and the number of sources and their ITDs were estimated using the potential function.Then the object source was locked and accurate azimuth information of object was estimated.At last,the object speech was extracted via nonlinear time-frequency masking which was based on the azimuth information of object.Simulation results showed that our proposed speech extraction algorithm can lock interested object speech from random direction and extract object speech effectively,the signal-noise-ratio gain(SNRG) was obtained 9.5 dB averagely in our experiment condition.
-
-