Speech blind extraction algorithm based on sound source azimuth information and nonlinear time-frequency masking

XIA Xiuyu; HE Peiyu

doi:10.15949/j.cnki.0371-0025.2013.02.012

XIA Xiuyu, HE Peiyu. Speech blind extraction algorithm based on sound source azimuth information and nonlinear time-frequency maskingJ. ACTA ACUSTICA, 2013, 38(2): 224-230. DOI: 10.15949/j.cnki.0371-0025.2013.02.012

Citation:

Speech blind extraction algorithm based on sound source azimuth information and nonlinear time-frequency masking

XIA Xiuyu,
HE Peiyu

Graphical Abstract

Abstract

Abstract

For the underdetermined convolution mixture model,a new speech blind extraction algorithm based on sound source azimuth information and nonlinear time-frequency masking was proposed.At first,instantaneous ITDs were calculated through time-frequency analysis in lower frequency domain,and the number of sources and their ITDs were estimated using the potential function.Then the object source was locked and accurate azimuth information of object was estimated.At last,the object speech was extracted via nonlinear time-frequency masking which was based on the azimuth information of object.Simulation results showed that our proposed speech extraction algorithm can lock interested object speech from random direction and extract object speech effectively,the signal-noise-ratio gain(SNRG) was obtained 9.5 dB averagely in our experiment condition.

FullText(HTML)

References (0)

Cited By

Speech blind extraction algorithm based on sound source azimuth information and nonlinear time-frequency masking

Abstract

Catalog

Export File

Citation

Format

Content