Light-weight speech separation based on dual-path attention and recurrent neural network

YANG Yi; HU Qi; ZHANG Pengyuan

doi:10.12395/0371-0025.2022044

YANG Yi, HU Qi, ZHANG Pengyuan. Light-weight speech separation based on dual-path attention and recurrent neural network[J]. ACTA ACUSTICA, 2023, 48(5): 1060-1069. DOI: 10.12395/0371-0025.2022044

Citation:

YANG Yi, HU Qi, ZHANG Pengyuan. Light-weight speech separation based on dual-path attention and recurrent neural network[J]. ACTA ACUSTICA, 2023, 48(5): 1060-1069. DOI: 10.12395/0371-0025.2022044

Citation:

YANG Yi, HU Qi, ZHANG Pengyuan. Light-weight speech separation based on dual-path attention and recurrent neural network[J]. ACTA ACUSTICA, 2023, 48(5): 1060-1069. DOI: 10.12395/0371-0025.2022044

Light-weight speech separation based on dual-path attention and recurrent neural network

Graphical Abstract

Graphical Abstract

Abstract

Abstract

A light-weight speech separation algorithm based on dual-path attention and recurrent neural network is proposed. First, optional branch structures based on dual-path attention mechanism and dual-path recurrent network are utilized to model the speech signals, which facilitate the extraction of deep feature information and the reduction of training parameters. Second, sub-band processing approach is introduced to alleviate the computation burden. As shown by the experimental results on the LibriCSS dataset, the average word error rate obtained by the proposed algorithm is 8.6% with only 0.15 MiB training parameters and 15.2 G/6s computation cost, which is 3.3−391.3 and 1.1−3.2 times smaller than other mainstream approaches. This proves the proposed algorithm can effectively reduce the training parameters and computation cost while achieving high speech separation performance.

FullText(HTML)

References (35)

Cited By

Light-weight speech separation based on dual-path attention and recurrent neural network

Graphical Abstract

Abstract

Catalog

Export File

Citation

Format

Content