EI / SCOPUS / CSCD 收录

中文核心期刊

JIAN Zhihua, WANG Xiangwen. A modified algorithm for voice conversion using compressed sensing[J]. ACTA ACUSTICA, 2014, 39(3): 400-406. DOI: 10.15949/j.cnki.0371-0025.2014.03.016
Citation: JIAN Zhihua, WANG Xiangwen. A modified algorithm for voice conversion using compressed sensing[J]. ACTA ACUSTICA, 2014, 39(3): 400-406. DOI: 10.15949/j.cnki.0371-0025.2014.03.016

A modified algorithm for voice conversion using compressed sensing

  • A voice conversion algorithm, which makes use of the information between continuous frames of speech by compressed sensing, is proposed in this paper. According to the sparsity property of the concatenated vector of several continuous Linear Spectrum Pairs (LSP) in the discrete cosine transformation domain, this paper utilizes compressed sensing to extract the compressed vector from the concatenated LSPs and uses it as the feature vector to train the conversion function. The results of evaluations demonstrate that the performance of this approach can averagety improve 3.21% comparing with the conventional algorithm based on weighted frequency warping when choosing the appropriate numbers of speech frame. The experimental results also illustrate that the performance of voice conversion system can be itnproved by taking full advantage of the inter-frame information, because those information can make the converted speech remain the more stable acoustic properties which is inherent in inter-frames.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return