EI / SCOPUS / CSCD 收录

中文核心期刊

XU Yanjun, DU Limin, LI Guoqiang, ZHANG Xin, ZHOU Zhi. Chinese audiovisual bimodal speech database CAVSR1.O[J]. ACTA ACUSTICA, 2000, 25(1): 42-49. DOI: 10.15949/j.cnki.0371-0025.2000.01.008
Citation: XU Yanjun, DU Limin, LI Guoqiang, ZHANG Xin, ZHOU Zhi. Chinese audiovisual bimodal speech database CAVSR1.O[J]. ACTA ACUSTICA, 2000, 25(1): 42-49. DOI: 10.15949/j.cnki.0371-0025.2000.01.008

Chinese audiovisual bimodal speech database CAVSR1.O

  • Audiovisual bimodal speech recognition has been one of the most promising branch in international speechrecognition area,Chinese bimodal speech recognition has also started.But,as the visual information capturing is verydifficult,there are a few audiovisual speech databases developed,and there are no such Chinese database yet.So wedesigned and created the audiovisual bimodal speech database CAVSR1.0.t has following advantages:Its corpus includesall Chinese phonetic units (initials and finals),and its size is very large.Its corpus selection conforms to the distributionprobability of initials and finals,conclusions from it could stand for Chinese language.There are automatic segmentingand automatic main features labeling bound with it,so it has good extendibility.The database may be enlarged (numberof subjects,speech data,repetitions) conveniently according to research requirements.
  • loading

Catalog

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return