Chinese audiovisual bimodal speech database CAVSR1.0
Abstract
Audiovisual bimodal speech recognition has become one of the most promising branches of international speech recognition research, and work on Chinese bimodal speech recognition has also begun. However, because capturing visual information is very difficult, only a few audiovisual speech databases have been developed, and no Chinese database of this kind exists yet. We therefore designed and created the audiovisual bimodal speech database CAVSR1.0. It has the following advantages. Its corpus includes all Chinese phonetic units (initials and finals), and it is very large. Its corpus selection conforms to the probability distribution of initials and finals, so conclusions drawn from it can be taken as representative of the Chinese language. Automatic segmentation and automatic labeling of the main features are bundled with it, so it is readily extensible: the database can be conveniently enlarged (number of subjects, speech data, repetitions) according to research requirements.