汉语语音合成系统评价方法
Assessment methods of speech synthesis systems for Chinese
-
摘要: 从1994年开始,对汉语语音合成系统的工作性能定期举行全国评测.采用语言清晰度测试方法,1994年对五个不同的合成系统进行了评测和诊断.听音人为16名大学生(男8,女8),对合成言语没有经验.听音人响应是开放的听音记录.同时,还采用十点主观评价(MOS)测定言语自然度.为给出各合成系统音段层的诊断信息,对合成语音的辅音知觉混淆矩阵进行了分析.借助于对比自然言语和合成言语在不同语言层次上清晰度试验得分间的统计关系,来考察合成系统韵律特征处理的缺陷.结果表明,采用上述方法可得到评测合成系统工作性能的稳定合理的指标.有关韵律特征的评价方法有待于进一步发展.Abstract: A national assesment of the performance of speech synthesis systems for Chinese has been carried out regularly since 1994.The synthetic speech quality of five defferent systems were evaluated and diagnosed by using speech intelligibility tests.16 college students (8 male,8 female) with no experience with synthetic speech were the listeners,they were asked to do open response task by pencil-paper.In addition,speech naturalness was measured by Mean Opinion Score (MOS) on a ten point scale.The perceptual confusion matrices of consonants were analyzed in order to give some diagnostic information of the synthesis systems at segmental level.And the statistical relations between speech intelligibilities at different levels of synthetic speech were compared with that of natural speech to explore some deficiencies in prosody.It is shown that stable and rational quality index and diagnostic information can be obtained in this method.
The eveluation methods of prosody have not been become available.