幅度调制信息对乐器识别的影响
The effects of amplitude modulation cues on musical instrument recognition
-
摘要: 通过乐器音自动识别实验研究了幅度调制(amplitude modulation,AM)信息对于人类乐器识别的影响。具体步骤为:依据听觉模型,提取乐器音信号中若干频带中的AM信息,再基于所得到的AM信息计算统计学特征,采用逐对比较的支持向量机法进行乐器音的机器识别。采用了5种分频带数目(2,4,8,16和32)和4种AM计算方法。结果表明,频带数的增加有助于识别效果的提高,但从16频带到32频带效果趋近平稳;不同的AM提取方法也会对识别结果产生影响,其中解析信号法产生的AM信息提供了最好的乐器识别效果。分析发现自动识别结果高于采用相似的AM信息的人类识别结果。该自动识别系统为人工耳蜗或声码器仿真声模型的乐器识别提供了一个计算模型,对人工耳蜗乐器识别实验和训练具有参考价值。Abstract: The effects of amplitude modulation (AM) cues on human musical instrument recognition were investigated.An automatic musical instrument recognition experiment was carried out.The following stages were manipulated:calculation of the statistical features from AM of the musical signals extracted based on the auditory model;a support vector machine with pairwise classification strategy was used as the classifier.Totally five band conditions (2,4,8,16 and 32) and four AM extraction methods were evaluated.Results showed that recognition performance improved with the increasing of the band number,and 16 bands were sufficient for reaching asymptotic performance.The AM extraction methods also had significant effects on the performance.The analytic signal based method showed better performance than the rectification-low-pass-filter methods.The automatic recognition system showed better performance than human subjects using similar AM information.The automatic recognition system provided a computational model for predicting the performance of musical instrument recognition with cochlear implants and vocoder simulations and can be used as a reference in relevant experiments and training about musical instrument recognition with cochlear implants.