Abstract:
This paper presents a method for segmenting Chinese initials and finals based on the detection of auditory events. The speech signal is first filtered with a cochlear filter bank, and the auditory events corresponding to abrupt energy changes in each band are then detected. Finally, the auditory events are integrated separately over different frequency ranges to determine the boundaries of Chinese initials and finals. Experimental results show that, at a sampling frequency of 8 kHz, the accuracy is 88.9% for clean speech and above 82.9% for noisy speech at an SNR of 10 dB.
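
The three stages described above (band filtering, per-band event detection, integration of events across bands) can be illustrated with a minimal sketch. It is not the paper's implementation: the cochlear filter bank is not reproduced, so the band signals are assumed to be given, and the function names, the 10 ms frame length, the 6 dB jump threshold, and the two-band voting rule are all hypothetical choices made for illustration only.

```python
import math

def frame_energies(band_signal, frame_len):
    """Short-time energy of one band over non-overlapping frames."""
    n_frames = len(band_signal) // frame_len
    return [
        sum(x * x for x in band_signal[i * frame_len:(i + 1) * frame_len])
        for i in range(n_frames)
    ]

def detect_events(energies, jump_db=6.0, floor=1e-10):
    """Frame indices where log-energy changes abruptly from the previous frame.

    jump_db is an assumed threshold, not a value taken from the paper.
    """
    events = []
    for i in range(1, len(energies)):
        prev_db = 10.0 * math.log10(energies[i - 1] + floor)
        curr_db = 10.0 * math.log10(energies[i] + floor)
        if abs(curr_db - prev_db) >= jump_db:
            events.append(i)
    return events

def integrate_events(band_events, min_bands=2):
    """Keep frames in which at least min_bands bands report an event.

    A simple voting rule standing in for the paper's integration step.
    """
    votes = {}
    for events in band_events:
        for frame in events:
            votes[frame] = votes.get(frame, 0) + 1
    return sorted(f for f, v in votes.items() if v >= min_bands)

# Toy input: three synthetic "band signals" at 8 kHz with 10 ms frames.
SAMPLE_RATE = 8000
FRAME_LEN = 80  # 10 ms at 8 kHz (assumed frame length)

band_a = [0.01] * 160 + [1.0] * 160  # energy jumps at sample 160
band_b = [0.01] * 160 + [1.0] * 160  # same jump in a second band
band_c = [0.5] * 320                 # steady band, no event

band_events = [
    detect_events(frame_energies(band, FRAME_LEN))
    for band in (band_a, band_b, band_c)
]
boundaries = integrate_events(band_events, min_bands=2)
boundary_times = [f * FRAME_LEN / SAMPLE_RATE for f in boundaries]
print(boundaries)  # [2]
```

In this toy case the two bands with an energy jump agree on frame 2 (0.02 s), while the steady band contributes no event. To mirror the integration over different frequency ranges, `integrate_events` could be called once per group of bands, for example once on the low-frequency bands and once on the high-frequency bands.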