Title: Discrimination algorithm using voiced detection method and time-delay neural network system by 3 FFT sub-bands
Authors: Jae Seung Choi
Addresses: Department of Electronic Engineering, College of Engineering, Silla University, 140 Baegyang-daero Blvd., 700 Beon-gil Rd., Sasang-gu, Busan, 617-736, Korea
Abstract: In the field of speech processing, the main application of a neural network is the category discrimination of phoneme recognition and speech/speaker recognition. In a speech signal, information on the time variation is significant when training the neural network for the speech signal input. Therefore, this paper proposes a speech discrimination algorithm in noisy speech signals using a voiced detection method and a time-delay neural network with a time structure. Thus, this algorithm first detects voiced sections using the proposed neural network at each frame in the condition of background noises, then discriminates the speech signals using the time-delay neural network based on three fast Fourier transform sub-bands, in the noisy environments. The effectiveness of the proposed algorithm is experimentally confirmed based on measuring the correct discrimination rates for speech degraded by various noises.
Keywords: discrimination rate; voiced detection; speech discrimination; time-delay neural networks; TDNN; fast Fourier transform; FFT; background noise; speech processing; speech recognition; phoneme recognition; speaker recognition.
DOI: 10.1504/IJCVR.2015.068795
International Journal of Computational Vision and Robotics, 2015 Vol.5 No.2, pp.99 - 111
Received: 09 Jul 2013
Accepted: 13 Jan 2014
Published online: 13 Apr 2015 *