Title: Topological invariants as speech features for automatic speech recognition

Authors: Juraj Kacur; Vladimir Chudy

Addresses: Institute of Telecommunications, Faculty of Electrical Engineering and Information Technology, Slovak University of Technology, Ilkovičova 3, Bratislava 812 19, Slovakia; Department of Psychology, Faculty of Philosophy, Comenius University in Bratislava, Gondova 2, Bratislava 814 99, Slovakia

Abstract: The article presents topological invariants as speech features for speech recognition systems based on hidden Markov models (HMMs). A short introduction to the mathematical concept of topological invariants and space symmetries is provided in the context of the speech recognition problem. This involves a basic overview of the relevant auditory characteristic and its modelling in order to identify possible symmetries and invariants. Once the concept is derived, several of its modifications vital for HMM systems, such as dimensionality reduction, within-class feature decorrelation and signal plane rotation, are presented and evaluated on a real system. The final system is evaluated and compared to other features using both context-dependent and context-independent models. Tests were carried out on a professional speech database, where the achieved accuracies reached up to 97.7%, 98.7% and 98.9% for the string-of-digits, application-words and isolated-digits tests, respectively.

Keywords: automatic speech recognition; ASR; topological invariants; speech features; HMM; hidden Markov models; auditory system; LDA; decorrelation; space symmetries; modelling.

DOI: 10.1504/IJSISE.2014.066601

International Journal of Signal and Imaging Systems Engineering, 2014, Vol. 7, No. 4, pp. 235-244

Received: 17 Mar 2012
Accepted: 30 May 2013

Published online: 29 Dec 2014
