Title: VAD, feature extraction and modelling techniques for speaker recognition: a review
Authors: Spoorti J. Jainar; Pritam Limbaji Sale; B.G. Nagaraja
Addresses: Department of E&CE, Visvesvaraya Technological University, Belagavi – 18, Karnataka, India ' Department of E&CE, Visvesvaraya Technological University, Belagavi – 18, Karnataka, India ' Department of E&CE, Jain Institute of Technology, Davangere – 03, Karnataka, India
Abstract: This paper reviews an automatic speaker recognition technology, with an emphasis on state-of-the-art voice activity detection (VAD), feature extraction and speaker-modelling techniques that have emerged during the last few years. Researchers in the field of speaker recognition have made a few attempts to recognise the speaker in the language mismatch environment and limited data condition.To address robustness issues, we also elaborate language mismatch and limited data speaker recognition. Further, this paper identified some issues with the existing speaker recognition systems and also investigated areas of possible improvements in speaker recognition field. We conclude the paper with a discussion on the possible future directions.
Keywords: VAD; voice activity detection; speaker identification; speaker verification; language mismatch; limited data; multilingual; features; modelling techniques.
DOI: 10.1504/IJSISE.2020.113552
International Journal of Signal and Imaging Systems Engineering, 2020 Vol.12 No.1/2, pp.1 - 18
Received: 26 Apr 2018
Accepted: 13 May 2019
Published online: 11 Mar 2021 *