Spectro-temporal features for audio replay attack detection
Online publication date: Thu, 28-Jan-2021
by R. Hemavathi; R. Kumara Swamy
International Journal of High Performance Computing and Networking (IJHPCN), Vol. 16, No. 2/3, 2020
Abstract: Speaker verification is the process of verifying a person's identity from his or her utterance. The major challenge in deploying automatic speaker verification in security applications is spoofing attacks: speaker verification systems can be spoofed using pre-recorded (replayed) speech, synthetic speech, and voice-converted speech. Hence, spoof detection systems are needed to make voice biometrics viable for security applications. This paper explores time-frequency representations obtained using a gammatone filterbank and the constant Q transform for detecting presentation attacks on automatic speaker verification. Experiments are carried out on the ASVspoof 2017 database, and the results are compared with state-of-the-art replay speech detection systems based on cepstral features.
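As a rough illustration of one of the time-frequency representations named in the abstract, the following is a minimal sketch of a directly evaluated constant Q transform for a single frame. It is not the authors' implementation: the function names, the Hamming window, and the parameter values (`f_min`, `n_bins`, `bins_per_octave`) are assumptions chosen only for the example, and the gammatone filterbank is not covered.

```python
import numpy as np

def cqt_center_frequencies(f_min, n_bins, bins_per_octave):
    """Geometrically spaced centre frequencies: f_k = f_min * 2**(k / B)."""
    k = np.arange(n_bins)
    return f_min * 2.0 ** (k / bins_per_octave)

def naive_cqt_frame(frame, sample_rate, f_min=100.0, n_bins=24, bins_per_octave=12):
    """Constant-Q magnitudes for one frame (direct, unoptimised evaluation).

    All default parameter values are illustrative assumptions.
    """
    freqs = cqt_center_frequencies(f_min, n_bins, bins_per_octave)
    # Quality factor Q = f_k / bandwidth_k, identical for every bin.
    q = 1.0 / (2.0 ** (1.0 / bins_per_octave) - 1.0)
    mags = np.empty(n_bins)
    for i, f_k in enumerate(freqs):
        # The window shortens as frequency rises, which keeps Q constant.
        n_k = min(len(frame), int(np.ceil(q * sample_rate / f_k)))
        n = np.arange(n_k)
        kernel = np.hamming(n_k) * np.exp(-2j * np.pi * f_k * n / sample_rate)
        mags[i] = np.abs(np.dot(frame[:n_k], kernel)) / n_k
    return freqs, mags
```

Applying `naive_cqt_frame` to successive frames of an utterance and stacking the magnitude vectors gives a time-frequency image with fine frequency resolution at low frequencies and fine time resolution at high frequencies, which is the property that distinguishes the constant Q transform from a fixed-window short-time Fourier transform.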