Title: A microphone array beamforming-based system for multi-talker speech separation
Authors: Adel Hidri; Hamid Amiri
Addresses: Laboratoire de Recherche: Signal, Image et Technologie de l'Information (LR-SITI), École Nationale d'Ingénieurs de Tunis (ENIT), BP 37, le Belvédère 1002 Tunis, Tunisia ' Laboratoire de Recherche: Signal, Image et Technologie de l'Information (LR-SITI), École Nationale d'Ingénieurs de Tunis (ENIT), BP 37, le Belvédère 1002 Tunis, Tunisia
Abstract: This paper presents a Multichannel Speech Separation System (MCSS) based on new beamforming frequency domain method. The beamformer exploits the spatial properties of the source signals using a microphone array. Therefore, it is based on a prior knowledge of the position of the speakers relative to the array. The proposed beamformer is defined with two processing steps: the first one is to keep a unit gain of the desired signal and the other blocks the wanted signal and minimises the output power of the interferences within only one step. In order to separate multiple speakers, multiple beamformers are used simultaneously, where a beamformer is computed for each source considering the remaining sources as interferers. We test and evaluate the proposed MCSS on real recording mixtures extracted from 'Multichannel In-Car Speech Database'. The experimental results proved the effectiveness of the proposed system in terms of speech separation. The quality of speech will be improved compared to the state-of-the-art.
Keywords: speech signals; beamforming; microphone arrays; multichannel speech separation; optimal filtering; spatial filters; multi-talker speech; speech quality.
DOI: 10.1504/IJSISE.2016.078257
International Journal of Signal and Imaging Systems Engineering, 2016 Vol.9 No.4/5, pp.209 - 217
Received: 20 May 2015
Accepted: 08 Feb 2016
Published online: 10 Aug 2016 *