Go To Program
29

Robust Neuro-Fuzzy Speaker Localization Using a Circular Microphone Array

Axel Plinge 1 Marius H. Hennecke 1 Gernot A. Fink 1
1 Intelligent Systems Group, Robotics Research Institute, TU Dortmund, Dortmund, Germany

A major application area of microphone array processing is the localization of sound sources, mainly of speaking persons. In contrast to most state-of-the-art approaches that are based on correlation measures, we propose a neurologically inspired system that generalizes findings about human spatial hearing to the multi-channel case. It mimics the processing in the human cochlea and the auditory mid-brain. To enhance the localization quality, a new spike generation approach is introduced, termed peak-over-average position (PoAP). A fuzzy combination is used to remove putative artifacts. In contrast to a human listener we employ multiple sensors to gain ro- bustness in reverberant and noisy environments. Post-processing estimates the locations of concurrent speakers. The robustness of the proposed system is shown by comparison with the well- known steered response power approach. Finally, we show the applicability of our realtime neuro-fuzzy model to the concurrent speaker localization task using real reverberant recordings.


View pdf