Optimal Design of Microphone Array for Humanoid-Robot Audition

V. Tourbabin and B. Rafaely (Ben-Gurion University of the Negev)
Israeli Conference on Robotics (ICR 2016), Herzelia, Israel, April 2016
[showhide type=”Abstract”] Abstract: One of the important components of a humanoid robot is its auditory system. The system is mainly aimed at increasing the robot’s awareness of its surroundings and at enabling natural human-robot interaction using speech. The auditory system is usually based on a microphone array which constitutes the front end for sound acquisition. Configuration of this array plays a central role in the performance of the system as a whole. There are robot-audition related publications concerned with the optimization of the array configuration for enhancing the spatial information acquired by the array [1] and for improving the sound localisation performance [2]. However, spatial aliasing, which is one of the major problems in array design, remains largely untreated in the humanoid robot audition literature. The current work presents a method for microphone positioning optimisation that extends the aliasing-free frequency range of the array. The method can be used to complement the existing techniques for aliasing cancellation by signal processing [3]. The efficacy of the proposed method to reduce aliasing is demonstrated by showing a significant performance improvement, when compared to using the efficient nearly-uniform microphone distribution. The proposed method is applied to the design of a new 12-microphone array for the NAO robot. This design is subject to the real constraints on microphone positioning due to the robot’s cameras, loudspeakers, and other components. An initial evaluation of the new prototype head manufactured by Aldebaran Robotics according to the design is discussed and an example is presented showing its efficacy for reducing the speech-recognition error rates in real-world environments. [/showhide]
Copyright Notice: This material is presented to ensure timely dissemination of scholarly and technical work. Copyright and all rights therein are retained by authors or by other copyright holders. All persons copying this information are expected to adhere to the terms and constraints invoked by each author’s copyright. In most cases, these works may not be reposted without the explicit permission of the copyright holder.
Paper: Paper_ICR_2016_BGU_VT