Acoustic source localisation and tracking


As a result of my research on array-based speaker tracking in reverberant environments, a particle filter using the concept of importance sampling was developed and implemented in real-time on a standard desktop computer, running in conjunction with a 16 microphone array. Below are a few examples of what this real-time acoustic source tracker can do. The audio signal in the movie corresponds to the data recorded with microphone nr. 2, close to the bottom right-hand corner of the display. The green dot displays the estimated location of the acoustic source, the red markers show the history of the source location estimates over the last few frames (estimated source trajectory).

The only information used by the algorithm to localise and track the acoustic source are the signals recorded by the 16 microphones located at known positions in the room. The dimensions of the enclosure are about 3.5m x 4.5m x 2.7m. Odd-numbered microphones are placed at a height of 1.9m, even-numbered at 1.3m. The reverberation time (T60) measured in the room is 0.5s, which is quite substantial for this type of application.

Please be aware that these movies require the proper Xvid codec to be displayed correctly under Windows/Mac. Otherwise, the video stream might freeze from time to time, and the video and audio streams might get out of synchronisation. Open source software (mplayer, xine, vlc) don't seem to have a problem with it though! You can get this type of codecs from various places on the www. Get it for Windows and Mac e.g. here, or just Google it, e.g. like this!


Male speech examples:

Example #1 (261K): malespeech1.avi
Example #2 (248K): malespeech2.avi
Example #3 (250K): malespeech3.avi


Female speech examples:

Example #1 (137K): femalespeech1.avi
Example #2 (140K): femalespeech2.avi
Example #3 (199K): femalespeech3.avi
Example #4 (202K): femalespeech4.avi
Example #5 (202K): femalepseech5.avi


Pink noise examples:

Example #1 (266K): pinknoise1.avi
Example #2 (270K): pinknoise2.avi
Example #3 (252K): pinknoise3.avi


3-way conversation examples: one talker is mobile, the other two stationary

Example #1 (340K): conversation1.avi
Example #2 (348K): conversation2.avi


Home