Real-Time Robot Audition System That Recognizes Simultaneous Speech in The Real World

14 years 9 months ago

Download people.xiph.org

— This paper presents a robot audition system that recognizes simultaneous speech in the real world by using robotembedded microphones. We have previously reported Missing Feature Theory (MFT) based integration of Sound Source Separation (SSS) and Automatic Speech Recognition (ASR) for building robust robot audition. We demonstrated that a MFTbased prototype system drastically improved the performance of speech recognition even when three speakers talked to a robot simultaneously. However, the prototype system had three problems; being ofﬂine, hand-tuning of system parameters, and failure in Voice Activity Detection (VAD). To attain online processing, we introduced FlowDesigner-based architecture to integrate sound source localization (SSL), SSS and ASR. This architecture brings fast processing and easy implementation because it provides a simple framework of shared-object-based integration. To optimize the parameters, we developed Genetic Algorithm (GA) based parameter optimizatio...

Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano

Real-time Traffic

IROS 2006 | Robot Audition | Robotics | Simultaneous Speech | Sound Source |

claim paper

Post Info
More Details (n/a)

Added	12 Jun 2010
Updated	12 Jun 2010
Type	Conference
Year	2006
Where	IROS
Authors	Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Kazunori Komatani, Tetsuya Ogata, Hiroshi G. Okuno

Comments (0)

Sciweavers

Real-Time Robot Audition System That Recognizes Simultaneous Speech in The Real World

IROS 2006 | Robot Audition | Robotics | Simultaneous Speech | Sound Source |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers