Sciweavers

775 search results - page 138 / 155
» Processing Self Corrections in a speech to speech system
ICASSP
2008
IEEE
14 years 2 months ago
Fine-grained pitch accent and boundary tone labeling with parametric F0 features
Motivated by linguistic theories of prosodic categoricity, symbolic representations of prosody have recently attracted the attention of speech technologists. Categorical represent...
Sankaranarayanan Ananthakrishnan, Shrikanth Naraya...
MM
2009
ACM
14 years 2 months ago
Visual speaker localization aided by acoustic models
The following paper presents a novel audio-visual approach for unsupervised speaker locationing. Using recordings from a single, low-resolution room overview camera and a single f...
Gerald Friedland, Chuohao Yeo, Hayley Hung
ICMI
2003
Springer
14 years 26 days ago
A visually grounded natural language interface for reference to spatial scenes
Many user interfaces, from graphic design programs to navigation aids in cars, share a virtual space with the user. Such applications are often ideal candidates for speech interfa...
Peter Gorniak, Deb Roy
AROBOTS
2002
13 years 7 months ago
Multi-Modal Interaction of Human and Home Robot in the Context of Room Map Generation
In robotics, the idea of human and robot interaction is receiving a lot of attention lately. In this paper, we describe a multi-modal system for generating a map of the environment...
Saeed Shiry Ghidary, Yasushi Nakata, Hiroshi Saito...
ICMI
2007
Springer
14 years 1 months ago
Automated generation of non-verbal behavior for virtual embodied characters
In this paper we introduce a system that automatically adds different types of non-verbal behavior to a given dialogue script between two virtual embodied agents. It allows us to ...
Werner Breitfuss, Helmut Prendinger, Mitsuru Ishiz...