Sciweavers

775 search results - page 70 / 155
» Processing Self Corrections in a speech to speech system
Sort
View
ISM
2008
IEEE
136views Multimedia» more  ISM 2008»
14 years 2 months ago
Multimodal Speaker Segmentation in Presence of Overlapped Speech Segments
We propose a multimodal speaker segmentation algorithm with two main contributions: First, we suggest a hidden Markov model architecture that performs fusion of the three modaliti...
Viktor Rozgic, Kyu Jeong Han, Panayiotis G. Georgi...
CHI
2006
ACM
14 years 8 months ago
Speech pen: predictive handwriting based on ambient multimodal recognition
It is tedious to handwrite long passages of text by hand. To make this process more efficient, we propose predictive handwriting that provides input predictions when the user writ...
Kazutaka Kurihara, Masataka Goto, Jun Ogata, Takeo...
CCECE
2006
IEEE
14 years 1 months ago
An Online System for Synchronized Processing of Video and Audio Signals
For many audio-visual applications, the integration and synchronization of audio and video signals is essential. The objective of this paper is to develop a system that displays t...
Mary Mikhail, Giovanni Palumbo, Jinane Mohammad, M...
ICASSP
2011
IEEE
12 years 11 months ago
Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis
This paper describes a source modeling method for hidden Markov model (HMM) based speech synthesis for improved naturalness. A speech corpus is rst decomposed into the glottal sou...
Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Va...
ICASSP
2008
IEEE
14 years 2 months ago
Audiovisual-to-articulatory speech inversion using Active Appearance Models for the face and Hidden Markov Models for the dynami
We are interested in recovering aspects of vocal tract’s geometry and dynamics from auditory and visual speech cues. We approach the problem in a statistical framework based on ...
Athanassios Katsamanis, George Papandreou, Petros ...