Search Sciweavers | Sciweavers

775 search results - page 70 / 155

» Processing Self Corrections in a speech to speech system

click to vote

ISM
2008
IEEE

136views Multimedia» more ISM 2008»

Multimodal Speaker Segmentation in Presence of Overlapped Speech Segments

14 years 2 months ago

Download sail.usc.edu

We propose a multimodal speaker segmentation algorithm with two main contributions: First, we suggest a hidden Markov model architecture that performs fusion of the three modaliti...

Viktor Rozgic, Kyu Jeong Han, Panayiotis G. Georgi...

claim paper

Read More »

click to vote

CHI
2006
ACM

100views Human Computer Interaction» more CHI 2006»

Speech pen: predictive handwriting based on ambient multimodal recognition

14 years 8 months ago

Download staff.aist.go.jp

It is tedious to handwrite long passages of text by hand. To make this process more efficient, we propose predictive handwriting that provides input predictions when the user writ...

Kazutaka Kurihara, Masataka Goto, Jun Ogata, Takeo...

claim paper

Read More »

click to vote

CCECE
2006
IEEE

222views Electrical And Computer Engi...» more CCECE 2006»

An Online System for Synchronized Processing of Video and Audio Signals

14 years 1 months ago

Download users.encs.concordia.ca

For many audio-visual applications, the integration and synchronization of audio and video signals is essential. The objective of this paper is to develop a system that displays t...

Mary Mikhail, Giovanni Palumbo, Jinane Mohammad, M...

claim paper

Read More »

click to vote

ICASSP
2011
IEEE

190views Signal Processing» more ICASSP 2011»

Utilizing glottal source pulse library for generating improved excitation signal for HMM-based speech synthesis

12 years 11 months ago

Download mirlab.org

This paper describes a source modeling method for hidden Markov model (HMM) based speech synthesis for improved naturalness. A speech corpus is rst decomposed into the glottal sou...

Tuomo Raitio, Antti Suni, Hannu Pulakka, Martti Va...

claim paper

Read More »

click to vote

ICASSP
2008
IEEE

214views Signal Processing» more ICASSP 2008»

Audiovisual-to-articulatory speech inversion using Active Appearance Models for the face and Hidden Markov Models for the dynami

14 years 2 months ago

Download cvsp.cs.ntua.gr

We are interested in recovering aspects of vocal tract’s geometry and dynamics from auditory and visual speech cues. We approach the problem in a statistical framework based on ...

Athanassios Katsamanis, George Papandreou, Petros ...

claim paper

Read More »

« Prev « First page 70 / 155 Last » Next »

Sciweavers

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers