Sciweavers

2047 search results - page 189 / 410
» The limits of speech recognition
ICASSP
2011
IEEE
13 years 1 month ago
Comparing multilayer perceptron to Deep Belief Network Tandem features for robust ASR
In this paper, we extend the work done on integrating multilayer perceptron (MLP) networks with HMM systems via the Tandem approach. In particular, we explore whether the use of D...
Oriol Vinyals, Suman V. Ravuri
AAAI
2012
12 years 16 days ago
Online Sequence Alignment for Real-Time Audio Transcription by Non-Experts
Real-time transcription provides deaf and hard of hearing people visual access to spoken content, such as classroom instruction, and other live events. Currently, the only reliabl...
Walter S. Lasecki, Christopher D. Miller, Donato B...
MM
2004
ACM
14 years 3 months ago
Interactive manipulation of replay speed while listening to speech recordings
Today’s interfaces for time-scaled audio replay have limitations especially regarding highly interactive tasks such as skimming and searching, which require quick temporary spee...
Wolfgang Hürst, Tobias Lauer, Georg Götz
TSD
2007
Springer
14 years 4 months ago
A Study on Speech with Manifest Emotions
We present a study of the prosody – seen in a broader sense – that supports the theory of the interrelationship function of speech. “Pure emotions” are meant to show a rela...
Horia-Nicolai L. Teodorescu, Silvia Monica Feraru
ETRA
2008
ACM
13 years 12 months ago
Integrated speech and gaze control for realistic desktop environments
Nowadays various are the situations in which people need to interact with a Personal Computer without having the possibility to use traditional pointing devices, such as a keyboar...
Emiliano Castellina, Fulvio Corno, Paolo Pellegrin...