Sciweavers

TASLP
2002
143views more  TASLP 2002»
13 years 11 months ago
Distributed speech processing in miPad's multimodal user interface
This paper describes the main components of MiPad (Multimodal Interactive PAD) and especially its distributed speech processing aspects. MiPad is a wireless mobile PDA prototype th...
Li Deng, Kuansan Wang, Alex Acero, Hsiao-Wuen Hon,...
TASLP
2002
73views more  TASLP 2002»
13 years 11 months ago
Perceptual audio coding using adaptive pre- and post-filters and lossless compression
This paper proposes a versatile perceptual audio coding method that achieves high compression ratios and is capable of low encoding/decoding delay. It accommodates a variety of sou...
Gerald Schuller, Bin Yu, Dawei Huang, Bernd Edler
TASLP
2002
79views more  TASLP 2002»
13 years 11 months ago
Perception-based partial encryption of compressed speech
Mobile multimedia applications, the focus of many forthcoming wireless services, increasingly demand low-power techniques implementing content protection and customer privacy. In t...
Antonio Servetti, Juan Carlos De Martin
TASLP
2002
120views more  TASLP 2002»
13 years 11 months ago
Improved audio coding using a psychoacoustic model based on a cochlear filter bank
Perceptual audio coders use an estimated masked threshold for the determination of the maximum permissible just-inaudible noise level introduced by quantization. This estimate is d...
Frank Baumgarte
TASLP
2002
87views more  TASLP 2002»
13 years 11 months ago
A new audio coding scheme using a forward masking model and perceptually weighted vector quantization
This paper presents a new audio coder that includes two techniques to improve the sound quality of the audio coding system. First, a forward masking model is proposed. This model e...
Yuan-Hao Huang, Tzi-Dar Chiueh
TASLP
2002
96views more  TASLP 2002»
13 years 11 months ago
MAP speaker adaptation of state duration distributions for speech recognition
This paper presents a framework for maximum a posteriori (MAP) speaker adaptation of state duration distributions in hidden Markov models (HMM). Four key issues of MAP estimation, ...
Néstor Becerra Yoma, Jorge Silva Sán...
TASLP
2002
143views more  TASLP 2002»
13 years 11 months ago
Creating conversational interfaces for children
Creating conversational interfaces for children is challenging in several respects. These include acoustic modeling for automatic speech recognition (ASR), language and dialog mode...
Shrikanth S. Narayanan, Alexandros Potamianos
TASLP
2002
84views more  TASLP 2002»
13 years 11 months ago
Maximum likelihood multiple subspace projections for hidden Markov models
The first stage in many pattern recognition tasks is to generate a good set of features from the observed data. Usually, only a single feature space is used. However, in some compl...
Mark J. F. Gales
TASLP
2002
67views more  TASLP 2002»
13 years 11 months ago
Efficient tracking of the cross-correlation coefficient
In many (audio) processing algorithms, involving manipulation of discrete-time signals, the performance can vary strongly over the repertoire that is used. This may be the case whe...
Ronald M. Aarts, Roy Irwan, Augustus J. E. M. Jans...