Sciweavers

TASLP
2002
99views more  TASLP 2002»
13 years 11 months ago
Speech pause detection for noise spectrum estimation by tracking power envelope dynamics
A speech pause detection algorithm is an important and sensitive part of most single-microphone noise reduction schemes for enhancement of speech signals corrupted by additive nois...
M. Marzinzik, Birger Kollmeier
TASLP
2002
60views more  TASLP 2002»
13 years 11 months ago
A psychoacoustic approach to combined acoustic echo cancellation and noise reduction
This paper presents and compares algorithms for combined acoustic echo cancellation and noise reduction for hands-free telephones. A structure is proposed, consisting of a conventi...
Stefan Gustafsson, Rainer Martin, Peter Jax, Peter...
TASLP
2002
156views more  TASLP 2002»
13 years 11 months ago
Musical genre classification of audio signals
Abstract--Musical genres are categorical labels created by humans to characterize pieces of music. A musical genre is characterized by the common characteristics shared by its memb...
George Tzanetakis, Perry R. Cook
TASLP
2002
87views more  TASLP 2002»
13 years 11 months ago
A set of corpus-based text-to-speech synthesis technologies for Mandarin Chinese
This paper presents a set of corpus-based text-to-speech synthesis technologies for Mandarin Chinese. A large speech corpus produced by a single speaker is used, and the speech out...
Fu-Chiang Chou, Chiu-yu Tseng, Lin-Shan Lee
TASLP
2002
82views more  TASLP 2002»
13 years 11 months ago
Graceful degradation of speech recognition performance over packet-erasure networks
Abstract--This paper explores packet loss recovery for automatic speech recognition (ASR) in spoken dialog systems, assuming an architecture in which a lightweight client communica...
Constantinos Boulis, Mari Ostendorf, Eve A. Riskin...
TASLP
2002
93views more  TASLP 2002»
13 years 11 months ago
Robust endpoint detection and energy normalization for real-time speech and speaker recognition
When automatic speech recognition (ASR) and speaker verification (SV) are applied in adverse acoustic environments, endpoint detection and energy normalization can be crucial to th...
Qi Li, Jinsong Zheng, A. Tsai, Qiru Zhou
TASLP
2002
65views more  TASLP 2002»
13 years 11 months ago
Low-bitrate distributed speech recognition for packet-based and wireless communication
In this paper, we present a framework for developing source coding, channel coding and decoding as well as erasure concealment techniques adapted for distributed (wireless or packe...
A. Bernard, Abeer Alwan
TASLP
2002
99views more  TASLP 2002»
13 years 11 months ago
A system for spoken query information retrieval on mobile devices
Abstract--With the proliferation of handheld devices, information access on mobile devices is a topic of growing relevance. This paper presents a system that allows the user to sea...
E. Chang, Frank Seide, Helen M. Meng, Zhuoran Chen...
TASLP
2002
86views more  TASLP 2002»
13 years 11 months ago
High-level approaches to confidence estimation in speech recognition
Abstract--We describe some high-level approaches to estimating confidence scores for the words output by a speech recognizer. By "high-level" we mean that the proposed me...
Stephen Cox, Srinandan Dasmahapatra
TASLP
2002
124views more  TASLP 2002»
13 years 11 months ago
A robust compensation strategy for extraneous acoustic variations in spontaneous speech recognition
In this paper, we propose a robust compensation strategy to deal effectively with extraneous acoustic variations for spontaneous speech recognition. This strategy extends speaker a...
Hui Jiang, Li Deng