TASLP
2010
Sound Field Reproduction using the Lasso
Reproducing a sampled sound field using an array of loudspeakers is a problem with well-appreciated applications to acoustics and ultrasound treatment. Loudspeaker signal design ha...
G. N. Lilis, Daniele Angelosante, Georgios B. Gian...
TASLP
2010
Integration of Statistical Models for Dictation of Document Translations in a Machine-Aided Human Translation Task
This paper presents a model for machine aided human translation (MAHT) that integrates source language text and target language acoustic information to produce the text t...
Aarthi Reddy, Richard C. Rose
TASLP
2010
Unsupervised Equalization of Lombard Effect for Speech Recognition in Noisy Adverse Environments
In the presence of environmental noise, speakers tend to adjust their speech production in an effort to preserve intelligible communication. The noise-induced speech adjustments, c...
Hynek Boril, John H. L. Hansen
TASLP
2010
A Watermarking-Based Method for Informed Source Separation of Audio Signals With a Single Sensor
In this paper, the issue of audio source separation from a single channel is addressed, i.e. the estimation of several source signals from a single observation of their mixture. Th...
Mathieu Parvaix, Laurent Girin, Jean-Marc Brossier
TASLP
2010
Solving Demodulation as an Optimization Problem
We introduce two new methods for the demodulation of acoustic signals by posing the problem in a convex optimization framework. This allows the parameters of the modulator and carr...
Gregory Sell, Malcolm Slaney
TASLP
2010
Hierarchical Bayesian Language Models for Conversational Speech Recognition
Traditional n-gram language models are widely used in state-of-the-art large vocabulary speech recognition systems. This simple model suffers from some limitations, such as overfi...
Songfang Huang, Steve Renals
TASLP
2010
Audio Signal Representations for Indexing in the Transform Domain
Indexing audio signals directly in the transform domain can potentially save a significant amount of computation when working on a large database of signals stored in a lossy compr...
Emmanuel Ravelli, Gaël Richard, Laurent Daude...
TASLP
2010
Active Learning With Sampling by Uncertainty and Density for Data Annotations
To solve the knowledge bottleneck problem, active learning has been widely used for its ability to automatically select the most informative unlabeled examples for human annotation...
Jingbo Zhu, Huizhen Wang, Benjamin K. Tsou, Matthe...
TASLP
2010
Efficient and Robust Music Identification With Weighted Finite-State Transducers
We present an approach to music identification based on weighted finite-state transducers and Gaussian mixture models, inspired by techniques used in large-vocabulary speech recogn...
Mehryar Mohri, Pedro Moreno, Eugene Weinstein
TASLP
2010
Multichannel Nonnegative Matrix Factorization in Convolutive Mixtures for Audio Source Separation
We consider inference in a general data-driven object-based model of multichannel audio data, assumed generated as a possibly underdetermined convolutive mixture of sourc...
Alexey Ozerov, Cédric Févotte