Sciweavers

ICASSP
2010
IEEE
13 years 7 months ago
Searching with expectations
Handling large amounts of data, such as large image databases, requires the use of approximate nearest neighbor search techniques. Recently, Hamming embedding methods such as spec...
Harsimrat Sandhawalia, Herve Jegou
ICASSP
2010
IEEE
13 years 7 months ago
Robust similarity metrics between audio signals based on asymmetrical spectral envelope matching
In this paper, a new type of metric that defines the similarity between musical audio signals is proposed. Based on the spectral flatness criterion, those metrics achieve low co...
Mathieu Lagrange, Roland Badeau, Gaël Richard
ICASSP
2010
IEEE
13 years 7 months ago
Sub-Nyquist processing with the modulated wideband converter
Sub-Nyquist systems capture the signal information in a different fashion than uniform high-rate samples. Consequently, digital processing, which is the prime reason for leaving t...
Moshe Mishali, Asaf Elron, Yonina C. Eldar
ICASSP
2010
IEEE
13 years 7 months ago
The IBM 2008 GALE Arabic speech transcription system
This paper describes the Arabic broadcast transcription system fielded by IBM in the GALE Phase 3.5 machine translation evaluation. Key advances compared to our Phase 2.5 system ...
George Saon, Hagen Soltau, Upendra Chaudhari, Step...
ICASSP
2010
IEEE
13 years 7 months ago
Optimal delayed decoding of predictively encoded sources
Predictive coding eliminates redundancy due to correlations between the current and past signal samples, so that only the innovation, or prediction residual, needs to be encoded. ...
Vinay Melkote, Kenneth Rose
ICASSP
2010
IEEE
13 years 7 months ago
A union of incoherent spaces model for classification
We present a new and computationally efficient scheme for classifying signals into a fixed number of known classes. We model classes as subspaces in which the corresponding data...
Karin Schnass, Pierre Vandergheynst
ICASSP
2010
IEEE
13 years 7 months ago
Tuning phone decoders for language identification
Phonotactic approach, phone recognition to be followed by language modeling, is one of the most popular approaches to language identification (LID). In this work, we explore how ...
C. P. Santhosh Kumar, Haizhou Li, Rong Tong, Pavel...
ICASSP
2010
IEEE
13 years 7 months ago
Variational nonparametric Bayesian Hidden Markov Model
The Hidden Markov Model (HMM) has been widely used in many applications such as speech recognition. A common challenge for applying the classical HMM is to determine the structure...
Nan Ding, Zhijian Ou
ICASSP
2010
IEEE
13 years 7 months ago
Approximate eigenvalue decomposition of para-Hermitian systems through successive FIR paraunitary transformations
The eigenvalue decomposition (EVD) of a Hermitian matrix in terms of unitary matrices is well known. In this paper, we present an algorithm for the approximate EVD (AEVD) of a par...
Andre Tkacenko
ICASSP
2010
IEEE
13 years 7 months ago
Voice source estimation for artificial bandwidth extension of telephone speech
Artificial bandwidth extension (ABWE) of speech signals aims to estimate wideband speech (50 Hz – 7 kHz) from narrowband signals (300 Hz – 3.4 kHz). Applying the source-filt...
Mark R. P. Thomas, Jon Gudnason, Patrick A. Naylor...