Sciweavers

TASLP
2010
Autoregressive Models of Amplitude Modulations in Audio Compression
We present a scalable medium bit-rate wide-band audio coding technique based on frequency domain linear prediction (FDLP). FDLP is an efficient method for representing the long-ter...
Sriram Ganapathy, Petr Motlíček, Hynek Herm...
TASLP
2010
The CALO Meeting Assistant System
The CALO Meeting Assistant (MA) provides for distributed meeting capture, annotation, automatic transcription and semantic analysis of multiparty meetings, and is part of ...
Gökhan Tür, Andreas Stolcke, L. Voss, St...
TASLP
2010
Multipitch Estimation of Piano Sounds Using a New Probabilistic Spectral Smoothness Principle
A new method for the estimation of multiple concurrent pitches in piano recordings is presented. It addresses the issue of overlapping overtones by modeling the spectral ...
Valentin Emiya, Roland Badeau, Bertrand David
TASLP
2010
Under-Determined Reverberant Audio Source Separation Using a Full-Rank Spatial Covariance Model
This article addresses the modeling of reverberant recording environments in the context of under-determined convolutive blind source separation. We model the contribution of each ...
Ngoc Q. K. Duong, Emmanuel Vincent, Rémi Gr...
TASLP
2010
Three Dimensions of Pitched Instrument Onset Detection
In this paper, we suggest a novel group delay based method for the onset detection of pitched instruments. It is proposed to approach the problem of onset detection by examining th...
Andre Holzapfel, Yannis Stylianou, Ali Cenk Gedik,...
TASLP
2010
Diffuse Reverberation Model for Efficient Image-Source Simulation of Room Impulse Responses
In many research fields of engineering and acoustics, the image-source model represents one of the most popular tools for the simulation of sound fields in virtual rever...
Eric A. Lehmann, Anders M. Johansson
TASLP
2010
Audio-Based Semantic Concept Classification for Consumer Video
This paper presents a novel method for automatically classifying consumer video clips based on their soundtracks. We use a set of 25 overlapping semantic classes, chosen ...
Keansub Lee, Daniel P. W. Ellis
TASLP
2010
Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-Peak Regions
This paper presents a maximum likelihood approach to multiple fundamental frequency (F0) estimation for a mixture of harmonic sound sources, where the power spectrum of a time fra...
Zhiyao Duan, Bryan Pardo, Changshui Zhang
TASLP
2010
Glimpsing IVA: A Framework for Overcomplete/Complete/Undercomplete Convolutive Source Separation
Independent vector analysis (IVA) is a method for separating convolutedly mixed signals that significantly reduces the occurrence of the well-known permutation problem in...
Alireza Masnadi-Shirazi, Wenyi Zhang, Bhaskar D. R...
TASLP
2010
Broadband Source Localization From an Eigenanalysis Perspective
Broadband source localization has several applications ranging from automatic video camera steering to target signal tracking and enhancement through beamforming. Consequ...
Mehrez Souden, Jacob Benesty, Sofiène Affes