TASLP 2010 | Sciweavers

TASLP
2010

Audio-Based Semantic Concept Classification for Consumer Video

13 years 6 months ago

This paper presents a novel method for automatically classifying consumer video clips based on their soundtracks. We use a set of 25 overlapping semantic classes, chosen ...

Keansub Lee, Daniel P. W. Ellis

Multiple Fundamental Frequency Estimation by Modeling Spectral Peaks and Non-Peak Regions

13 years 6 months ago

Download music.cs.northwestern.edu

This paper presents a maximum likelihood approach to multiple fundamental frequency (F0) estimation for a mixture of harmonic sound sources, where the power spectrum of a time fra...

Zhiyao Duan, Bryan Pardo, Changshui Zhang

Glimpsing IVA: A Framework for Overcomplete/Complete/Undercomplete Convolutive Source Separation

13 years 6 months ago

Download dsp.ucsd.edu

Independent vector analysis (IVA) is a method for separating convolutedly mixed signals that significantly reduces the occurrence of the well-known permutation problem in...

Alireza Masnadi-Shirazi, Wenyi Zhang, Bhaskar D. R...

Broadband Source Localization From an Eigenanalysis Perspective

13 years 6 months ago

Download externe.emt.inrs.ca

Broadband source localization has several applications ranging from automatic video camera steering to target signal tracking and enhancement through beamforming. Consequ...

Mehrez Souden, Jacob Benesty, Sofiène Affes

Gaussian Model-Based Multichannel Speech Presence Probability

13 years 6 months ago

Download externe.emt.inrs.ca

The knowledge of the target speech presence probability in a mixture of signals captured by a speech communication system is of paramount importance in several applications includi...

Mehrez Souden, Jingdong Chen, Jacob Benesty, Sofi...

Using Reverberation to Improve Range and Elevation Discrimination for Small Array Sound Source Localization

13 years 6 months ago

Download research.microsoft.com

Sound source localization (SSL) is an essential task in many applications involving speech capture and enhancement. As such, speaker localization with microphone arrays has receive...

Flavio Ribeiro, Cha Zhang, Dinei A. F. Florê...

Speech Enhancement Using Gaussian Scale Mixture Models

13 years 6 months ago

Download papers.cnl.salk.edu

This paper presents a novel probabilistic approach to speech enhancement. Instead of a deterministic logarithmic relationship, we assume a probabilistic relationship between the fr...

Jiucang Hao, Te-Won Lee, Terrence J. Sejnowski

Error Approximation and Minimum Phone Error Acoustic Model Estimation

13 years 6 months ago

Download mi.eng.cam.ac.uk

Minimum phone error (MPE) acoustic parameter estimation involves calculation of edit distances (errors) between correct and incorrect hypotheses. In the context of large vocabulary...

Matt Gibson, Thomas Hain

Evaluating Source Separation Algorithms With Reverberant Speech

13 years 6 months ago

Download cns-web.bu.edu

This paper examines the performance of several source separation systems on a speech separation task for which human intelligibility has previously been measured. For anechoic mixt...

Michael I. Mandel, S. Bressler, Barbara G. Shinn-C...

Batch and Adaptive PARAFAC-Based Blind Separation of Convolutive Speech Mixtures

13 years 6 months ago

Download perso-etis.ensea.fr

We present a frequency-domain technique based on PARAllel FACtor (PARAFAC) analysis that performs multichannel blind source separation (BSS) of convolutive speech mixtures. PARAFAC...

Dimitri Nion, Kleanthis N. Mokios, Nicholas D. Sid...
