TASLP 2010 | Sciweavers

150

TASLP
2010

97views more TASLP 2010»

Voice Conversion Using Partial Least Squares Regression

15 years 8 days ago

Abstract--Voice conversion can be formulated as finding a mapping function which transforms the features of the source speaker to those of the target speaker. Gaussian mixture mode...

Elina Helander, Tuomas Virtanen, Jani Nurminen, Mo...

claim paper

Read More »

138

click to vote

TASLP
2010

128views more TASLP 2010»

Speaker Diarization Exploiting the Eigengap Criterion and Cluster Ensembles

15 years 8 days ago

Download poseidon.csd.auth.gr

A novel system for speaker diarization is proposed that combines the eigengap criterion and cluster ensembles. No explicit assumptions on the number of speakers are made. Two varia...

Nikoletta Bassiou, Vassiliki Moschou, Constantine ...

claim paper

Read More »

130

click to vote

TASLP
2010

85views more TASLP 2010»

Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech

15 years 3 months ago

Download www.icsi.berkeley.edu

Ümit Güz, Sébastien Cuendet, Dile...

claim paper

Read More »

153

click to vote

TASLP
2010

118views more TASLP 2010»

Time-Frequency Sparsity by Removing Perceptually Irrelevant Components Using a Simple Model of Simultaneous Masking

15 years 3 months ago

Download www.kfs.oeaw.ac.at

Abstract—We present an algorithm for removing timefrequency components, found by a standard Gabor transform, of a “real-world” sound while causing no audible difference to th...

Péter Balázs, Bernhard Laback, Gerha...

claim paper

Read More »

112

click to vote

TASLP
2010

78views more TASLP 2010»

Modulation Spectral Features for Robust Far-Field Speaker Identification

15 years 3 months ago

Download individual.utoronto.ca

Tiago H. Falk, Wai-Yip Chan

claim paper

Read More »

181

click to vote

TASLP
2010

138views more TASLP 2010»

Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals

15 years 3 months ago

Download perso.telecom-paristech.fr

— Extracting the main melody from a polyphonic music recording seems natural even to untrained human listeners. To a certain extent it is related to the concept of source separat...

Jean-Louis Durrieu, Gaël Richard, Bertrand Da...

claim paper

Read More »

134

click to vote

TASLP
2010

102views more TASLP 2010»

Representing Musical Sounds With an Interpolating State Model

15 years 3 months ago

Download www.cs.tut.fi

—A computationally efﬁcient algorithm is proposed for modeling and representing time-varying musical sounds. The aim is to encode individual sounds and not the statistical prop...

Anssi Klapuri, Tuomas Virtanen

claim paper

Read More »

157

click to vote

TASLP
2010

137views more TASLP 2010»

High-Pitch Formant Estimation by Exploiting Temporal Change of Pitch

15 years 3 months ago

Download web.mit.edu

—This paper considers the problem of obtaining an accurate spectral representation of speech formant structure when the voicing source exhibits a high fundamental frequency. Our ...

Tianyu T. Wang, Thomas F. Quatieri

claim paper

Read More »

161

click to vote

TASLP
2010

153views more TASLP 2010»

On Optimal Frequency-Domain Multichannel Linear Filtering for Noise Reduction

15 years 3 months ago

Download externe.emt.inrs.ca

Abstract—Several contributions have been made so far to develop optimal multichannel linear ﬁltering approaches and show their ability to reduce the acoustic noise. However, th...

Mehrez Souden, Jacob Benesty, Sofiène Affes

claim paper

Read More »

116

click to vote

TASLP
2010

127views more TASLP 2010»

New Insights Into the MVDR Beamformer in Room Acoustics

15 years 3 months ago

Download webee.technion.ac.il

—The minimum variance distortionless response (MVDR) beamformer, also known as Capon’s beamformer, is widely studied in the area of speech enhancement. The MVDR beamformer can ...

Emanuël Anco Peter Habets, Jacob Benesty, Isr...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers