Sciweavers

TASLP
2010
13 years 6 months ago
Voice Conversion Using Partial Least Squares Regression
Voice conversion can be formulated as finding a mapping function which transforms the features of the source speaker to those of the target speaker. Gaussian mixture mode...
Elina Helander, Tuomas Virtanen, Jani Nurminen, Mo...
TASLP
2010
13 years 6 months ago
Speaker Diarization Exploiting the Eigengap Criterion and Cluster Ensembles
A novel system for speaker diarization is proposed that combines the eigengap criterion and cluster ensembles. No explicit assumptions on the number of speakers are made. Two varia...
Nikoletta Bassiou, Vassiliki Moschou, Constantine ...
TASLP
2010
13 years 9 months ago
Multi-View Semi-Supervised Learning for Dialog Act Segmentation of Speech
Ümit Güz, Sébastien Cuendet, Dile...
TASLP
2010
118views more  TASLP 2010»
13 years 9 months ago
Time-Frequency Sparsity by Removing Perceptually Irrelevant Components Using a Simple Model of Simultaneous Masking
We present an algorithm for removing time-frequency components, found by a standard Gabor transform, of a “real-world” sound while causing no audible difference to th...
Péter Balázs, Bernhard Laback, Gerha...
TASLP
2010
13 years 9 months ago
Source/Filter Model for Unsupervised Main Melody Extraction From Polyphonic Audio Signals
Extracting the main melody from a polyphonic music recording seems natural even to untrained human listeners. To a certain extent it is related to the concept of source separat...
Jean-Louis Durrieu, Gaël Richard, Bertrand Da...
TASLP
2010
13 years 9 months ago
Representing Musical Sounds With an Interpolating State Model
A computationally efficient algorithm is proposed for modeling and representing time-varying musical sounds. The aim is to encode individual sounds and not the statistical prop...
Anssi Klapuri, Tuomas Virtanen
TASLP
2010
13 years 9 months ago
High-Pitch Formant Estimation by Exploiting Temporal Change of Pitch
This paper considers the problem of obtaining an accurate spectral representation of speech formant structure when the voicing source exhibits a high fundamental frequency. Our ...
Tianyu T. Wang, Thomas F. Quatieri
TASLP
2010
13 years 9 months ago
On Optimal Frequency-Domain Multichannel Linear Filtering for Noise Reduction
Several contributions have been made so far to develop optimal multichannel linear filtering approaches and show their ability to reduce the acoustic noise. However, th...
Mehrez Souden, Jacob Benesty, Sofiène Affes
TASLP
2010
13 years 9 months ago
New Insights Into the MVDR Beamformer in Room Acoustics
The minimum variance distortionless response (MVDR) beamformer, also known as Capon’s beamformer, is widely studied in the area of speech enhancement. The MVDR beamformer can ...
Emanuël Anco Peter Habets, Jacob Benesty, Isr...