Sciweavers

467 search results - page 77 / 94
» Phoneme segmentation of speech
ICASSP
2010
IEEE
13 years 2 months ago
Background music identification through content filtering and min-hash matching
A novel framework for background music identification is proposed in this paper. Given an audio signal that mixes background music with speech/noise, we identify the musi...
Chih-Yi Chiu, Dimitrios Bountouridis, Ju-Chiang Wa...
ICASSP
2011
IEEE
12 years 11 months ago
Pitch transposition and breathiness modification using a glottal source model and its adapted vocal-tract filter
The transformation of the voiced segments of a speech recording has many applications such as expressivity synthesis or voice conversion. This paper addresses the pitch transposit...
Gilles Degottex, Axel Röbel, Xavier Rodet
MM
2005
ACM
14 years 1 month ago
Augmented segmentation and visualization for presentation videos
We investigate methods of segmenting, visualizing, and indexing presentation videos by both audio and visual data. The audio track is segmented by speaker, and augmented with key ...
Alexander Haubold, John R. Kender
ICMCS
2008
IEEE
14 years 2 months ago
Accommodating sample size effect on similarity measures in speaker clustering
We investigate the symmetric Kullback-Leibler (KL2) distance in speaker clustering and its unreported effects for differently-sized feature matrices. Speaker data is represented a...
Alexander Haubold, John R. Kender
IVA
2007
Springer
14 years 1 month ago
T2D: Generating Dialogues Between Virtual Agents Automatically from Text
The Text2Dialogue (T2D) system that we are developing allows digital content creators to generate attractive multi-modal dialogues presented by two virtual agents—by simply provi...
Paul Piwek, Hugo Hernault, Helmut Prendinger, Mits...