Sciweavers

68 search results - page 5 / 14
» Multimodal Speaker Identification Based on Text and Speech
Sort
View
ICASSP
2010
IEEE
13 years 7 months ago
Robust speaker identification using an auditory-based feature
An auditory-based feature extraction algorithm is presented. The feature is based on a recently published time-frequency transform plus a set of modules to simulate the signal pro...
Qi Li, Yan Huang
TASLP
2002
87views more  TASLP 2002»
13 years 7 months ago
A set of corpus-based text-to-speech synthesis technologies for Mandarin Chinese
This paper presents a set of corpus-based text-to-speech synthesis technologies for Mandarin Chinese. A large speech corpus produced by a single speaker is used, and the speech out...
Fu-Chiang Chou, Chiu-yu Tseng, Lin-Shan Lee
ICDAR
2005
IEEE
14 years 1 months ago
From Searching to Browsing through Multimodal Documents Linking
Relationships that link static documents discussed during meetings to the corresponding speech transcripts can be of various kinds. The most important ones, thematic links, quotat...
Dalila Mekhaldi, Denis Lalanne, Rolf Ingold
ICMCS
2007
IEEE
214views Multimedia» more  ICMCS 2007»
14 years 1 months ago
Exploring Discriminative Learning for Text-Independent Speaker Recognition
Speaker verification is a technology of verifying the claimed identity of a speaker based on the speech signal from the speaker (voice print). To learn the score of similarity be...
Ming Liu, Zhengyou Zhang, Mark Hasegawa-Johnson, T...
INTERSPEECH
2010
13 years 2 months ago
Expectations for discourse genre identification: a prosodic study
Speech can be divided into discourse genres based on the contextual environment it occurs in (e.g. political speech, sport commentary speech, etc.). The present study investigated...
Nicolas Obin, Volker Dellwo, Anne Lacheret, Xavier...