Sciweavers

964 search results - page 50 / 193
» A multimedia retrieval system using speech input
Sort
View
133
Voted
ISM
2008
IEEE
136views Multimedia» more  ISM 2008»
15 years 10 months ago
Multimodal Speaker Segmentation in Presence of Overlapped Speech Segments
We propose a multimodal speaker segmentation algorithm with two main contributions: First, we suggest a hidden Markov model architecture that performs fusion of the three modaliti...
Viktor Rozgic, Kyu Jeong Han, Panayiotis G. Georgi...
ANLP
2000
109views more  ANLP 2000»
15 years 5 months ago
Predicting Automatic Speech Recognition Performance Using Prosodic Cues
In spoken dialogue systems, it is important for a system to know how likely a speech recognition hypothesis is to be correct, so it can reprompt for fresh input, or, in cases wher...
Diane J. Litman, Julia Hirschberg, Marc Swerts
141
Voted
ICPR
2008
IEEE
16 years 5 months ago
Fuzzy rule selection using Iterative Rule Learning for speech data classification
Fuzzy rule base systems have been successfully used for pattern classification. These systems focus on generating a rule-base from numerical input data. The resulting rule-base ca...
Bin Ma, Chng Eng Siong, Haizhou Li, Omid Dehzangi
ICMCS
2005
IEEE
100views Multimedia» more  ICMCS 2005»
15 years 9 months ago
Projekt Quebex: A Query by Example System for Audio Retrieval
This paper describes an audio retrieval system,Quebex,that works on raw audio data. The system is able to retrieve songs that are rhythmically and timbrewise similar from a databa...
Balaji Thoshkahna, K. R. Ramakrishnan
130
Voted
CIKM
2007
Springer
15 years 10 months ago
Latent semantic fusion model for image retrieval and annotation
This paper studies the effect of Latent Semantic Analysis (LSA) on two different tasks: multimedia document retrieval (MDR) and automatic image annotation (AIA). The contributio...
Trong-Ton Pham, Nicolas Maillot, Joo-Hwee Lim, Jea...