Sciweavers

964 search results - page 53 / 193
» A multimedia retrieval system using speech input
MIR
2003
ACM
169 views, Multimedia
15 years 9 months ago
Design, implementation and testing of an interactive video retrieval system
In this paper we present and discuss the system we developed for the search task of TRECVID 2002, and its evaluation in an interactive search task. To do this we will look at ...
Georgina Gaughan, Alan F. Smeaton, Cathal Gurrin, ...
ICASSP
2011
IEEE
14 years 7 months ago
Spectral magnitude minimum mean-square error binary masks for DFT based speech enhancement
Originally, ideal binary mask (idbm) techniques have been used as a tool for studying aspects of the auditory system. More recently, idbm techniques have been adapted to the pract...
Jesper Jensen, Richard C. Hendriks
MM
2009
ACM
177 views, Multimedia
15 years 10 months ago
Transfer non-metric measures into metric for similarity search
Similarity search is widely used in multimedia retrieval systems to find the most similar ones for a given object. Some similarity measures, however, are not metric, leading to e...
Danzhou Liu, Kien A. Hua
COST
2008
Springer
136 views, Multimedia
15 years 5 months ago
Articulatory Synthesis of Speech and Singing: State of the Art and Suggestions for Future Research
Articulatory synthesis of speech and singing aims for modeling the production process of speech and singing as human-like or natural as possible. The state of the art is described ...
Bernd J. Kröger, Peter Birkholz
COGSCI
2002
99 views
15 years 3 months ago
Learning words from sights and sounds: a computational model
This paper presents an implemented computational model of word acquisition which learns directly from raw multimodal sensory input. Set in an information theoretic framework, the ...
Deb Roy, Alex Pentland