Sciweavers

ISM
2008
IEEE
136views Multimedia» more  ISM 2008»
14 years 1 months ago
Multimodal Speaker Segmentation in Presence of Overlapped Speech Segments
We propose a multimodal speaker segmentation algorithm with two main contributions: First, we suggest a hidden Markov model architecture that performs fusion of the three modaliti...
Viktor Rozgic, Kyu Jeong Han, Panayiotis G. Georgi...
ISM
2008
IEEE
161views Multimedia» more  ISM 2008»
14 years 1 months ago
Deblocking of Block-Transform Compressed Images Using Phase-Adaptive Shifted Thresholding
Many popular image compression schemes are based on block-transform coding, a technique where images are broken into small blocks of pixels prior to transformation and compression...
Alexander Wong, William Bishop
ISM
2008
IEEE
101views Multimedia» more  ISM 2008»
14 years 1 months ago
Photo Context as a Bag of Words
In the recent years, photo context metadata (e.g., date, GPS coordinates) have been proved to be useful in the management of personal photos. However, these metadata are still poo...
Windson Viana, Samira Hammiche, Marlène Vil...
ISM
2008
IEEE
166views Multimedia» more  ISM 2008»
14 years 1 months ago
Extended Macroblock Bipartitioning Modes for H.264/AVC Inter Coding
We present a family of new macroblock partitions for H.264/AVC inter prediction. These modes allow a macroblock to be bipartitioned along a horizontal, vertical, or diagonal edge ...
Kenneth Vermeirsch, Jan De Cock, Stijn Notebaert, ...
ISM
2008
IEEE
153views Multimedia» more  ISM 2008»
14 years 1 months ago
Extended Interface Solutions for Musical Robotics
We present a framework for coupling musical robots with interfaces based on open-ended control architecture, allowing for new and expanded forms of expression. The MahaDeviBot all...
Owen Vallis, Jordan Hochenbaum, Ajay Kapur
ISM
2008
IEEE
111views Multimedia» more  ISM 2008»
14 years 1 months ago
Secure and Low Cost Selective Encryption for JPEG2000
Selective encryption is a new trend in content protection. It aims at reducing the amount of data to encrypt while achieving a sufficient and inexpensive security. This approach ...
Ayoub Massoudi, Frédéric Lefè...
ISM
2008
IEEE
100views Multimedia» more  ISM 2008»
14 years 1 months ago
Pseudo-3D Video Conferencing with a Generic Webcam
Chris Harrison, Scott E. Hudson
ISM
2008
IEEE
79views Multimedia» more  ISM 2008»
14 years 1 months ago
Web Lectures and Web 2.0
At many universities, web lectures have become an integral part of the e-learning portfolio over the last few years. While many aspects of the technology involved, like automatic ...
Markus Ketterl, Robert Mertens, Oliver Vornberger
ISM
2008
IEEE
194views Multimedia» more  ISM 2008»
14 years 1 months ago
Spoken Term Detection Using Visual Spectrogram Matching
This work proposes a novel spoken term detection technique, where the query is in audio format. Detection and retrieval are performed by matching the spectrograms of the spoken do...
Nevena Lazic, Parham Aarabi
ISM
2008
IEEE
134views Multimedia» more  ISM 2008»
14 years 1 months ago
GeM-Tree: Towards a Generalized Multidimensional Index Structure Supporting Image and Video Retrieval
In this paper, we propose a tree-based multidimensional structure, GeM-Tree, which indexes both images and videos within a single general framework utilizing Earth Mover’s Dista...
Kasturi Chatterjee, Shu-Ching Chen