
MM
2010
ACM

Multimodal location estimation

In this article, we define a multimedia content analysis problem that we call multimodal location estimation: given a video, image, or audio file, the task is to determine where it was recorded. A single indication, such as a unique landmark, might already pinpoint a location precisely. In most cases, however, a combination of evidence from the visual and the acoustic domain will only narrow down the set of possible answers. Therefore, approaches to this task should be inherently multimedia. While the task is hard, and in fact sometimes unsolvable, large amounts of training data can be obtained from the Internet. Moreover, even partially successful automatic estimation of location opens up new possibilities in video content matching, archiving, and organization. It could revolutionize law enforcement and computer-aided intelligence agency work, especially since both semi-automatic and fully automatic approaches would be possible. In this article, we describe our idea of growing multim...
Gerald Friedland, Oriol Vinyals, Trevor Darrell
Added 06 Dec 2010
Updated 06 Dec 2010
Type Conference
Year 2010
Where MM
Authors Gerald Friedland, Oriol Vinyals, Trevor Darrell