A new approach to cross-modal multimedia retrieval

15 years 7 months ago

Download www.svcl.ucsd.edu

The problem of joint modeling the text and image components of multimedia documents is studied. The text component is represented as a sample from a hidden topic model, learned with latent Dirichlet allocation, and images are represented as bags of visual (SIFT) features. Two hypotheses are investigated: that 1) there is a benefit to explicitly modeling correlations between the two components, and 2) this modeling is more effective in feature spaces with higher levbstraction. Correlations between the two components are learned with canonical correlation analysis. Abstraction is achieved by representing text and images at a more general, semantic level. The two hypotheses are studied in the context of the task of cross-modal document retrieval. This includes retrieving the text that most closely matches a query image, or retrieving the images that most closely match a query text. It is shown that accounting for crossrrelations and semantic abstraction both improve retrieval accuracy. T...

Nikhil Rasiwasia, Jose Costa Pereira, Emanuele Cov

Real-time Traffic

Cross-modal Document Retrieval | Image | Latent Dirichlet Allocation | MM 2010 | Multimedia |

claim paper

» Design of a CrossMedia Indexing System

» Contentbased image retrieval approaches and trends of the new age

» The MPEG Query Format a new standard for querying digital content Usage in scholarly lite...

» Overview of VideoCLEF 2009 New Perspectives on SpeechBased Multimedia Content Enrichment

» XCRAB A Content and AnnotationBased Multimedia Indexing and Retrieval System

» A New Text Detection Algorithm for ContentOriented Line Drawing Image Retrieval

» Information Access in Multimedia Databases Based on Feature Models

» Distinct nearest neighbors queries for similarity search in very large multimedia database...

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	MM
Authors	Nikhil Rasiwasia, Jose Costa Pereira, Emanuele Coviello, Gabriel Doyle, Gert R. G. Lanckriet, Roger Levy, Nuno Vasconcelos

Comments (0)

Sciweavers

A new approach to cross-modal multimedia retrieval

Cross-modal Document Retrieval | Image | Latent Dirichlet Allocation | MM 2010 | Multimedia |

Explore & Download

Productivity Tools

Sciweavers