Top-down visual saliency via joint CRF and dictionary learning

13 years 5 months ago

Download faculty.ucmerced.edu

Top-down visual saliency facilities object localization by providing a discriminative representation of target objects and a probability map for reducing the search space. In this paper, we propose a novel top-down saliency model that jointly learns a Conditional Random Field (CRF) and a discriminative dictionary. The proposed model is formulated based on a CRF with latent variables. By using sparse codes as latent variables, we train the dictionary modulated by CRF, and meanwhile a CRF with sparse coding. We propose a max-margin approach to train our model via fast inference algorithms. We evaluate our model on the Graz02 and PASCAL VOC 2007 datasets. Experimental results show that our model performs favorably against the stateof-the-art top-down saliency methods. We also observe that the dictionary update signiﬁcantly improves the model performance.

Jimei Yang, Ming-Hsuan Yang

Real-time Traffic

Computer Vision | Cvpr 2012 | Inference Algorithms | Object Localization | Target Objects |

claim paper

Post Info
More Details (n/a)

Added	28 Sep 2012
Updated	28 Sep 2012
Type	Journal
Year	2012
Where	CVPR
Authors	Jimei Yang, Ming-Hsuan Yang

Comments (0)

Sciweavers

Top-down visual saliency via joint CRF and dictionary learning

Computer Vision | Cvpr 2012 | Inference Algorithms | Object Localization | Target Objects |

Explore & Download

Productivity Tools

Sciweavers