Abstract. We propose a graph based method to improve the performance of person queries in large news video collections. The method benefits from the multi-modal structure of videos...
In many recent object recognition systems, feature extraction
stages are generally composed of a filter bank, a
non-linear transformation, and some sort of feature pooling
layer...
Kevin Jarrett, Koray Kavukcuoglu, Marc’Aurelio R...
Humans can recognize biological motion from strongly impoverished stimuli, like point-light displays. Although the neural mechanism underlying this robust perceptual process have n...
Rodrigo Sigala, Thomas Serre, Tomaso Poggio, Marti...
A novel approach is presented for estimating human body posture and motion from a video sequence. Human pose is defined as the instantaneous image plane configuration of a singl...
We present a discriminative Hough transform based ob-
ject detector where each local part casts a weighted vote for
the possible locations of the object center. We show that the
...
Subhransu Maji (University of California, Berkeley...