Local part-based human detectors are capable of handling partial occlusions efficiently and modeling shape articulations flexibly, while global shape template-based human detector...
Zhe Lin, Larry S. Davis, David S. Doermann, Daniel...
It has recently been shown that deformable 3D surfaces
could be recovered from single video streams. However, ex-
isting techniques either require a reference view in which
the ...
Aydin Varol, Mathieu Salzmann, Engin Tola, Pascal ...
Acoustic event detection (AED) aims to identify both timestamps and types of multiple events and has been found to be very challenging. The cues for these events often times exist...
Po-Sen Huang, Xiaodan Zhuang, Mark Hasegawa-Johnso...
The robust localization and tracking of faces in video streams is a fundamental concern for many subsequent multi-modal recognition approaches. Especially in meeting scenarios sev...
Frank Wallhoff, Martin Zobl, Gerhard Rigoll, Igor ...
We introduce a novel energy minimization method to decompose a video into a set of super-resolved moving layers. The proposed energy corresponds to the cost of coding the sequence...