This paper presents a bottom-up approach that combines audio and video to simultaneously locate individual speakers in the video (2-D source localization) and segment their speech ...
Multiview video coding (MVC) is currently being standardized by the Joint Video Team as an extension of H264/AVC. When an MVC bitstream is decoded, some views (named target views)...
Ying Chen, Ye-Kui Wang, Miska M. Hannuksela, Monce...
— In this paper, we propose a novel approach for the visual navigation of unmanned aerial vehicles (UAV). In contrast to most available methods, a single perspective camera is us...
Chunrong Yuan, Fabian Recktenwald, Hanspeter A. Ma...
VXL is a collection of C++ libraries designed for computer vision research and implementation. VXL is written in ANSI/ISO C++ and is designed to be portable over many platforms. Th...
— Ubiquitous image processing tasks (such as transform decompositions, filtering and motion estimation) do not currently provide graceful degradation when their clock-cycles budg...