In this paper, we present our system used and evaluated on the CLEAR’07 benchmarks, both on single- and multi-view head pose estimation. The benchmarks show a high contrast in the application domain: whereas the single-view task provides meeting recordings involving high-quality captures of the participants, the multi-view benchmark targets at low-quality, unobtrusive observations of people by means of multiple cameras in an unconstrained scenario. We show that our system performs with state-of-the-art results under both conditions.