Boosting and Structure Learning in Dynamic Bayesian Networks for Audio-Visual Speaker Detection

16 years 7 months ago

Download www.cc.gatech.edu

Bayesian networks are an attractive modeling tool for human sensing, as they combine an intuitive graphical representation with ef?cient algorithms for inference and learning. Earlier work has demonstrated that boosted parameter learning could be used to improve the performance of Bayesian network classi?ers for complex multi-modal inference problems such as speaker detection. In speaker detection, the goal is to use video and audio cues to infer when a person is speaking to a user interface. In this paper we introduce a new boosted structure learning algorithm based on AdaBoost. Given labeled data, our algorithm modi?es both the network structure and parameters so as to improve classi?cation accuracy. We compare its performance to both standard structure learning and boosted parameter learning on a ?xed structure. We present results for speaker detection and for the UCI "chess" dataset.

Tanzeem Choudhury, James M. Rehg, Vladimir Pavlovi

Real-time Traffic

Complex Multi-modal Inference | Computer Vision | ICPR 2002 | Speaker Detection | Standard Structure Learning |

claim paper

» Multimodal Speaker Detection Using Error Feedback Dynamic Bayesian Networks

» Structure learning in a Bayesian networkbased video indexing framework

» Mocapy A toolkit for inference and learning in dynamic Bayesian networks

» Proactive Network Fault Detection

» OpenCV Open Source Computer Vision Reference Manual

Post Info
More Details (n/a)

Added	09 Nov 2009
Updated	09 Nov 2009
Type	Conference
Year	2002
Where	ICPR
Authors	Tanzeem Choudhury, James M. Rehg, Vladimir Pavlovic, Alex Pentland

Comments (0)

Sciweavers

Boosting and Structure Learning in Dynamic Bayesian Networks for Audio-Visual Speaker Detection

Complex Multi-modal Inference | Computer Vision | ICPR 2002 | Speaker Detection | Standard Structure Learning |

Explore & Download

Productivity Tools

Sciweavers