Boosting multi-modal camera selection with semantic features

15 years 4 months ago

Download www.mmk.ei.tum.de

In this work semantic features are used to improve the results of the camera selection. These semantic features are group action, person action and person speaking. For this purpose low level acoustic and visual features are combined with high level semantic ones. After the feature fusion, a segmentation and classification are performed by Hidden Markov Models. The evaluation shows that an absolute improvement of 6.5% can be achieved. The frame error rate is reduced to 38.1% by using acoustic and all semantic features. The best model using only low level features achieves a frame error rate of 44.6%, which is the best one reported on this data set.

Benedikt Hörnler, Dejan Arsic, Björn Sch

Real-time Traffic

Frame Error Rate | ICMCS 2009 | Low Level | Multimedia | Semantic Features |

claim paper

» A Framework for Feature Selection for Background Subtraction

» Feature Selection and Generalisation for Retrieval of Textual Cases

» Exploiting Concept Association to Boost Multimedia Semantic Concept Detection

» Mining Compositional Features From GPS and Visual Cues for Event Recognition in Photo Coll...

» Camera ViewBased American Football Video Analysis

» Attention region selection with information from professional digital camera

» Learning to associate HybridBoosted multitarget tracker for crowded scene

» TextonBoost Joint Appearance Shape and Context Modeling for Multiclass Object Recognition ...

Post Info
More Details (n/a)

Added	19 Feb 2011
Updated	19 Feb 2011
Type	Journal
Year	2009
Where	ICMCS
Authors	Benedikt Hörnler, Dejan Arsic, Björn Schuller, Gerhard Rigoll

Comments (0)

Sciweavers

Boosting multi-modal camera selection with semantic features

Frame Error Rate | ICMCS 2009 | Low Level | Multimedia | Semantic Features |

Explore & Download

Productivity Tools

Sciweavers