Parallel model combination and word recognition in soccer audio

14 years 7 months ago

Download personal.ee.surrey.ac.uk

The audio scene from broadcast soccer can be used for identifying highlights from the game. Audio cues derived from these sources provide valuable information about game events, as can the detection of key words used by the commentators. In this paper we interpret the feasibility of incorporating both commentator word recognition and information about the additive background noise in an HMM structure. A limited set of audio cues, which have been extracted from data collected from the 2006 FIFA World Cup, are used to create an extension to the Aurora-2 database. The new database is then tested with various PMC models and compared to the standard baseline, clean and multi-condition training methods. It is found that incorporating SNR and noise type information into the PMC process is beneﬁcial to recognition performance.

Jack H. Longton, Philip J. B. Jackson

Real-time Traffic

Additive Background Noise | Audio Cues | Commentator Word Recognition | ICMCS 2008 | Multimedia |

claim paper

Post Info
More Details (n/a)

Added	30 May 2010
Updated	30 May 2010
Type	Conference
Year	2008
Where	ICMCS
Authors	Jack H. Longton, Philip J. B. Jackson

Comments (0)

Sciweavers

Parallel model combination and word recognition in soccer audio

Additive Background Noise | Audio Cues | Commentator Word Recognition | ICMCS 2008 | Multimedia |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers