Using spatial cues for meeting speech segmentation

14 years 6 months ago

Download www.cecs.uci.edu

This work investigates the validity and accuracy of using spatial cues with Time-Delay Estimation (TDE) as a method of segmenting multichannel recorded speech by speaker location. In environments such as meetings where speakers do not signiﬁcantly alter position, segmentation by speaker location essentially leads to segmentation by speaker ‘turn’. The proposed system calculates location information using TDEs and spatial cues extracted from multichannel meeting audio recordings. This location information is then input into a simple segmentation algorithm. Experiments have been performed on both theoretical and real meeting recordings with non-overlapping speakers, and theoretical recordings with overlapping speakers. Segmentation results reveal the most robust cue to be a combination of spatial information and TDEs. This cue combination leads to greater segmentation accuracy for classifying individual speakers and detecting overlapping sections than using spatial cues or time-de...

Eva Cheng, Jason Lukasiak, Ian S. Burnett, David S

Real-time Traffic

ICMCS 2005 | Simple Segmentation Algorithm | Spatial Cues | Speaker Location |

claim paper

Post Info
More Details (n/a)

Added	24 Jun 2010
Updated	24 Jun 2010
Type	Conference
Year	2005
Where	ICMCS
Authors	Eva Cheng, Jason Lukasiak, Ian S. Burnett, David Stirling

Comments (0)

Sciweavers

Using spatial cues for meeting speech segmentation

ICMCS 2005 | Simple Segmentation Algorithm | Spatial Cues | Speaker Location |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers