Low-latency online speaker tracking on the AMI Corpus of meeting conversations

14 years 1 months ago

Download gtts.ehu.es

Ambient Inteligence aims to create smart spaces providing services in a transparent and non-intrusive fashion, so context awareness and user adaptation are key issues. Speech can be exploited for user adaptation in such scenarios by continuously tracking speaker identity. However, most speaker tracking approaches require processing the full audio recording before determining speaker turns, which makes them unsuitable for online processing and low-latency decision-making. In this work a low-latency speaker tracking system is presented, which deals with continuous audio streams and outputs decisions at one-second intervals, by scoring fixed-length audio segments with a set of target speaker models. A smoothing technique is explored, based on the scores of past segments, which increases the robustness of tracking decisions to local variability. Experimental results are reported on the AMI Corpus of meeting conversations, revealing the effectiveness of the proposed approach when compared ...

Maider Zamalloa, Luis Javier Rodríguez-Fuen

Real-time Traffic

ICASSP 2010 | Low-latency Speaker | Signal Processing | Speaker Identity | Target Speaker Models |

claim paper

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Maider Zamalloa, Luis Javier Rodríguez-Fuentes, Germán Bordel, Mikel Peñagarikano, Juan Pedro Uribe

Comments (0)

Sciweavers

Low-latency online speaker tracking on the AMI Corpus of meeting conversations

ICASSP 2010 | Low-latency Speaker | Signal Processing | Speaker Identity | Target Speaker Models |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers