Multistream speaker diarization through Information Bottleneck system outputs combination

13 years 4 months ago

Download mirlab.org

Speaker diarization of meetings recorded with Multiple Distant Microphones makes extensive use of multiple feature streams like MFCC and Time Delay of Arrivals (TDOA). Typically the combination happens using separate models for each feature stream. This work investigates if the combination of multiple feature streams can happen through the combination of multiple diarization systems performed using those features. The paper extends the previously proposed Information Bottleneck method to handle the combination of several probabilistic diarization outputs. In contrast to the conventional model-based feature combination, this technique is referred as system-based combination. Furthermore the paper introduces an hybrid model-system combination. Experiments are run on data from the Rich Transcription campaigns and show that the system based combination largely outperforms the model based combination by 37% relative. The hybrid approaches improve by 10 − 20%. The analysis of errors shows...

Deepu Vijayasenan, Fabio Valente, Petr Motlí

Real-time Traffic

Feature Streams | ICASSP 2011 | Model-based Feature Combination | Multiple Feature Streams | Signal Processing |

claim paper

Post Info
More Details (n/a)

Added	20 Aug 2011
Updated	20 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Deepu Vijayasenan, Fabio Valente, Petr Motlícek

Comments (0)

Sciweavers

Multistream speaker diarization through Information Bottleneck system outputs combination

Feature Streams | ICASSP 2011 | Model-based Feature Combination | Multiple Feature Streams | Signal Processing |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers