Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

149

Voted

ICASSP
2010
IEEE

175views Signal Processing» more ICASSP 2010»

Perceptual audio features for unsupervised key-phrase detection

15 years 6 months ago

Perceptual audio features for unsupervised key-phrase detection

Download domino.mpi-inf.mpg.de

We propose a new type of audio feature (HFCC-ENS) as well as an unsupervised method for detecting short sequences of spoken words (key-phrases) within long speech recordings. Our technical contributions are threefold: Firstly, we propose to use bandwidth-adapted ﬁlterbanks instead of classical MFCC-style ﬁlters in the feature extraction step. Secondly, the time resolution of the resulting features is adapted to account for the temporal characteristics of the spoken phrases. Thirdly, the key-phrase detection step is performed by matching sequences of the resulting HFCC-ENS features with features extracted from a target speech recording. We evaluate the proposed method using the German Kiel Corpus and furthermore investigate speech-related properties of the proposed feature.

Dirk von Zeddelmann, Frank Kurth, Meinard Mül

Real-time Traffic

Classical Mfcc-style ﬁlters | Feature Extraction Step | ICASSP 2010 | Long Speech Recordings | Signal Processing |

claim paper

Related Content

» Databionic Visualization of Music Collections According to Perceptual Distance

» Audio retrieval by latent perceptual indexing

» Coherent bagof audio words model for efficient largescale video copy detection

» OpenBliSSART Design and evaluation of a research toolkit for Blind Source Separation in Au...

» A Histogram Algorithm for Fast Audio Retrieval

» Unsupervised distributional anomaly detection for a selfdiagnostic speech activity detecto...

» Detection of audio covert channels using statistical footprints of hidden messages

» Summarizing multiple spoken documents finding evidence from untranscribed audio

» Automatic Speaker Segmentation using Multiple Features and Distance Measures A Comparison ...

Post Info
More Details (n/a)

Added	06 Dec 2010
Updated	06 Dec 2010
Type	Conference
Year	2010
Where	ICASSP
Authors	Dirk von Zeddelmann, Frank Kurth, Meinard Müller

Comments (0)