ISMIR
2005
Springer

A Bootstrap Method for Training an Accurate Audio Segmenter

Download ismir2005.ismir.net

Supervised learning can be used to create good systems for note segmentation in audio data. However, this requires a large set of labeled training examples, and hand-labeling is difficult and time-consuming. A bootstrap approach is introduced in which audio alignment techniques are first used to find the correspondence between a symbolic music representation (such as MIDI data) and an acoustic recording. This alignment provides an initial estimate of note boundaries, which can be used to train a segmenter. Once trained, the segmenter can be used to refine the initial set of note boundaries, and training can be repeated. This iterative training process eliminates the need for hand-segmented audio. Tests show that this training method can improve a segmenter initially trained on synthetic data.
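
The train/refine/repeat loop described in the abstract can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration, not the authors' method: the abstract does not specify the segmenter or its features, so a one-parameter threshold detector on a hypothetical per-frame "novelty" value stands in for the trained segmenter, and the names `train_threshold`, `refine`, and `bootstrap`, the window size, and the toy data are all invented here.

```python
from statistics import mean


def train_threshold(novelty, boundaries):
    """Fit a toy one-parameter segmenter (assumed stand-in, not the paper's):
    a threshold halfway between the mean novelty at labeled boundary frames
    and the mean novelty at all other frames."""
    labeled = set(boundaries)
    on = mean(novelty[i] for i in labeled)
    off = mean(v for i, v in enumerate(novelty) if i not in labeled)
    return (on + off) / 2


def refine(novelty, boundaries, threshold, window):
    """Move each boundary to the nearest frame within +/- window whose
    novelty exceeds the threshold; leave it unchanged if no frame does."""
    refined = []
    for b in boundaries:
        lo, hi = max(0, b - window), min(len(novelty) - 1, b + window)
        hits = [i for i in range(lo, hi + 1) if novelty[i] > threshold]
        refined.append(min(hits, key=lambda i: abs(i - b)) if hits else b)
    return refined


def bootstrap(novelty, initial_boundaries, window=3, max_rounds=10):
    """Alternate training and boundary refinement until the labels stop
    changing or max_rounds is reached. Returns the final boundaries and
    the list of label sets seen along the way."""
    boundaries = list(initial_boundaries)
    history = [boundaries]
    for _ in range(max_rounds):
        threshold = train_threshold(novelty, boundaries)
        updated = refine(novelty, boundaries, threshold, window)
        if updated == boundaries:
            break
        boundaries = updated
        history.append(boundaries)
    return boundaries, history


# Toy data (hypothetical): true note onsets at frames 20, 50, 80, 110,
# plus a weak distractor peak at frame 53.
novelty = [0.0] * 140
for onset in (20, 50, 80, 110):
    novelty[onset] = 1.0
novelty[53] = 0.3

# Stand-in for the alignment-derived initial estimate: two onsets are off.
alignment_estimate = [20, 52, 80, 108]

final, history = bootstrap(novelty, alignment_estimate)
```

On this toy input, the first round trains on partly wrong labels, so the learned threshold is low and the second boundary is pulled to the distractor at frame 53. Retraining on the improved labels raises the threshold, and the second round moves that boundary to frame 50. This is only meant to show why repeating the training step can matter; a real segmenter would use richer features and a proper classifier.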

Ning Hu, Roger B. Dannenberg

Audio Alignment Techniques | Information Retrieval | ISMIR 2005 | Note Boundaries | Note Segmentation

Post Info

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	ISMIR
Authors	Ning Hu, Roger B. Dannenberg
