A Bootstrap Method for Training an Accurate Audio Segmenter



Supervised learning can be used to create good systems for note segmentation in audio data. However, this requires a large set of labeled training examples, and hand-labeling is quite difficult and time-consuming. A bootstrap approach is introduced in which audio alignment techniques are first used to find the correspondence between a symbolic music representation (such as MIDI data) and an acoustic recording. This alignment provides an initial estimate of note boundaries, which can be used to train a segmenter. Once trained, the segmenter can be used to refine the initial set of note boundaries, and training can be repeated. This iterative training process eliminates the need for hand-segmented audio. Tests show that this training method can improve a segmenter initially trained on synthetic data.
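The iterative loop described in the abstract can be illustrated with a toy sketch. This is not the authors' segmenter or alignment method: the one-dimensional "onset strength" feature, the nearest-mean scoring rule, and all function names (`train_segmenter`, `refine_boundaries`, `bootstrap`) are assumptions made purely for illustration. The sketch only shows the control flow: start from rough boundaries (standing in for an alignment result), train on them, use the trained model to move each boundary within a small window, and repeat.

```python
def train_segmenter(features, boundaries):
    """Toy stand-in for a segmenter: mean feature at labeled boundary
    frames versus the mean over all other frames (hypothetical model)."""
    labeled = set(boundaries)
    pos = [f for i, f in enumerate(features) if i in labeled]
    neg = [f for i, f in enumerate(features) if i not in labeled]
    return sum(pos) / len(pos), sum(neg) / len(neg)


def boundary_score(model, value):
    """Nearest-mean score: positive when the value is closer to the
    boundary mean than to the non-boundary mean."""
    pos_mean, neg_mean = model
    return (value - neg_mean) ** 2 - (value - pos_mean) ** 2


def refine_boundaries(features, boundaries, model, radius):
    """Move each boundary to the best-scoring frame within +/- radius."""
    refined = []
    for b in boundaries:
        lo = max(0, b - radius)
        hi = min(len(features) - 1, b + radius)
        best = max(range(lo, hi + 1),
                   key=lambda i: boundary_score(model, features[i]))
        refined.append(best)
    return refined


def bootstrap(features, initial_boundaries, iterations=3, radius=3):
    """Alternate between training on the current boundaries and
    refining the boundaries with the freshly trained model."""
    boundaries = list(initial_boundaries)
    for _ in range(iterations):
        model = train_segmenter(features, boundaries)
        boundaries = refine_boundaries(features, boundaries, model, radius)
    return boundaries


# Synthetic example: triangular onset-strength bumps at frames 10, 30, 50.
true_onsets = [10, 30, 50]
features = [max(max(0.0, 1.0 - abs(i - t) / 4.0) for t in true_onsets)
            for i in range(60)]

# Rough initial estimates, standing in for boundaries from an alignment.
initial = [12, 28, 51]
refined = bootstrap(features, initial)
print(refined)
```

In this toy setting the rough estimates are close enough to the true onsets that the first training pass already learns a useful boundary-versus-non-boundary distinction, which is the property the bootstrap idea relies on. With real audio, the feature extraction and the segmenter would be far richer than this sketch.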

Ning Hu, Roger B. Dannenberg


Audio Alignment Techniques | Information Retrieval | ISMIR 2005 | Note Boundaries | Note Segmentation


Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	ISMIR
Authors	Ning Hu, Roger B. Dannenberg
