ISMIR
2005
Springer

A Bootstrap Method for Training an Accurate Audio Segmenter

Download ismir2005.ismir.net

Supervised learning can be used to create good systems for note segmentation in audio data. However, this requires a large set of labeled training examples, and hand-labeling is quite difficult and time-consuming. A bootstrap approach is introduced in which audio alignment techniques are first used to find the correspondence between a symbolic music representation (such as MIDI data) and an acoustic recording. This alignment provides an initial estimate of note boundaries which can be used to train a segmenter. Once trained, the segmenter can be used to refine the initial set of note boundaries and training can be repeated. This iterative training process eliminates the need for hand-segmented audio. Tests show that this training method can improve a segmenter initially trained on synthetic data.
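
The train/refine/retrain loop described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify the segmenter or its features, so everything here is an assumption made for illustration. The "segmenter" is a deliberately trivial nearest-centroid model over a single made-up per-frame feature, the "alignment" output is simply a hand-written list of slightly wrong frame indices, and the names `train_segmenter`, `boundary_score`, `refine`, and `bootstrap` are hypothetical.

```python
from statistics import mean


def train_segmenter(features, boundaries):
    """Fit a toy nearest-centroid model (an assumption, not the paper's model):
    one centroid for frames currently labeled as boundaries, one for the rest."""
    labeled = set(boundaries)
    pos = [x for i, x in enumerate(features) if i in labeled]
    neg = [x for i, x in enumerate(features) if i not in labeled]
    return mean(pos), mean(neg)


def boundary_score(model, x):
    """Positive when x is closer to the boundary centroid than to the other one."""
    mu_pos, mu_neg = model
    return (x - mu_neg) ** 2 - (x - mu_pos) ** 2


def refine(features, boundaries, model, window):
    """Move each boundary to the best-scoring frame within +/- window frames."""
    refined = []
    for b in boundaries:
        lo = max(0, b - window)
        hi = min(len(features), b + window + 1)
        refined.append(
            max(range(lo, hi), key=lambda i: boundary_score(model, features[i]))
        )
    return refined


def bootstrap(features, initial_boundaries, window=2, max_iters=10):
    """Alternate training and refinement until the boundaries stop moving."""
    boundaries = list(initial_boundaries)
    for _ in range(max_iters):
        model = train_segmenter(features, boundaries)
        new_boundaries = refine(features, boundaries, model, window)
        if new_boundaries == boundaries:
            break
        boundaries = new_boundaries
    return boundaries


# Synthetic one-dimensional "onset strength" per frame (made-up data):
# a peak of 1.0 at each true boundary, 0.4 on its neighbors, 0.1 elsewhere.
features = [0.1] * 30
for b in (5, 14, 23):
    features[b - 1] = features[b + 1] = 0.4
    features[b] = 1.0

# Stand-in for alignment-derived estimates: one exact, one off by 1, one off by 3.
refined = bootstrap(features, initial_boundaries=[5, 13, 26])
print(refined)
```

On this toy input the estimate that starts three frames away needs two refinement passes to reach its peak, because the search window only spans two frames on each side. In a realistic setting the model would be a trained classifier over acoustic features, and retraining on the refined labels is what would change the scores from one pass to the next.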

Ning Hu, Roger B. Dannenberg

Audio Alignment Techniques | Information Retrieval | ISMIR 2005 | Note Boundaries | Note Segmentation

Related Content

» A framework for classification and segmentation of massive audio data streams

» Accurate repeat finding and object skipping using fingerprints

» Audio Features for Noisy Sound Segmentation

» A bootstrapping approach to annotating large image collection

» Homogeneous segmentation and classifier ensemble for audio tag annotation and retrieval

» Using Virtual Humans to Bootstrap the Creation of Other Virtual Humans

» Detecting bandlimited audio in broadcast television shows

» A Statistical Model for Domain-Independent Text Segmentation

» Lexicalized Phonotactic Word Segmentation

Post Info

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	ISMIR
Authors	Ning Hu, Roger B. Dannenberg
