Concurrent estimation of singing voice F0 and phonemes by using spectral envelopes estimated from polyphonic music

13 years 5 months ago

Download mirlab.org

The scarcity of available multi-track recordings constitutes a severe constraint on the training of probabilistic models for voice extraction from polyphonic music. We propose a novel training method to estimate a spectral envelope of a singing voice that makes it possible to train the models from a polyphonic music without segregating a singing voice. We implement this method as an extension to the existing W-PST method, which concurrently estimates singing voice fundamental frequency (F0) and phoneme from polyphonic music. The novel training method is based on random sampling from probabilistic distributions. We conducted experiments on concurrent F0 and phoneme estimation and conﬁrm the effectiveness of our method.

Hiromasa Fujihara, Masataka Goto

Real-time Traffic

Available Multi-track Recordings | ICASSP 2011 | Polyphonic Music | Signal Processing | Singing Voice |

claim paper

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Hiromasa Fujihara, Masataka Goto

Comments (0)

Sciweavers

Concurrent estimation of singing voice F0 and phonemes by using spectral envelopes estimated from polyphonic music

Available Multi-track Recordings | ICASSP 2011 | Polyphonic Music | Signal Processing | Singing Voice |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers