Using F0 to constrain the unit selection Viterbi network

14 years 11 months ago

Download mirlab.org

The goal of the work described here is to limit the computation needed in unit selection Viterbi search for text-to-speech synthesis. The broader goal is to improve speech quality through the practical use of signiﬁcantly larger databases. We focus in this paper on trying to reduce the number of concatenation cost calculations. By making certain weak assumptions about f0 distributions we estimate that only a fraction of possible concatenations are relevant. A method for selecting the relevant concatenations by imposing an ordering constraint on candidate units is proposed. The ordering is based on unit f0 value(s). Strengths and weaknesses of this approach are discussed and data is presented about calculation complexity compared with naive Viterbi search. A listening test was conducted to investigate the effect on synthesis quality under various conﬁgurations of algorithm and database.

Alistair Conkie, Ann K. Syrdal

Real-time Traffic

Certain Weak Assumptions | ICASSP 2011 | Signal Processing | Unit Selection Viterbi | Viterbi Search |

claim paper

» Key management in wireless sensor networks with internetwork sensor roaming

» Decomposition Principles and Online Learning in CrossLayer Optimization for DelaySensitive...

Post Info
More Details (n/a)

Added	21 Aug 2011
Updated	21 Aug 2011
Type	Journal
Year	2011
Where	ICASSP
Authors	Alistair Conkie, Ann K. Syrdal

Comments (0)

Sciweavers

Using F0 to constrain the unit selection Viterbi network

Certain Weak Assumptions | ICASSP 2011 | Signal Processing | Unit Selection Viterbi | Viterbi Search |

Explore & Download

Productivity Tools

Sciweavers