Context adaptive training with factorized decision trees for HMM-based speech synthesis

15 years 1 months ago

Download mi.eng.cam.ac.uk

To achieve natural high quality synthesised speech in HMMbased speech synthesis, the effective modelling of complex acoustic and linguistic contexts is critical. Traditional approaches use context-dependent HMMs with decision tree based parameter clustering to model the full combination of contexts. However, weak contexts, such as word-level emphasis in neutral speech, are difficult to capture using this approach. To effectively model weak contexts and reduce the data sparsity problem, weak and normal contexts should be treated independently. Context adaptive training provides a structured framework for this whereby standard HMMs represent normal contexts and linear transforms represent additional effects of weak contexts. In contrast to speaker adaptive training, separate decision trees have to be built for the weak and normal context factors. This paper describes the general framework of context adaptive training and investigates three concrete forms: MLLR, CMLLR and CAT based syste...

Kai Yu, Heiga Zen, François Mairesse, Steve

Real-time Traffic

Context Adaptive Training | INTERSPEECH 2010 | Normal Context | Signal Processing | Weak Contexts |

claim paper

Added	18 May 2011
Updated	18 May 2011
Type	Journal
Year	2010
Where	INTERSPEECH
Authors	Kai Yu, Heiga Zen, François Mairesse, Steve Young

Sciweavers

Context adaptive training with factorized decision trees for HMM-based speech synthesis

Context Adaptive Training | INTERSPEECH 2010 | Normal Context | Signal Processing | Weak Contexts |

Explore & Download

Productivity Tools

Sciweavers