We present a novel approach to representing transients using spectral-domain amplitude-modulated/frequency-modulated (AM-FM) functions. The model is applied to the real and imaginary...
This paper takes phonetic information into account for data alignment in text-independent voice conversion. Hidden Markov Models are used for representing the phonetic structure o...
Meng Zhang, Jiaohua Tao, Jani Nurminen, Jilei Tian...
This paper discusses the problem of learning language from unprocessed text and speech signals, concentrating on the problem of learning a lexicon. In particular, it argues for a ...
There are numerous models of varying complexities which seek to efficiently represent the voice source signal. These models are typically based on data and observations which can...
JAVOX provides a mechanism for the development of spoken-language systems from existing desktop applications. We present an architecture that allows existing Java programs to be ...