This paper investigates the problem of incorporating auxiliary information (e.g. pitch) for speech recognition using dynamic Bayesian networks (DBNs). Previous works usually model...
While developing lexical resources for a particular language variety (Viennese), we experimented with a set of 5 different phonetic encodings, termed phone sets, used for unit sel...
Video contains multiple types of audio and visual information, which are difficult to extract, combine or trade off in general video information retrieval. This paper provides an ...
In this paper, we present a novel hybrid architecture for continuous speech recognition systems. It consists of a continuous HMM system extended by an arbitrary neural network that is ...
We describe a new approach to speech recognition, in which all Hidden Markov Model (HMM) states share the same Gaussian Mixture Model (GMM) structure with the same number of Gauss...
Daniel Povey, Lukas Burget, Mohit Agarwal, Pinar A...