Sciweavers

ICASSP
2010
IEEE
13 years 7 months ago
Statistical parametric speech synthesis based on product of experts
Heiga Zen, Mark J. F. Gales, Yoshihiko Nankaku, Ke...
ICASSP
2010
IEEE
13 years 7 months ago
Joint estimate of shape and time-synchronization of a glottal source model by phase flatness
A new method is proposed to jointly estimate the shape parameter of a glottal model and its time position in a voiced segment. We show that, the idea of phase flatness (or phase ...
Gilles Degottex, Axel Röbel, Xavier Rodet
ICASSP
2010
IEEE
13 years 7 months ago
Distributed Lasso for in-network linear regression
The least-absolute shrinkage and selection operator (Lasso) is a popular tool for joint estimation and continuous variable selection, especially well-suited for the under-determin...
Juan Andrés Bazerque, Gonzalo Mateos, Georg...
ICASSP
2010
IEEE
13 years 7 months ago
Flexcode - flexible audio coding
Modern networks are highly variable and, as a result, source coders are commonly used under conditions that they were not designed for. We address this problem with a source-codin...
Janusz Klejsa, Minyue Li, W. Bastiaan Kleijn
ICASSP
2010
IEEE
13 years 7 months ago
Automatic matched filter recovery via the audio camera
The sound reaching the acoustic sensor in a realistic environment contains not only the part arriving directly from the sound source but also a number of environmental re ections....
Adam O'Donovan, Ramani Duraiswami, Dmitry N. Zotki...
ICASSP
2010
IEEE
13 years 7 months ago
Fast semi-supervised image segmentation by novelty selection
The goal of semi-supervised image segmentation is to obtain the segmentation from a partially labeled image. By utilizing the image manifold structure in labeled and unlabeled pix...
António R. C. Paiva, Tolga Tasdizen
ICASSP
2010
IEEE
13 years 7 months ago
Collaborative spectrum sensing from sparse observations using matrix completion for cognitive radio networks
— In cognitive radio, spectrum sensing is a key component to detect spectrum holes (i.e., channels not used by any primary users). Collaborative spectrum sensing among the cognit...
Jia Meng, Wotao Yin, Husheng Li, Ekram Hossain, Zh...
ICASSP
2010
IEEE
13 years 7 months ago
Improved statistical models for SMT-based speaking style transformation
Automatic speech recognition (ASR) results contain not only ASR errors, but also disfluencies and colloquial expressions that must be corrected to create readable transcripts. We...
Graham Neubig, Yuya Akita, Shinsuke Mori, Tatsuya ...
ICASSP
2010
IEEE
13 years 7 months ago
Subspace Gaussian Mixture Models for speech recognition
We describe an acoustic modeling approach in which all phonetic states share a common Gaussian Mixture Model structure, and the means and mixture weights vary in a subspace of the...
Daniel Povey, Lukas Burget, Mohit Agarwal, Pinar A...
ICASSP
2010
IEEE
13 years 7 months ago
Using cross-decoder phone coocurrences in phonotactic language recognition
Phonotactic language recognizers are based on the ability of phone decoders to produce phone sequences containing acoustic, phonetic and phonological information, which is partial...
Mikel Peñagarikano, Amparo Varona, Luis Jav...