Speech can be represented as a time/frequency distribution of energy using a multi-band filter bank. A Markov random field model, which takes into account the possible time asynch...
Real-time speaker verification, with speech acquired using the NIST Mk-III microphone array and an autodirective beamforming algorithm, is demonstrated. The software and hardware...
Gang Mei, Roger Xu, Debang Lao, Chiman Kwan, Vince...
Natural sounds are structured on many time-scales. A typical segment of speech, for example, contains features that span four orders of magnitude: Sentences (∼1 s); phonemes (∼...
In this study, the generalized parametric spectral subtraction estimator is employed in the context of a ROVER speech enhancement framework to develop a robust phoneme class selec...
User modelling is widely used in HCI but there are very few systematic HCI modelling tools for people with disabilities. We are developing user models to help with the design and ...