In recent years, the proliferation of VOIP data has created a number of applications in which it is desirable to perform quick online classification and recognition of massive voi...
Discriminatory information about person identity is multimodal. Yet, most person recognition systems are unimodal, e.g. the use of facial appearance. With a view to exploiting the ...
Niall A. Fox, Ralph Gross, Jeffrey F. Cohn, Richar...
This paper presents a new wavelet based retrieval approach based on Spherically Invariant Random Vector (SIRV) modeling of wavelet subbands. Under this multivariate model, wavelet...
Audio segmentation has applications in a variety of contexts, such as audio information retrieval, automatic sound analysis, and as a pre-processing step in speech recognition. Ex...
Tara N. Sainath, Dimitri Kanevsky, Giridharan Iyen...
In this paper we evaluate the effect of the emotional state of a speaker when text-independent speaker identification is performed. The spectral features used for speaker recogni...
Marius Vasile Ghiurcau, Corneliu Rusu, Jaakko Asto...