It has long been recognised that interactivity improves the effectiveness of Information Retrieval systems. Speech is the most natural and interactive medium of communication and ...
In this paper, we present a systems approach for channel modeling of an Automatic Speech Recognition (ASR) system. This can have implications in improving speech recognition compo...
Qun Feng Tan, Kartik Audhkhasi, Panayiotis G. Geor...
Phoneme posterior probabilities estimated using Multi-Layer Perceptrons (MLPs) are extensively used both as acoustic scores and features for speech recognition. In this paper we e...
Samuel Thomas, Patrick Nguyen, Geoffrey Zweig, Hyn...
Video contains multiple types of audio and visual information, which are difficult to extract, combine or trade-off in general video information retrieval. This paper provides an ...
Present speech recognition systems are becoming more complex due to technology advances, optimizations and special requirements such as small computation and memory footprints. Pro...