Incremental processing is relevant for language modeling, speech recognition and language generation. In this paper we devise a dynamic version of Tree Adjoining Grammar (DVTAG) th...
We present a framework to apply Volterra series to analyze multilayered perceptrons trained to estimate the posterior probabilities of phonemes in automatic speech recognition. Th...
Joel Pinto, Garimella S. V. S. Sivaram, Hynek Herm...
Automatic Language Identification (LID) in music has received significantly less attention than LID in speech. Here, we study the problem of LID in music videos uploaded on YouT...
Vijay Chandrasekhar, Mehmet Emre Sargin, David A. ...
Creating high-quality multimediapresentationsrequiresmuch skill, time, and effort. This is particularly true when temporal media, such as speech and animation, are involved. We de...
Mukesh Dalal, Steven Feiner, Kathleen McKeown, Shi...
Blind source separation (BSS) is a process to reconstruct source signals from the mixed signals. The standard BSS methods assume a fixed set of stationary source signals with the ...