In this paper, we investigate the use of bilingual parsing on parallel corpora to better estimate the rule parameters in a formal syntax-based machine translation system, which ar...
Children need to master reading letter-names and letter-sounds before reading phrases and sentences. Pronunciation assessment of letter-names and letter-sounds read aloud is an imp...
Matthew Black, Joseph Tepperman, Abe Kazemzadeh, S...
We provide an amplitude-phase representation of the dual-tree complex wavelet transform by extending the fixed quadrature relationship of the dual-tree wavelets to arbitrary phas...
Speaker diarization is originally defined as the task of determining “who spoke when” given an audio track and no other prior knowledge of any kind. The following article sho...
We consider streaming video content over an overlay network of peer nodes. Each of the nodes employs a mesh-pull mechanism to organize the download of data units from its neighbou...
In this paper, we present a novel speech-rhythm-guided syllable-nuclei location detection algorithm. As a departure from conventional methods, we introduce an instantaneous speech ...
We propose a new active learning algorithm to address the problem of selecting a limited subset of utterances for transcribing from a large amount of unlabeled utterances so that ...
Balakrishnan Varadarajan, Dong Yu, Li Deng, Alex A...
In this paper, we address the problem of super-resolution from multiple low-resolution omnidirectional images with inexact registration. Such a problem is typically encountered in...