A full-duplex hands-free man/machine interface often suffers from directional non-stationary interference (such as a competing speaker or an echo signal) as well as a stationary int...
Language modeling for an inflected language such as Arabic poses new challenges for speech recognition and machine translation due to its rich morphology. Rich morphology results i...
This paper presents a technique to transform high-effort voices into breathy voices using adaptive pre-emphasis linear prediction (APLP). The primary benefit of this tech...
K. I. Nordstrom, George Tzanetakis, Peter F. Dries...
We present LyricAlly, a prototype that automatically aligns acoustic musical signals with their corresponding textual lyrics, in a manner similar to manually-aligned karaoke. We ta...
Min-Yen Kan, Ye Wang, Denny Iskandar, Tin Lay Nwe,...
Finding a piece of music based on its content is a key problem in music information retrieval. For example, a user may be interested in finding music based on knowledge of only a s...
Iman S. H. Suyoto, Alexandra L. Uitdenbogerd, Falk...