This paper presents a singing synthesis system, VocaListener2, that can automatically synthesize a singing voice by mimicking the timbre changes of a user’s singing voice. The s...
Language models for speech recognition tend to be brittle across domains, since their performance is vulnerable to changes in the genre or topic of the text on which they are trai...
Mobile voice search provides users an easier way to search for information using voice from mobile devices. Most mobile search applications have access to the latitude/longitude c...
We study efficient algorithms for soft-input soft-output (SISO) encoding of convolutional codes. While the BCJR algorithm has been suggested for SISO encoding, we show that a for...
The development of mobile platform has raised an emergent requirement for face-related multimedia applications. However, as the basis of such applications, face detection and trac...
Feedback of channel state information (CSI) is necessary to achieve high throughput and low outage probability in multiuser multiantenna systems. There are two types of CSI: direc...
For a reproduced sound field, the competing goals between the listening area and reproduction accuracy in an actual environment is one of the most important problems in sound fi...
We describe a new approach for phoneme recognition which aims at minimizing the phoneme error rate. Building on structured prediction techniques, we formulate the phoneme recogniz...
In this paper we advocate a new technique for the fast identi cation of physical objects based on their physical unclonable features (surface microstructures). The proposed identi...
This paper presents a unified model for image editing in terms of Sparse Matrix-Vector (SpMV) multiplication. In our framework, we cast image editing as a linear energy minimizat...