In this paper, we present our recent studies of F0 estimation from the surface electromyographic (EMG) data using a Gaussian mixture model (GMM)-based voice conversion (VC) techni...
Keigo Nakamura, Matthias Janke, Michael Wand, Tanj...
Feature-space transforms such as feature-space maximum likelihood linear regression (FMLLR) are very effective speaker adaptation technique, especially on mismatched test data. In...
Jing Huang, Karthik Visweswariah, Peder A. Olsen, ...
An accurate identification dialog acts (DAs), which represent the illocutionary aspect of communication, is essential to support the understanding of human conversations. This re...
Silvia Quarteroni, Alexei V. Ivanov, Giuseppe Ricc...
We present an approach for dictionary learning of action attributes via information maximization. We unify the class distribution and appearance information into an objective func...
We present an active learning approach to choose image annotation requests among both object category labels and the objects’ attribute labels. The goal is to solicit those labe...