This paper takes phonetic information into account for data alignment in text-independent voice conversion. Hidden Markov Models are used for representing the phonetic structure o...
Meng Zhang, Jiaohua Tao, Jani Nurminen, Jilei Tian...
In this paper, a novel method for speaker adaptation using bilinear model is proposed. Bilinear model can express both characteristics of speakers (style) and phonemes across spea...
A unified framework to jointly solve the two problems of localization and synchronization at the same time is presented in this paper. The joint approach is attractive because it ...
We introduce a missing data recovery methodology based on a weighted least squares iterative adaptive approach (IAA). The proposed method is referred to as the missing-data IAA (M...
Orthogonal frequency division multiplexing (OFDM) is noted for its resistance to narrowband interference when equipped with forward error correction. This technique along with era...
We consider new optimization problems for transceivers with DFE receivers and linear precoders, which also use bit loading at the transmitter. First, we consider the MIMO QoS (qual...
Ching-Chih Weng, Chun-Yang Chen, P. P. Vaidyanatha...
This paper presents a novel social media summarization framework. Summarizing media created and shared in large scale online social networks unfolds challenging research problems....
We investigate how to effectively incorporate spatial structure information into histogram features for boosting visual classification performance motivated by recently proposed M...
In this paper we are interested in non-negative matrix factorization (NMF) with the Itakura-Saito (IS) divergence. Previous work has demonstrated the relevance of this cost functi...
Distributed video coding (DVC) is a recent paradigm which aims at transferring part of the coding complexity from the encoder to the decoder. The performance of such a coding sche...