The use of block transforms for coding intra-frames in video coding may preclude higher coding performance due to residual correlation across block boundaries and insufficient en...
Dynamic Bayesian Networks (DBNs) have been widely studied in multi-modal speech recognition applications. Here, we introduce DBNs into an acoustically-driven talking face synthesi...
Jianxia Xue, Jonas Borgstrom, Jintao Jiang, Lynne ...
In this paper, we introduce the design of a Personalized Education (PE) search approach that employs multiple ontologies to automatically generate queries for educational resource...
In this paper we present a clustering-based method for representing semantic concepts on multimodal low-level feature spaces and study the evaluation of the goodness of such model...
In this paper, a new approach for automatic audio classification using non-negative matrix factorization (NMF) is presented. Training is performed onto each audio class individua...
We developed a novel multipoint measurement system capable of acquiring video and sound at more than 100 points in a "synchronized" manner. In this paper, we first descr...
With the advent and proliferation of digital cameras and computers, the number of digital photos created and stored by consumers has grown extremely large. This created increasing...
Yi Wu, Igor Kozintsev, Jean-Yves Bouguet, Carole D...
For large scale automatic semantic video characterization, it is necessary to learn and model a large number of semantic concepts. These semantic concepts do not exist in isolatio...
In order to store, and retrieve images from large databases, we propose a framework, based on multiple description coding paradigm, that disseminates images over distributed serve...
A method is proposed to encode multiple regions of interest(ROI) in JPEG2000 image. It rearranges truncation point for every codeblock in each layer. It assigns higher bitrate to ...
Jun Hou, Xiangzhong Fang, Jiliang Li, Haibin Yin, ...