A new method for three dimensional (3D) genus-zero shape classification is proposed. It conformally maps a 3D mesh onto a unit sphere and uses normal vectors to generate a spheric...
Traditionally, the Universal Background Model (UBM) is viewed as the background model of the entire acoustic feature space. We propose a novel interpretation of the UBM model, and...
Latent Layout Analysis (LLA) is a novel unsupervised learning technique to discover objects in unseen images using a set of un-annotated training images. LLA defines a generative ...
Kernel Fisher Discriminant Analysis (KFDA) has achieved great success in pattern recognition recently. However, the training process of KFDA is too time consuming (even intractabl...
Although detecting text lines in machine printed documents is typically considered a solved problem, it is still a challenge to segment handwritten text lines in the general sense...
This paper investigates the problem of incorporating auxiliary information (e.g. pitch) for speech recognition using dynamic Bayesian networks (DBNs). Previous works usually model...
We aim to improve the accuracy of handwritten Chinese character recognition using two advanced techniques: discriminative feature extraction (DFE) and discriminative learning quad...
Image registration is a critical step in medical image analysis. In this paper, a novel image registration method based on the discrete wavelet frame transform (DWFT) and the sum ...
Shutao Li, Jinglin Peng, James T. Kwok, Jing Zhang
With the proliferation of camera phones, new information retrieval applications will emerge. The image of a scene captured by a camera phone can be a query to a remote server to i...