The problem of multimodal data mining in a multimedia database can be addressed as a structured prediction problem where we learn the mapping from an input to the structured and i...
Zhen Guo, Zhongfei Zhang, Eric P. Xing, Christos F...
In this paper, we investigate to which extent the "raw" mapping of Taylor series coefficients into jet-space can be used as a "language" for describing local i...
In this paper, we present a multi-label sparse coding
framework for feature extraction and classification within
the context of automatic image annotation. First, each image
is ...
Changhu Wang (University of Science and Technology...
We propose a new method for handwritten word-spotting which does not require prior training or gathering examples for querying. More precisely, a model is trained "on the fly...
In many modern applications such as biometric identification systems, sensor networks, medical imaging, geology, and multimedia databases, the data objects are not described exact...