Skimming or browsing audio recordings is much more difficult than visually scanning a document because of the temporal nature of audio. By exploiting properties of spontaneous spe...
This work addresses the soundtrack indexing of multimedia documents. We present and merge two audio classification tools that we have developed. The first one, a speech music clas...
Reliable estimation of visual saliency allows appropriate processing of images without prior knowledge of their content, and thus remains an important step in many computer vision ...
A new adaptive multispectral image compression technique based on the regions identified is proposed. The algorithm is adaptive in the sense that according to the data type class ...
In this paper, an unsupervised learning algorithm, neighborhood linear embedding (NLE), is proposed to discover the intrinsic structures such as neighborhood relationships, global ...
Shuzhi Sam Ge, Feng Guan, Yaozhang Pan, Ai Poh Loh