This paper presents a supervised manifold learning model for dimensionality reduction in image and video classification tasks. Unlike most manifold learning models that emphasize the distance preserving, we propose a novel algorithm called maximum distance embedding (MDE), which aims to maximize the distances between some particular pairs of data points, with the intention of flattening the local nonlinearity and keeping the discriminant information simultaneously in the embedded feature space. Moreover, MDE measures the dissimilarity between data points using L1-norm distance, which is more robust to outliers than widely used Frobenius norm distance. To adapt the nature tensor structure of image and video data, we further propose the multilinear MDE (M2 DE). Experiments on various datasets demonstrate that both MDE and M2 DE achieve impressive embedding results of image and video data for classification tasks. Categories and Subject Descriptors I.4.7 [Image Processing and Computer Vi...
Yang Liu, Yan Liu, Keith C. C. Chan