In this paper we present a novel shape descriptor based on shape context, which in combination with hierarchical distance based hashing is used for word and graphical pattern based document image indexing and retrieval. The shape descriptor represents the relative arrangement of points sampled on the boundary of the shape of object. We also demonstrate the applicability of the novel shape descriptor for classification of characters and symbols. For indexing, we provide a new formulation for distance based hierarchical locality sensitive hashing. Experiments have yielded promising results.
Ehtesham Hassan, Santanu Chaudhury, M. Gopal