Similarity measure for CCITT Group 4 compressed document images



Measuring the similarity of document images plays a crucial role in document image retrieval. This paper proposes a method for measuring the similarity of CCITT Group 4 compressed document images. Features are extracted directly from the changing elements of the compressed images. An unsupervised classifier uses the weighted Hausdorff distance to assign all word objects from two document images to corresponding classes, while possible stop words are excluded. Document vectors are built from the occurrence frequencies of the word object classes, and the pairwise similarity of two document images is given by the scalar product of their document vectors. Five groups of articles from different domains are used to test the validity of the approach.
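The last step of the abstract, building document vectors from class frequencies and comparing them by scalar product, can be illustrated with a minimal Python sketch. This is not the authors' implementation: it assumes the word objects have already been assigned integer class labels by some classifier, the names `document_vector`, `normalize`, and `similarity` are hypothetical, and scaling the vectors to unit length before the scalar product is an assumption the abstract does not state.

```python
import math
from collections import Counter


def document_vector(class_labels, stop_classes=frozenset()):
    """Count how often each word-object class occurs in one document.

    class_labels: iterable of class ids, one per word object (assumed to
    come from an earlier classification step, not shown here).
    stop_classes: class ids treated as stop words and skipped.
    """
    return Counter(c for c in class_labels if c not in stop_classes)


def normalize(vec):
    """Scale a sparse vector to unit length (an assumption of this sketch)."""
    norm = math.sqrt(sum(v * v for v in vec.values()))
    if norm == 0:
        return {}
    return {k: v / norm for k, v in vec.items()}


def similarity(vec_a, vec_b):
    """Scalar product of two unit-length sparse document vectors."""
    a, b = normalize(vec_a), normalize(vec_b)
    return sum(w * b.get(k, 0.0) for k, w in a.items())


# Toy class labels; class 0 plays the role of a stop-word class.
stop = frozenset({0})
doc_a = document_vector([0, 1, 1, 2], stop)
doc_b = document_vector([0, 0, 1, 1, 2], stop)
doc_c = document_vector([3, 3, 4], stop)

print(similarity(doc_a, doc_b))  # same class profile once class 0 is dropped
print(similarity(doc_a, doc_c))  # no shared classes
```

With unit-length vectors the score lies between 0 and 1, so documents of different lengths can be compared on the same scale; dropping `normalize` gives the raw scalar product instead.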

Yue Lu, Chew Lim Tan, Liying Fan, Weihua Huang


Document Image Retrieval | Document Images | Document Vectors | ICIP 2001 | Image Processing


» Group 4 Compressed Document Matching

» Evaluation of Lossless Compression Methods for Gray Scale Document Images

Post Info

Added	25 Oct 2009
Updated	27 Oct 2009
Type	Conference
Year	2001
Where	ICIP
Authors	Yue Lu, Chew Lim Tan, Liying Fan, Weihua Huang


Sciweavers
