Sciweavers

ICIP
2001
IEEE

Similarity measure for CCITT Group 4 compressed document images

Measuring the similarity of document images plays a crucial role in document image retrieval. This paper proposes a method for measuring the similarity of CCITT Group 4 compressed document images. Features are extracted directly from the changing elements of the compressed images. An unsupervised classifier uses the weighted Hausdorff distance to assign all word objects from two document images to corresponding classes, while likely stop words are excluded. Document vectors are built from the occurrence frequencies of the word object classes, and the pairwise similarity of two document images is given by the scalar product of their document vectors. Five groups of articles from different domains are used to test the validity of the approach.
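The last step of the abstract (document vectors built from class occurrence frequencies, compared by a scalar product) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the integer class labels, and the unit-length normalization in `similarity` are assumptions made for illustration, and the sketch presumes that word objects have already been assigned to classes by some earlier step.

```python
from collections import Counter
from math import sqrt


def document_vector(class_labels, num_classes):
    """Count how often each word-object class occurs in one document.

    class_labels: list of integer class ids (hypothetical output of the
    classification step), one per word object in the document.
    """
    counts = Counter(class_labels)
    return [counts.get(c, 0) for c in range(num_classes)]


def scalar_product(vec_a, vec_b):
    """Plain scalar (dot) product of two document vectors."""
    return sum(a * b for a, b in zip(vec_a, vec_b))


def similarity(vec_a, vec_b):
    """Scalar product of the vectors after scaling each to unit length.

    The normalization is an assumption of this sketch; the abstract only
    states that similarity is the scalar product of the document vectors.
    """
    norm_a = sqrt(scalar_product(vec_a, vec_a))
    norm_b = sqrt(scalar_product(vec_b, vec_b))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as dissimilar to everything
    return scalar_product(vec_a, vec_b) / (norm_a * norm_b)


# Two toy documents whose word objects fall into 4 hypothetical classes.
doc_a = document_vector([0, 1, 1, 2], num_classes=4)
doc_b = document_vector([1, 1, 2, 3], num_classes=4)
score = similarity(doc_a, doc_b)
```

With normalization, two documents sharing the same class distribution score 1.0 regardless of length, which is one reason a retrieval system might prefer it over the raw product.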
Yue Lu, Chew Lim Tan, Liying Fan, Weihua Huang
Added 25 Oct 2009
Updated 27 Oct 2009
Type Conference
Year 2001
Where ICIP
Authors Yue Lu, Chew Lim Tan, Liying Fan, Weihua Huang