Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

154

HICSS
2002
IEEE

130views Biometrics» more HICSS 2002»

A Novel Method for Detecting Similar Documents

16 years 6 days ago

A Novel Method for Detecting Similar Documents

Download labsoftware.com

We describe a system for rapidly determining document similarity among a set of documents obtained from an information retrieval (IR) system. We obtain a ranked list of the most important terms in each document using a rapid phrase recognizer system. We store these in a database and compute document similarity using a simple database query. If the number of terms found to not be contained in both documents is less than some predetermined threshold compared to the total number of terms in the document, these documents are determined to be very similar.

James W. Cooper, Anni Coden, Eric W. Brown

Real-time Traffic

Biometrics | Compute Document Similarity | Document Similarity | Documents | HICSS 2002 |

claim paper

Related Content

» A Novel Italic Detection and Rectification Method for Chinese Advertising Images

» Adaptive nearduplicate detection via similarity learning

» Detecting Ontology Mappings via Descriptive Statistical Methods

» A novel method for measuring semantic similarity for XML schema matching

» From Identical to Similar Fusing Retrieved Lists Based on InterDocument Similarities

» A Novel Scheme for Video Similarity Detection

» Functional annotation by identification of local surface similarities a novel tool for str...

» Locality preserving indexing for document representation

» A novel unsupervised classification approach for network anomaly detection by kMeans clust...

Post Info
More Details (n/a)

Added	14 Jul 2010
Updated	14 Jul 2010
Type	Conference
Year	2002
Where	HICSS
Authors	James W. Cooper, Anni Coden, Eric W. Brown

Comments (0)