Sciweavers

329 search results - page 9 / 66
» A Novel Method for Detecting Similar Documents
Sort
View
AIRWEB
2006
Springer
14 years 9 days ago
Tracking Web Spam with Hidden Style Similarity
Automatically generated content is ubiquitous in the web: dynamic sites built using the three-tier paradigm are good examples (e.g. commercial sites, blogs and other sites powered...
Tanguy Urvoy, Thomas Lavergne, Pascal Filoche
ICIP
2007
IEEE
14 years 10 months ago
Abnormal Event Detection from Surveillance Video by Dynamic Hierarchical Clustering
The clustering-based approach for detecting abnormalities in surveillance video requires the appropriate definition of similarity between events. The HMM-based similarity defined ...
Fan Jiang, Ying Wu, Aggelos K. Katsaggelos
DRR
2008
13 years 10 months ago
Versatile page numbering analysis
In this paper, we revisit the problem of detecting the page numbers of a document. This work is motivated by a need for a generic method which applies on a large variety of docume...
Hervé Déjean, Jean-Luc Meunier
AAAI
2006
13 years 10 months ago
Novel Relationship Discovery Using Opinions Mined from the Web
This paper proposes relationship discovery models using opinions mined from the Web instead of only conventional collocations. Web opinion mining extracts subjective information f...
Lun-Wei Ku, Hsiu-Wei Ho, Hsin-Hsi Chen
CIKM
2008
Springer
13 years 10 months ago
Achieving both high precision and high recall in near-duplicate detection
To find near-duplicate documents, fingerprint-based paradigms such as Broder's shingling and Charikar's simhash algorithms have been recognized as effective approaches a...
Lian'en Huang, Lei Wang, Xiaoming Li