Sciweavers

290 search results - page 44 / 58
» Document normalization revisited
Sort
View
SAC
2011
ACM
12 years 10 months ago
Hierarchical comments-based clustering
Information resources on the Web like videos, images, and documents are increasingly becoming more “social” through user engagement via commenting systems. These commenting sy...
Chiao-Fang Hsu, James Caverlee, Elham Khabiri
CICLING
2010
Springer
13 years 11 months ago
Word Length n-Grams for Text Re-use Detection
Abstract. The automatic detection of shared content in written documents –which includes text reuse and its unacknowledged commitment, plagiarism– has become an important probl...
Alberto Barrón-Cedeño, Chiara Basile...
DAS
2010
Springer
13 years 11 months ago
Nearest neighbor based collection OCR
Conventional optical character recognition (OCR) systems operate on individual characters and words, and do not normally exploit document or collection context. We describe a Coll...
K. Pramod Sankar, C. V. Jawahar, Raghavan Manmatha
ICIP
2002
IEEE
14 years 9 months ago
A comparative analysis of two distance measures in color image databases
Euclidean distance measure has been used in comparing feature vectors of images, while cosine angle distance measure is used in document retrieval. In this paper, we theoretically...
Shamik Sural, Gang Qian, Sakti Pramanik
ICPR
2010
IEEE
14 years 2 months ago
Non-Rigid Image Registration for Historical Manuscript Restoration
This paper presents a non-rigid registration method for the restoration of double-sided historical manuscripts. Firstly, the gradient direction maps of the two images of a manuscr...
Jie Wang, Chew-Lim Tan