Sciweavers

808 search results - page 84 / 162
» Keyword-based document clustering
Sort
View
ICTAI
2007
IEEE
14 years 2 months ago
Document Length Normalization by Statistical Regression
The document-length normalization problem has been widely studied in the field of Information Retrieval. The Cosine Normalization [2], the Maximum tf Normalization [1] and the By...
Sylvain Lamprier, Tassadit Amghar, Bernard Levrat,...
DOCENG
2010
ACM
13 years 9 months ago
Picture detection in document page images
We present a method for picture detection in document page images, which can come from scanned or camera images, or rendered from electronic file formats. Our method uses OCR to s...
Patrick Chiu, Francine Chen, Laurent Denoue
DOCENG
2010
ACM
13 years 9 months ago
Glyph extraction from historic document images
This paper is about the reproduction of ancient texts with vectorised fonts. While for OCR only recognition rates count, a reproduction process does not necessarily require the re...
Lothar Meyer-Lerbs, Arne Schuldt, Björn Gottf...
ICMCS
2005
IEEE
91views Multimedia» more  ICMCS 2005»
14 years 1 months ago
An Intuitive Graphic Environment for Navigation and Classification of Multimedia Documents
In this work we propose an intuitive graphic framework for the effective visualization of MPEG-7 low-level features, in the context of classification and annotation of audio-visu...
Marco Campanella, Riccardo Leonardi, Pierangelo Mi...
DKE
2007
132views more  DKE 2007»
13 years 7 months ago
Automated ontology construction for unstructured text documents
Ontology is playing an increasingly important role in knowledge management and the Semantic Web. This study presents a novel episode-based ontology construction mechanism to extra...
Chang-Shing Lee, Yuan-Fang Kao, Yau-Hwang Kuo, Mei...