Sciweavers

832 search results - page 88 / 167
» Document clustering with committees
Sort
View
DOCENG
2010
ACM
13 years 10 months ago
Glyph extraction from historic document images
This paper is about the reproduction of ancient texts with vectorised fonts. While for OCR only recognition rates count, a reproduction process does not necessarily require the re...
Lothar Meyer-Lerbs, Arne Schuldt, Björn Gottf...
ICMCS
2005
IEEE
91views Multimedia» more  ICMCS 2005»
14 years 2 months ago
An Intuitive Graphic Environment for Navigation and Classification of Multimedia Documents
In this work we propose an intuitive graphic framework for the effective visualization of MPEG-7 low-level features, in the context of classification and annotation of audio-visu...
Marco Campanella, Riccardo Leonardi, Pierangelo Mi...
DKE
2007
132views more  DKE 2007»
13 years 9 months ago
Automated ontology construction for unstructured text documents
Ontology is playing an increasingly important role in knowledge management and the Semantic Web. This study presents a novel episode-based ontology construction mechanism to extra...
Chang-Shing Lee, Yuan-Fang Kao, Yau-Hwang Kuo, Mei...
EMNLP
2010
13 years 7 months ago
Evaluating Models of Latent Document Semantics in the Presence of OCR Errors
Models of latent document semantics such as the mixture of multinomials model and Latent Dirichlet Allocation have received substantial attention for their ability to discover top...
Daniel David Walker, William B. Lund, Eric K. Ring...
ECIR
2006
Springer
13 years 10 months ago
Automatic Document Organization in a P2P Environment
Abstract. This paper describes an efficient method to construct reliable machine learning applications in peer-to-peer (P2P) networks by building ensemble based meta methods. We co...
Stefan Siersdorfer, Sergej Sizov