Sciweavers

765 search results - page 32 / 153
» Documenting SODA: An Evaluation of the Process Documentation...
Sort
View
ICDAR
1997
IEEE
14 years 8 days ago
Representing OCRed documents in HTML
ABSTRACT: OCR is an error-prone process. It is time-consuming and expensive to manually proofread OCR results. The errors remaining in OCRed texts can cause serious problems in rea...
Tao Hong, Sargur N. Srihari
ECIR
2007
Springer
13 years 9 months ago
Multinomial Randomness Models for Retrieval with Document Fields
Document fields, such as the title or the headings of a document, offer a way to consider the structure of documents for retrieval. Most of the proposed approaches in the literatu...
Vassilis Plachouras, Iadh Ounis
IS
2006
13 years 8 months ago
Negations and document length in logical retrieval
Abstract. Terms which are not explicitly mentioned in the text of a document receive often a minor role in current retrieval systems. In this work we connect the management of such...
David E. Losada, Alvaro Barreiro
ICDM
2003
IEEE
138views Data Mining» more  ICDM 2003»
14 years 1 months ago
Ontologies Improve Text Document Clustering
Text document clustering plays an important role in providing intuitive navigation and browsing mechanisms by organizing large sets of documents into a small number of meaningful ...
Andreas Hotho, Steffen Staab, Gerd Stumme
WWW
2006
ACM
14 years 8 months ago
Visually guided bottom-up table detection and segmentation in web documents
In the AllRight project, we are developing an algorithm for unsupervised table detection and segmentation that uses the visual rendition of a Web page rather than the HTML code. O...
Bernhard Krüpl, Marcus Herzog