Sciweavers

410 search results - page 36 / 82
» Word Retrieval in Historical Document Using Character-Primit...
Sort
View
CORR
2006
Springer
178views Education» more  CORR 2006»
13 years 9 months ago
A tool set for the quick and efficient exploration of large document collections
: We are presenting a set of multilingual text analysis tools that can help analysts in any field to explore large document collections quickly in order to determine whether the do...
Camelia Ignat, Bruno Pouliquen, Ralf Steinberger, ...
JCDL
2005
ACM
100views Education» more  JCDL 2005»
14 years 2 months ago
Automatic extraction of titles from general documents using machine learning
In this paper, we propose a machine learning approach to title extraction from general documents. By general documents, we mean documents that can belong to any one of a number of...
Yunhua Hu, Hang Li, Yunbo Cao, Dmitriy Meyerzon, Q...
CLEF
2007
Springer
14 years 3 months ago
Robust Question Answering for Speech Transcripts Using Minimal Syntactic Analysis
Abstract. This paper describes the participation of the Technical University of Catalonia in the CLEF 2007 Question Answering on Speech Transcripts track. For the processing of man...
Pere Comas, Jordi Turmo, Mihai Surdeanu
SIGIR
2003
ACM
14 years 2 months ago
Text categorization by boosting automatically extracted concepts
Term-based representations of documents have found widespread use in information retrieval. However, one of the main shortcomings of such methods is that they largely disregard le...
Lijuan Cai, Thomas Hofmann
WWW
2006
ACM
14 years 9 months ago
Visually guided bottom-up table detection and segmentation in web documents
In the AllRight project, we are developing an algorithm for unsupervised table detection and segmentation that uses the visual rendition of a Web page rather than the HTML code. O...
Bernhard Krüpl, Marcus Herzog