Sciweavers

2827 search results - page 89 / 566
» Marking Text Documents
Sort
View
PLDI
2010
ACM
14 years 5 months ago
A Context-free Markup Language for Semi-structured Text
An ad hoc data format is any non-standard, semi-structured data format for which robust data processing tools are not available. In this paper, we present ANNE, a new kind of mark...
Qian Xi, David Walker
ICPR
2004
IEEE
14 years 9 months ago
Morphological Tagging Approach in Document Analysis of Invoices
In this paper a morphological tagging approach for document image invoice analysis is described. Tokens close by their morphology and confirmed in their location within different ...
Abdel Belaïd, Yolande Belaïd
SIGIR
2009
ACM
14 years 2 months ago
Identifying the original contribution of a document via language modeling
Abstract. One major goal of text mining is to provide automatic methods to help humans grasp the key ideas in ever-increasing text corpora. To this effect, we propose a statistica...
Benyah Shaparenko, Thorsten Joachims
ICMCS
1999
IEEE
153views Multimedia» more  ICMCS 1999»
14 years 11 days ago
CamWorks: A Video-Based Tool for Efficient Capture from Paper Source Documents
We describe the design and evaluation of CamWorks, a system that employs a video camera as a means of supporting capture from paper sources during reading and writing. The user ca...
William M. Newman, Christopher R. Dance, Alex S. T...
LREC
2010
170views Education» more  LREC 2010»
13 years 9 months ago
Building a Domain-specific Document Collection for Evaluating Metadata Effects on Information Retrieval
This paper describes the development of a structured document collection containing user-generated text and numerical metadata for exploring the exploitation of metadata in inform...
Walid Magdy, Jinming Min, Johannes Leveling, Garet...