Sciweavers

898 search results - page 28 / 180
» Making Documents Work: Challenges for Document Understanding
TKDE
2010
15 years 1 months ago
Unsupervised Semantic Similarity Computation between Terms Using Web Documents
In this work, web-based metrics for semantic similarity computation between words or terms are presented and compared with the state-of-the-art. Starting from the funda...
Elias Iosif, Alexandros Potamianos
COMAD
2009
15 years 4 months ago
Business Insight from Collection of Unstructured Formatted Documents with IBM Content Harvester
In this paper, we report the development and experiments of IBM Content Harvester (CH), a tool to analyze and recover templates and content from word processor created text docume...
Biplav Srivastava, Yuan-Chi Chang
SIGDOC
2003
ACM
15 years 8 months ago
An interaction initiative model for documentation
In this paper we propose a model of creation and use of documentation based on the concept of mixed-initiative interaction. In our model, successful single-initiative interaction ...
David G. Novick, Karen Ward
DRR
2003
15 years 5 months ago
Information retrieval for OCR documents: a content-based probabilistic correction model
The difficulty with information retrieval for OCR documents lies in the fact that OCR documents comprise of a significant amount of erroneous words and unfortunately most informat...
Rong Jin, ChengXiang Zhai, Alexander G. Hauptmann
SSWMC
2004
15 years 5 months ago
Signature-embedding in printed documents for security and forensic applications
Despite the increase in email and other forms of digital communication, the use of printed documents continues to increase every year. Many types of printed documents need to be ...
Aravind K. Mikkilineni, Gazi N. Ali, Pei-Ju Chiang...