Sciweavers

222 search results - page 4 / 45
» Building a test collection for complex document information ...
Sort
View
JUCS
2008
167views more  JUCS 2008»
13 years 7 months ago
A Generic Architecture for the Conversion of Document Collections into Semantically Annotated Digital Archives
: Mass digitization of document collections with further processing and semantic annotation is an increasing activity among libraries and archives at large for preservation, browsi...
Josep Lladós, Dimosthenis Karatzas, Joan Ma...
SIGIR
2009
ACM
14 years 1 months ago
Building enriched document representations using aggregated anchor text
It is well known that anchor text plays a critical role in a variety of search tasks performed over hypertextual domains, including enterprise search, wiki search, and web search....
Donald Metzler, Jasmine Novak, Hang Cui, Srihari R...
AND
2009
13 years 4 months ago
Tools for monitoring, visualizing, and refining collections of noisy documents
Developing better systems for document image analysis requires understanding errors, their sources, and their effects. The interactions between various processing steps are comple...
Daniel P. Lopresti, George Nagy
ICDAR
2009
IEEE
14 years 1 months ago
Metadata Extraction from PDF Papers for Digital Library Ingest
In this paper we analyze our recent research on the use of document analysis techniques for metadata extraction from PDF papers. We describe a package that is designed to extract ...
Simone Marinai
TVCG
2012
225views Hardware» more  TVCG 2012»
11 years 9 months ago
Evaluating the Role of Time in Investigative Analysis of Document Collections
—Time is a universal and essential aspect of data in any investigative analysis. It helps analysts establish causality, build storylines from evidence, and reject infeasible hypo...
Bum chul Kwon, Waqas Javed, Sohaib Ghani, Niklas E...