Sciweavers

DATASCIENCE
2007
88views more  DATASCIENCE 2007»
13 years 11 months ago
Detecting Family Resemblance: Automated Genre Classification
This paper presents results in automated genre classification of digital documents in PDF format. It describes genre classification as an important ingredient in contextualising s...
Yunhyong Kim, Seamus Ross
JASIS
2006
120views more  JASIS 2006»
13 years 11 months ago
Building a reusable test collection for question answering
In contrast to traditional information retrieval systems, which return ranked lists of documents that users must manually browse through, a question answering system attempts to d...
Jimmy J. Lin, Boris Katz
IVC
2006
105views more  IVC 2006»
13 years 11 months ago
A partition approach for the restoration of camera images of planar and curled document
As camera resolution increases, high-speed non-contact text capture through a digital camera is opening up a new channel for text capture and understanding. Unfortunately, the cap...
Shijian Lu, Ben M. Chen, Chi Chung Ko
IPM
2006
130views more  IPM 2006»
13 years 11 months ago
Exploiting structural information for semi-structured document categorization
This paper examines several different approaches to exploiting structural information in semi-structured document categorization. The methods under consideration are designed for ...
Andrej Bratko, Bogdan Filipic
CORR
2007
Springer
65views Education» more  CORR 2007»
13 years 11 months ago
Text Line Segmentation of Historical Documents: a Survey
There is a huge amount of historical documents in libraries and in various National Archives that have not been exploited electronically. Although automatic reading of complete pa...
Laurence Likforman-Sulem, Abderrazak Zahour, Bruno...
JASIS
2008
102views more  JASIS 2008»
13 years 11 months ago
Hierarchical summarization of large documents
mation science has shown that human abstractors extract sentences for summaries based on the hierarchical structure of documents; however, the existing automatic summarization mode...
Christopher C. Yang, Fu Lee Wang
IVS
2008
90views more  IVS 2008»
13 years 11 months ago
Jigsaw: supporting investigative analysis through interactive visualization
Investigative analysts who work with collections of text documents connect embedded threads of evidence in order to formulate hypotheses about plans and activities of potential in...
John T. Stasko, Carsten Görg, Zhicheng Liu
IPM
2008
114views more  IPM 2008»
13 years 11 months ago
User-assisted query translation for interactive cross-language information retrieval
Interactive Cross-Language Information Retrieval (CLIR), a process in which searcher and system collaborate to find documents that satisfy an information need regardless of the la...
Douglas W. Oard, Daqing He, Jianqiang Wang
CG
2007
Springer
13 years 11 months ago
Visual text mining using association rules
In many situations, individuals or groups of individuals are faced with the need to examine sets of documents to achieve understanding of their structure and to locate relevant in...
Alneu de Andrade Lopes, Roberto Pinho, Fernando Vi...
FTIR
2006
128views more  FTIR 2006»
13 years 11 months ago
Authorship Attribution
Authorship attribution, the science of inferring characteristics of the author from the characteristics of documents written by that author, is a problem with a long history and a...
Patrick Juola