Sciweavers

SIGIR
2010
ACM
13 years 11 months ago
Score distribution models: assumptions, intuition, and robustness to score manipulation
Inferring the score distribution of relevant and non-relevant documents is an essential task for many IR applications (e.g. information filtering, recall-oriented IR, meta-search,...
Evangelos Kanoulas, Keshi Dai, Virgiliu Pavlu, Jav...
HM
2010
Springer
14 years 16 days ago
A Memetic Algorithm for Reconstructing Cross-Cut Shredded Text Documents
The reconstruction of destroyed paper documents became of more interest during the last years. On the one hand it (often) occurs that documents are destroyed by mistake while on th...
Christian Schauer, Matthias Prandtstetter, Gü...
ESA
2010
Springer
14 years 16 days ago
Top-k Ranked Document Search in General Text Databases
Text search engines return a set of k documents ranked by similarity to a query. Typically, documents and queries are drawn from natural language text, which can readily be partiti...
J. Shane Culpepper, Gonzalo Navarro, Simon J. Pugl...
DRR
2010
14 years 16 days ago
Detecting modifications in paper documents: a coding approach
This paper presents an algorithm called CIPDEC (Content Integrity of Printed Documents using Error Correction), which identifies any modifications made to a printed document. CIPD...
Yogesh Sankarasubramaniam, Badri Narayanan, Kapali...
CLEF
2010
Springer
14 years 17 days ago
Multilingual Expert Search using Linked Open Data as Interlingual Representation
Most Information Retrieval models take documents as Bag-of-Words and are thereby bound to the language of the documents. In this paper, we present an approach using Linked...
Daniel Herzig, Hristina Taneva
CLEF
2010
Springer
14 years 17 days ago
External and Intrinsic Plagiarism Detection Using a Cross-Lingual Retrieval and Segmentation System - Lab Report for PAN at CLEF
We present our hybrid system for the PAN challenge at CLEF 2010. Our system performs plagiarism detection for translated and non-translated externally as well as intrinsically plag...
Markus Muhr, Roman Kern, Mario Zechner, Michael Gr...
MVA
1990
14 years 18 days ago
Recognition of Document Structure on the Basis of Spatial and Geometric Relationships between Document Items
This paper introduces a new method to extract and classify the meaningful information from documents automatically. The basic idea in our method is to utilize the spatial and geom...
Qin Luo, Toyohide Watanabe, Yuuji Yoshida, Yasuyos...
NAACL
1994
14 years 23 days ago
Learning from Relevant Documents in Large Scale Routing Retrieval
The normal practice of selecting relevant documents for training routing queries is to either use all relevants or the 'best n' of them after a (retrieval) ranking opera...
K. L. Kwok, Laszlo Grunfeld
TREC
2000
14 years 24 days ago
The PISAB Question Answering System
The PISAB Question Answering system is based on a combination of Information Extraction and Information Retrieval techniques. Knowledge extracted from documents is modeled as a se...
Giuseppe Attardi, Cristian Burrini
WEBNET
1998
14 years 24 days ago
Categorisation by Context
Assistance in retrieving of documents on the World Wide Web is provided either by search engines, through keyword based queries, or by catalogues, which organise documents into hi...
Giuseppe Attardi, Sergio Di Marco, Davide Salvi