Controlling overlap in content-oriented XML retrieval

The direct application of standard ranking techniques to retrieve individual elements from a collection of XML documents often produces a result set in which the top ranks are dominated by a large number of elements taken from a small number of highly relevant documents. This paper presents and evaluates an algorithm that re-ranks this result set, with the aim of minimizing redundant content while preserving the benefits of element retrieval, including the benefit of identifying topic-focused components contained within relevant documents. The test collection developed by the INitiative for the Evaluation of XML Retrieval (INEX) forms the basis for the evaluation.

Categories and Subject Descriptors: H.3.3 [Information Systems]: Information Storage and Retrieval - Information Search and Retrieval

General Terms: Algorithms, Measurement, Performance, Experimentation

Keywords: XML, Ranking, Information Retrieval

Charles L. A. Clarke

Post Info

Added	26 Jun 2010
Updated	26 Jun 2010
Type	Conference
Year	2005
Where	SIGIR
Authors	Charles L. A. Clarke
