Sciweavers

DEXAW
1999
IEEE

Textual Similarities Based on a Distributional Approach

14 years 5 months ago
Textual Similarities Based on a Distributional Approach
The design of efficient textual similarities is an important issue in the domain of textual data exploration. Textual similarities are for example central in document collection structuring (e.g. clustering), or in Information Retrieval (IR) which relies on the computation of textual similarities for measuring the adequacy between a query and documents. The objective of this paper is to present and compare several textual similarity measures in the framework of the Distributional Semantics (DS) model for IR. This model is an extension of the standard Vector Space model, which further takes the co-frequencies between the terms in a given reference corpus into account. These co-frequencies are considered to provide a distributional representation of the "semantics" of the terms. The co-occurrence profiles are used to represent the documents as vectors. Practical retrieval experiments using DS-based similarity models have been conducted in the framework of the AMARYLLIS evaluat...
Romaric Besançon, Martin Rajman, Jean-C&eac
Added 03 Aug 2010
Updated 03 Aug 2010
Type Conference
Year 1999
Where DEXAW
Authors Romaric Besançon, Martin Rajman, Jean-Cédric Chappelier
Comments (0)