The Construction and Evaluation of Word Space Models

15 years 8 months ago

Download www.lrec-conf.org

Semantic similarity is a key issue in many computational tasks. This paper goes into the development and evaluation of two common ways of automatically calculating the semantic similarity between two words. On the one hand, such methods may depend on a manually constructed thesaurus like (Euro)WordNet. Their performance is often evaluated on the basis of a very restricted set of human similarity ratings. On the other hand, corpus-based methods rely on the distribution of two words in a corpus to determine their similarity. Their performance is generally quantified through a comparison with the judgements of the first type of approach. This paper introduces a new Gold Standard of more than 5,000 human intra-category similarity judgements. We show that corpus-based methods regularly outperform (Euro)WordNet on this data set, and that the use of the latter as a Gold Standard for the former, is thus often far from ideal.

Yves Peirsman, Simon De Deyne, Kris Heylen, Dirk G

Real-time Traffic

Corpus-based Methods | Education | Human Similarity Ratings | LREC 2008 | Semantic Similarity |

claim paper

» Constructing Semantic Space Models from Parsed Corpora

» The SSpace Package An Open Source Package for Word Space Models

» Orthogonal Negation in Vector Spaces for Modelling WordMeanings and Document Retrieval

» Constructing cylindrical coordinate colour spaces

» Embedding Visual Words into Concept Space for Action and Scene Recognition

» Automatic extraction of roads from aerial images based on scale space and snakes

» Comparing Different Properties Involved in Word Similarity Extraction

» ContextualGuided BagofVisualWords Model for Multiclass Object Categorization

Post Info
More Details (n/a)

Added	29 Oct 2010
Updated	29 Oct 2010
Type	Conference
Year	2008
Where	LREC
Authors	Yves Peirsman, Simon De Deyne, Kris Heylen, Dirk Geeraerts

Comments (0)

Sciweavers

The Construction and Evaluation of Word Space Models

Corpus-based Methods | Education | Human Similarity Ratings | LREC 2008 | Semantic Similarity |

Explore & Download

Productivity Tools

Sciweavers