TR9856: A Multi-word Term Relatedness Benchmark

10 years 3 months ago

Download www.openu.ac.il

Measuring word relatedness is an important ingredient of many NLP applications. Several datasets have been developed in order to evaluate such measures. The main drawback of existing datasets is the focus on single words, although natural language contains a large proportion of multiword terms. We propose the new TR9856 dataset which focuses on multi-word terms and is signiﬁcantly larger than existing datasets. The new dataset includes many real world terms such as acronyms and named entities, and further handles term ambiguity by providing topical context for all term pairs. We report baseline results for common relatedness methods over the new data, and exploit its magnitude to demonstrate that a combination of these methods outperforms each individual method.

Ran Levy, Liat Ein-Dor, Shay Hummel, Ruty Rinott,

Real-time Traffic

ACL 2015 | Computational Linguistics |

claim paper

Post Info
More Details (n/a)

Added	13 Apr 2016
Updated	13 Apr 2016
Type	Journal
Year	2015
Where	ACL
Authors	Ran Levy, Liat Ein-Dor, Shay Hummel, Ruty Rinott, Noam Slonim

Comments (0)

Sciweavers

TR9856: A Multi-word Term Relatedness Benchmark

ACL 2015 | Computational Linguistics |

Explore & Download

Productivity Tools

Sciweavers