Identifying Multi-word Expressions by Leveraging Morphological and Syntactic Idiosyncrasy

14 years 2 months ago

Download aclweb.org

Multi-word expressions constitute a significant portion of the lexicon of every natural language, and handling them correctly is mandatory for various NLP applications. Yet such entities are notoriously hard to define, and are consequently missing from standard lexicons and dictionaries. Multi-word expressions exhibit idiosyncratic behavior on various levels: orthographic, morphological, syntactic and semantic. In this work we take advantage of the morphological and syntactic idiosyncrasy of Hebrew noun compounds and employ it to extract such expressions from text corpora. We show that relying on linguistic information dramatically improves the accuracy of compound extraction, reducing over one third of the errors compared with the best baseline.

Hassan Al-Haj, Shuly Wintner

Real-time Traffic

COLING 2010 | Computational Linguistics | Multi-word Expressions | Syntactic Idiosyncrasy | Various Nlp Applications |

claim paper

Post Info
More Details (n/a)

Added	13 May 2011
Updated	13 May 2011
Type	Journal
Year	2010
Where	COLING
Authors	Hassan Al-Haj, Shuly Wintner

Comments (0)

Sciweavers

Identifying Multi-word Expressions by Leveraging Morphological and Syntactic Idiosyncrasy

COLING 2010 | Computational Linguistics | Multi-word Expressions | Syntactic Idiosyncrasy | Various Nlp Applications |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers