Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

170

ACL
1994

120views Computational Linguistics» more ACL 1994»

A Corpus-Based Approach to Automatic Compound Extraction

15 years 8 months ago

A Corpus-Based Approach to Automatic Compound Extraction

Download www.mt-archive.info

An automatic compound retrieval method is proposed to extract compounds within a text message. It uses n-gram mutual information, relative frequency count and parts of speech as the features for compound extraction. The problem is modeled as a two-class classification problem based on the distributional characteristics of n-gram tokens in the compound and the non-compound clusters. The recall and precision using the proposed approach are 96.2% and 48.2% for bigram compounds and 96.6% and 39.6% for trigram compounds for a testing corpus of 49,314 words. A significant cutdown in processing time has been observed.

Keh-Yih Su, Ming-Wen Wu, Jing-Shin Chang

Real-time Traffic

ACL 1994 | ACL 2007 | Automatic Compound Retrieval | N-gram Mutual Information | Relative Frequency Count |

claim paper

Related Content

» FineGrained Geographical Relation Extraction from Wikipedia

» Learning Visual Compound Models from Parallel ImageText Datasets

» Probabilistic Topic Models for Learning Terminological Ontologies

» Multiword Expressions in the wild The mwetoolkit comes in handy

» Micro pattern evolution

» Summarizing local context to personalize global web search

» Machine Translation of Sentences with Fixed Expressions

Post Info
More Details (n/a)

Added	02 Nov 2010
Updated	02 Nov 2010
Type	Conference
Year	1994
Where	ACL
Authors	Keh-Yih Su, Ming-Wen Wu, Jing-Shin Chang

Comments (0)