Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

202

IJCNLP
2005
Springer

179views Natural Language Processing» more IJCNLP 2005»

Automatic Term Extraction Based on Perplexity of Compound Words

16 years 4 days ago

Automatic Term Extraction Based on Perplexity of Compound Words

Download www.aclweb.org

Many methods of term extraction have been discussed in terms of their accuracy on huge corpora. However, when we try to apply various methods that derive from frequency to a small corpus, we may not be able to achieve suﬃcient accuracy because of the shortage of statistical information on frequency. This paper reports a new way of extracting terms that is tuned for a very small corpus. It focuses on the structure of compound terms and calculates perplexity on the term unit’s left-side and right-side. The results of our experiments revealed that the accuracy with the proposed method was not that advantageous. However, experimentation with the method combining perplexity and frequency information obtained the highest average-precision in comparison with other methods.

Minoru Yoshida, Hiroshi Nakagawa

Real-time Traffic

IJCNLP 2005 | Many Methods | Method Combining Perplexity | Natural Language Processing | Small Corpus |

claim paper

Related Content

» A Statistical CorpusBased Term Extractor

» A CorpusBased Approach to Automatic Compound Extraction

» Automatic CorpusBased Extraction of Chinese Legal Terms

» Automatic extraction of bilingual terms from a ChineseJapanese parallel corpus

» A new probabilistic retrieval model based on the dirichlet compound multinomial distributi...

» TermBased Clustering and Summarization of Web Page Collections

» Using Prosodic Features in Language Models for Meetings

» WordSieve A Method for RealTime Context Extraction

» Multiword Expressions in the wild The mwetoolkit comes in handy

Post Info
More Details (n/a)

Added	27 Jun 2010
Updated	27 Jun 2010
Type	Conference
Year	2005
Where	IJCNLP
Authors	Minoru Yoshida, Hiroshi Nakagawa

Comments (0)