Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

180

EACL
2006
ACL Anthology

91views Natural Language Processing» more EACL 2006»

Multilingual Term Extraction from Domain-specific Corpora Using Morphological Structure

15 years 8 months ago

Multilingual Term Extraction from Domain-specific Corpora Using Morphological Structure

Download acl.ldc.upenn.edu

Morphologically complex terms composed from Greek or Latin elements are frequent in scientific and technical texts. Word forming units are thus relevant cues for the identification of terms in domainspecific texts. This article describes a method for the automatic extraction of terms relying on the detection of classical prefixes and word-initial combining forms. Word-forming units are identified using a regular expression. The system then extracts terms by selecting words which either begin or coalesce with these elements. Next, terms are grouped in families which are displayed as a weighted list in HTML format.

Delphine Bernhard

Real-time Traffic

Complex Terms | EACL 2006 | Natural Language Processing | Word Forming Units | Word-initial Combining Forms |

claim paper

Related Content

» Extracting Multilingual Topics from Unaligned Comparable Corpora

» Phrase Translation Extraction from Aligned Parallel Corpora Using Suffix Arrays and Relate...

» A Cheap and Fast Way to Build Useful Translation Lexicons

» Broad Coverage Multilingual Deep Sentence Generation with a Stochastic MultiLevel Realizer

» MARS Multilingual Access and Retrieval System with Enhanced Query Translation and Document...

» AnCora Multilevel Annotated Corpora for Catalan and Spanish

» Encoding Terms from a Scientific Domain in a Terminological Database Methodology and Crite...

» Constructing and Using Broadcoverage Lexical Resource for Enhancing Morphological Analysis...

» A Comparative Evaluation of Term Recognition Algorithms

Post Info
More Details (n/a)

Added	30 Oct 2010
Updated	30 Oct 2010
Type	Conference
Year	2006
Where	EACL
Authors	Delphine Bernhard

Comments (0)