Sciweavers

268 search results - page 6 / 54
» Extracting significant words from corpora for ontology extra...
Sort
View
LREC
2010
208views Education» more  LREC 2010»
13 years 10 months ago
Extraction of German Multiword Expressions from Parsed Corpora Using Context Features
We report about tools for the extraction of German multiword expressions (MWEs) from text corpora; we extract word pairs, but also longer MWEs of different patterns, e.g. verb-nou...
Marion Weller, Ulrich Heid
ECIR
2010
Springer
13 years 10 months ago
Extracting Multilingual Topics from Unaligned Comparable Corpora
Topic models have been studied extensively in the context of monolingual corpora. Though there are some attempts to mine topical structure from cross-lingual corpora, they require ...
Jagadeesh Jagarlamudi, Hal Daumé III
LREC
2010
189views Education» more  LREC 2010»
13 years 10 months ago
Extracting Surface Realisation Templates from Corpora
In Natural Language Generation (NLG), template-based surface realisation is an effective solution to the problem of producing surface strings from a given semantic representation,...
Thiago D. Tadeu, Eder M. de Novais, Ivandré...
ACL
2008
13 years 10 months ago
Pivot Approach for Extracting Paraphrase Patterns from Bilingual Corpora
Paraphrase patterns are useful in paraphrase recognition and generation. In this paper, we present a pivot approach for extracting paraphrase patterns from bilingual parallel corp...
Shiqi Zhao, Haifeng Wang, Ting Liu, Sheng Li
LREC
2010
216views Education» more  LREC 2010»
13 years 10 months ago
BlogBuster: A Tool for Extracting Corpora from the Blogosphere
This paper presents BlogBuster, a tool for extracting a corpus from the blogosphere. The topic of cleaning arbitrary web pages with the goal of extracting a corpus from web data, ...
Georgios Petasis, Dimitrios Petasis