Sciweavers

31 search results - page 3 / 7
» BlogBuster: A Tool for Extracting Corpora from the Blogosphe...
Sort
View
IJCAI
2001
13 years 8 months ago
Adaptive Information Extraction from Text by Rule Induction and Generalisation
(LP)2 is a covering algorithm for adaptive Information Extraction from text (IE). It induces symbolic rules that insert SGML tags into texts by learning from examples found in a u...
Fabio Ciravegna
CICLING
2004
Springer
13 years 11 months ago
Language-Independent Methods for Compiling Monolingual Lexical Data
Abstract: In this paper we describe a flexible, portable and languageindependent infrastructure for setting up large monolingual language corpora. The approach is based on collecti...
Christian Biemann, Stefan Bordag, Gerhard Heyer, U...
LREC
2010
161views Education» more  LREC 2010»
13 years 8 months ago
An Integrated Digital Tool for Accessing Language Resources
Language resources can be classified under several categories. To be able to query and operate on all (or most of) these categories using a single digital tool would be very helpf...
Anil Kumar Singh, Bharat Ram Ambati
LREC
2010
133views Education» more  LREC 2010»
13 years 8 months ago
Term and Collocation Extraction by Means of Complex Linguistic Web Services
We present a web service-based environment for the use of linguistic resources and tools to address issues of terminology and language varieties. We discuss the architecture, corp...
Ulrich Heid, Fabienne Fritzinger, Erhard W. Hinric...
LREC
2008
90views Education» more  LREC 2008»
13 years 8 months ago
Word Alignment Annotation in a Japanese-Chinese Parallel Corpus
Parallel corpora are critical resources for machine translation research and development since parallel corpora contain translation equivalences of various granularities. Manual a...
Yujie Zhang, Zhulong Wang, Kiyotaka Uchimoto, Qing...