Natural Language Processing

216

CICLING
2009
Springer

151views Natural Language Processing» more CICLING 2009»

Exploiting Parallel Treebanks to Improve Phrase-Based Statistical Machine Translation

16 years 8 months ago

We use existing tools to automatically build two parallel treebanks from existing parallel corpora. We then show that combining the data extracted from both the treebanks and the ...

John Tinsley, Mary Hearne, Andy Way

claim paper

Read More »

203

click to vote

CICLING
2009
Springer

224views Natural Language Processing» more CICLING 2009»

Enriching Statistical Translation Models Using a Domain-Independent Multilingual Lexical Knowledge Base

16 years 8 months ago

Download www.lsi.upc.edu

This paper presents a method for improving phrase-based Statistical Machine Translation systems by enriching the original translation model with information derived from a multilin...

Miguel García, Jesús Giménez,...

claim paper

Read More »

254

click to vote

CICLING
2009
Springer

215views Natural Language Processing» more CICLING 2009»

Cross-Language Frame Semantics Transfer in Bilingual Corpora

16 years 8 months ago

Download dit.unitn.it

Recent work on the transfer of semantic information across languages has been recently applied to the development of resources annotated with Frame information for different non-En...

Roberto Basili, Diego De Cao, Danilo Croce, Bonave...

claim paper

Read More »

219

click to vote

CICLING
2009
Springer

214views Natural Language Processing» more CICLING 2009»

Semi-supervised Clustering for Word Instances and Its Effect on Word Sense Disambiguation

16 years 8 months ago

Download www.comp.nus.edu.sg

We propose a supervised word sense disambiguation (WSD) system that uses features obtained from clustering results of word instances. Our approach is novel in that we employ semi-s...

Kazunari Sugiyama, Manabu Okumura

claim paper

Read More »

220

click to vote

CICLING
2009
Springer

140views Natural Language Processing» more CICLING 2009»

Business Specific Online Information Extraction from German Websites

16 years 8 months ago

Download www.cis.uni-muenchen.de

This paper presents a system that uses the domain name of a German business website to locate its information pages (e.g. company profile, contact page, imprint) and then identifi...

Yeong Su Lee, Michaela Geierhos

claim paper

Read More »

163

Voted

CICLING
2009
Springer

108views Natural Language Processing» more CICLING 2009»

Improved Unsupervised Name Discrimination with Very Wide Bigrams and Automatic Cluster Stopping

16 years 8 months ago

Download www.d.umn.edu

We cast name discrimination as a problem in clustering short contexts. Each occurrence of an ambiguous name is treated independently, and represented using second?order context vec...

Ted Pedersen

claim paper

Read More »

141

Voted

CICLING
2009
Springer

117views Natural Language Processing» more CICLING 2009»

A Karaka Based Annotation Scheme for English

16 years 8 months ago

Download www.iiit.net

Ashwini Vaidya, Samar Husain, Prashanth Mannem, Di...

claim paper

Read More »

197

Voted

CICLING
2009
Springer

92views Natural Language Processing» more CICLING 2009»

Reducing the Plagiarism Detection Search Space on the Basis of the Kullback-Leibler Distance

16 years 8 months ago

Download users.dsic.upv.es

Abstract. Automatic plagiarism detection considering a reference corpus compares a suspicious text to a set of original documents in order to relate the plagiarised fragments to th...

Alberto Barrón-Cedeño, Paolo Rosso, ...

claim paper

Read More »

167

Voted

CICLING
2009
Springer

114views Natural Language Processing» more CICLING 2009»

A General Method for Transforming Standard Parsers into Error-Repair Parsers

16 years 8 months ago

Download www.grupocole.org

A desirable property for any system dealing with unrestricted natural language text is robustness, the ability to analyze any input regardless of its grammaticality. In this paper ...

Carlos Gómez-Rodríguez, Miguel A. Al...

claim paper

Read More »

163

click to vote

CICLING
2009
Springer

201views Natural Language Processing» more CICLING 2009»

Guessers for Finite-State Transducer Lexicons

16 years 8 months ago

Download www.ling.helsinki.fi

Abstract. Language software applications encounter new words, e.g., acronyms, technical terminology, names or compounds of such words. In order to add new words to a lexicon, we ne...

Krister Lindén

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers