Sciweavers

CICLING
2008
Springer
14 years 2 months ago
Identification of Transliterated Foreign Words in Hebrew Script
We present a loosely-supervised method for context-free identification of transliterated foreign names and borrowed words in Hebrew text. The method is purely statistical...
Yoav Goldberg, Michael Elhadad
CICLING
2008
Springer
14 years 2 months ago
A Semantics-Enhanced Language Model for Unsupervised Word Sense Disambiguation
An N-gram language model aims at capturing statistical word order dependency information from corpora. Although the concept of language models has been applied extensively to handl...
Shou-de Lin, Karin Verspoor
CICLING
2008
Springer
14 years 2 months ago
Evaluation of Internal Validity Measures in Short-Text Corpora
Short-text clustering is one of the most difficult tasks in natural language processing due to the low frequencies of the document terms. We are interested in analysing these kind...
Diego Ingaramo, David Pinto, Paolo Rosso, Marcelo ...
CICLING
2008
Springer
14 years 2 months ago
A Distributed Database System for Developing Ontological and Lexical Resources in Harmony
In this article, we present the basic ideas of creating a new information-rich lexical database of Dutch, called Cornetto, that is interconnected with corresponding English synsets...
Ales Horák, Piek Vossen, Adam Rambousek
CICLING
2008
Springer
14 years 2 months ago
Deep Lexical Semantics
In the project we describe, we have taken a basic core of about 5000 synsets in WordNet that are the most frequently used, and we have categorized these into sixteen broad categori...
Jerry R. Hobbs
CICLING
2008
Springer
14 years 2 months ago
Natural Language as the Basis for Meaning Representation and Inference
Semantic inference is an important component in many natural language understanding applications. Classical approaches to semantic inference rely on logical representatio...
Ido Dagan, Roy Bar-Haim, Idan Szpektor, Iddo Green...
CICLING
2008
Springer
14 years 2 months ago
Dynamic Translation Memory: Using Statistical Machine Translation to Improve Translation Memory Fuzzy Matches
Professional translators of technical documents often use Translation Memory (TM) systems in order to capitalize on the repetitions frequently observed in these documents...
Ergun Biçici, Marc Dymetman
CICLING
2008
Springer
14 years 2 months ago
Verb Class Discovery from Rich Syntactic Data
Previous research has shown that syntactic features are the most informative features in automatic verb classification. We investigate their optimal characteristics by co...
Lin Sun, Anna Korhonen, Yuval Krymolowski
CICLING
2008
Springer
14 years 2 months ago
Sense Annotation in the Penn Discourse Treebank
An important aspect of discourse understanding and generation involves the recognition and processing of discourse relations. These are conveyed by discourse connectives,...
Eleni Miltsakaki, Livio Robaldo, Alan Lee, Aravind...