Sciweavers

NAACL
2007
13 years 9 months ago
Automating Creation of Hierarchical Faceted Metadata Structures
We describe Castanet, an algorithm for automatically generating hierarchical faceted metadata from textual descriptions of items, to be incorporated into browsing and navigation i...
Emilia Stoica, Marti A. Hearst, Megan Richardson
NAACL
2007
13 years 9 months ago
An Information Retrieval Approach to Sense Ranking
In word sense disambiguation, choosing the most frequent sense for an ambiguous word is a powerful heuristic. However, its usefulness is restricted by the availability of sense-an...
Mirella Lapata, Frank Keller
NAACL
2007
13 years 9 months ago
Analysis of Morph-Based Speech Recognition and the Modeling of Out-of-Vocabulary Words Across Languages
We analyze subword-based language models (LMs) in large-vocabulary continuous speech recognition across four “morphologically rich” languages: Finnish, Estonian, Turkish, and ...
Mathias Creutz, Teemu Hirsimäki, Mikko Kurimo...
NAACL
2007
13 years 9 months ago
Generation by Inverting a Semantic Parser that Uses Statistical Machine Translation
This paper explores the use of statistical machine translation (SMT) methods for tactical natural language generation. We present results on using phrase-based SMT for learning to...
Yuk Wah Wong, Raymond J. Mooney
NAACL
2007
13 years 9 months ago
Chinese Named Entity Recognition with Cascaded Hybrid Model
We propose a high-performance cascaded hybrid model for Chinese NER. Firstly, we use Boosting, a standard and theoretically wellfounded machine learning method to combine a set of...
Xiaofeng Yu
NAACL
2007
13 years 9 months ago
Information Retrieval On Empty Fields
Victor Lavrenko, Xing Yi, James Allan
NAACL
2007
13 years 9 months ago
Using Wikipedia for Automatic Word Sense Disambiguation
This paper describes a method for generating sense-tagged data using Wikipedia as a source of sense annotations. Through word sense disambiguation experiments, we show that the Wi...
Rada Mihalcea
NAACL
2007
13 years 9 months ago
Arabic Diacritization through Full Morphological Tagging
We present a diacritization system for written Arabic which is based on a lexical resource. It combines a tagger and a lexeme language model. It improves on the best results repor...
Nizar Habash, Owen Rambow
NAACL
2007
13 years 9 months ago
Direct Translation Model 2
This paper presents a maximum entropy machine translation system using a minimal set of translation blocks (phrase-pairs). While recent phrase-based statistical machine translatio...
Abraham Ittycheriah, Salim Roukos
NAACL
2007
13 years 9 months ago
Combining Probability-Based Rankers for Action-Item Detection
This paper studies methods that automatically detect action-items in e-mail, an important category for assisting users in identifying new tasks, tracking ongoing ones, and searchi...
Paul N. Bennett, Jaime G. Carbonell