Sciweavers

EMNLP
2007
13 years 9 months ago
Lexical Semantic Relatedness with Random Graph Walks
Many systems for tasks such as question answering, multi-document summarization, and information retrieval need robust numerical measures of lexical relatedness. Standard thesauru...
Thad Hughes, Daniel Ramage
EMNLP
2007
13 years 9 months ago
Probabilistic Models of Nonprojective Dependency Trees
A notable gap in research on statistical dependency parsing is a proper conditional probability distribution over nonprojective dependency trees for a given sentence. We exploit t...
David A. Smith, Noah A. Smith
EMNLP
2007
13 years 9 months ago
Large Margin Synchronous Generation and its Application to Sentence Compression
This paper presents a tree-to-tree transduction method for text rewriting. Our model is based on synchronous tree substitution grammar, a formalism that allows local distortion of...
Trevor Cohn, Mirella Lapata
EMNLP
2007
13 years 9 months ago
Semi-Supervised Structured Output Learning Based on a Hybrid Generative and Discriminative Approach
This paper proposes a framework for semi-supervised structured output learning (SOL), specifically for sequence labeling, based on a hybrid generative and discriminative approach...
Jun Suzuki, Akinori Fujino, Hideki Isozaki
EMNLP
2007
13 years 9 months ago
Part-of-Speech Tagging for Middle English through Alignment and Projection of Parallel Diachronic Texts
We demonstrate an approach for inducing a tagger for historical languages based on existing resources for their modern varieties. Tags from Present Day English source text are pro...
Taesun Moon, Jason Baldridge
EMNLP
2007
13 years 9 months ago
Using Foreign Inclusion Detection to Improve Parsing Performance
Inclusions from other languages can be a significant source of errors for monolingual parsers. We show this for English inclusions, which are sufficiently frequent to present a ...
Beatrice Alex, Amit Dubey, Frank Keller
EMNLP
2007
13 years 9 months ago
Compressing Trigram Language Models With Golomb Coding
Trigram language models are compressed using a Golomb coding method inspired by the original Unix spell program. Compression methods trade off space, time and accuracy (loss). The...
Kenneth Church, Ted Hart, Jianfeng Gao
EMNLP
2007
13 years 9 months ago
Characterizing the Errors of Data-Driven Dependency Parsing Models
We present a comparative error analysis of the two dominant approaches in datadriven dependency parsing: global, exhaustive, graph-based models, and local, greedy, transition-base...
Ryan T. McDonald, Joakim Nivre
EMNLP
2007
13 years 9 months ago
Finding Good Sequential Model Structures using Output Transformations
In Sequential Viterbi Models, such as HMMs, MEMMs, and Linear Chain CRFs, the type of patterns over output sequences that can be learned by the model depend directly on the model...
Edward Loper