
13 years 8 months ago
Feature Subsumption for Opinion Analysis
Lexical features are key to many approaches to sentiment analysis and opinion detection. A variety of representations have been used, including single words, multi-word Ngrams, ph...
Ellen Riloff, Siddharth Patwardhan, Janyce Wiebe
13 years 8 months ago
Entity Annotation based on Inverse Index Operations
Entity annotation involves attaching a label such as 'name' or 'organization' to a sequence of tokens in a document. All the current rule-based and machine learning-based...
Ganesh Ramakrishnan, Sreeram Balakrishnan, Sachind...
13 years 8 months ago
Get out the vote: Determining support or opposition from Congressional floor-debate transcripts
We investigate whether one can determine from the transcripts of U.S. Congressional floor debates whether the speeches represent support of or opposition to proposed legislation. ...
Matt Thomas, Bo Pang, Lillian Lee
13 years 8 months ago
A Hybrid Markov/Semi-Markov Conditional Random Field for Sequence Segmentation
Markov order-1 conditional random fields (CRFs) and semi-Markov CRFs are two popular models for sequence segmentation and labeling. Both models have advantages in terms of the typ...
Galen Andrew
13 years 8 months ago
Distributed Language Modeling for N-best List Re-ranking
Ying Zhang, Almut Silja Hildebrand, Stephan Vogel
13 years 8 months ago
Priming Effects in Combinatory Categorial Grammar
This paper presents a corpus-based account of structural priming in human sentence processing, focusing on the role that syntactic representations play in such an account. We esti...
David Reitter, Julia Hockenmaier, Frank Keller
13 years 8 months ago
A Discriminative Model for Tree-to-Tree Translation
This paper proposes a statistical, tree-to-tree model for producing translations. Two main contributions are as follows: (1) a method for the extraction of syntactic structures wit...
Brooke Cowan, Ivona Kucerova, Michael Collins
13 years 8 months ago
Capturing Out-of-Vocabulary Words in Arabic Text
The increasing flow of information between languages has led to a rise in the frequency of non-native or loan words, where terms of one language appear transliterated in another. ...
Abdusalam F. A. Nwesri, Seyed M. M. Tahaghoghi, Fa...
13 years 8 months ago
Learning Field Compatibilities to Extract Database Records from Unstructured Text
Named-entity recognition systems extract entities such as people, organizations, and locations from unstructured text. Rather than extract these mentions in isolation, this paper ...
Michael L. Wick, Aron Culotta, Andrew McCallum