EMNLP
2004
13 years 8 months ago
Sentiment Analysis using Support Vector Machines with Diverse Information Sources
This paper introduces an approach to sentiment analysis which uses support vector machines (SVMs) to bring together diverse sources of potentially pertinent information, including...
Tony Mullen, Nigel Collier
EMNLP
2004
13 years 8 months ago
Unsupervised WSD based on Automatically Retrieved Examples: The Importance of Bias
This paper explores the large-scale acquisition of sense-tagged examples for Word Sense Disambiguation (WSD). We have applied the "WordNet monosemous relatives" method t...
Eneko Agirre, David Martínez
EMNLP
2004
13 years 8 months ago
Comparing and Combining Generative and Posterior Probability Models: Some Advances in Sentence Boundary Detection in Speech
We compare and contrast two different models for detecting sentence-like units in continuous speech. The first approach uses hidden Markov sequence models based on N-grams and max...
Yang Liu, Andreas Stolcke, Elizabeth Shriberg, Mar...
EMNLP
2004
13 years 8 months ago
Attribute-Based and Value-Based Clustering: An Evaluation
In most research on concept acquisition from corpora, concepts are modeled as vectors of relations extracted from syntactic structures. In the case of modifiers, these relations o...
Abdulrahman Almuhareb, Massimo Poesio
EMNLP
2004
13 years 8 months ago
The Influence of Argument Structure on Semantic Role Assignment
We present a data and error analysis for semantic role labelling. In a first experiment, we build a generic statistical model for semantic role assignment in the FrameNet paradigm...
Sebastian Padó, Gemma Boleda
EMNLP
2004
13 years 8 months ago
Calibrating Features for Semantic Role Labeling
This paper takes a critical look at the features used in the semantic role tagging literature and shows that the information in the input, generally a syntactic parse tree, has yet...
Nianwen Xue, Martha Palmer
EMNLP
2004
13 years 8 months ago
Chinese Part-of-Speech Tagging: One-at-a-Time or All-at-Once? Word-Based or Character-Based?
Chinese part-of-speech (POS) tagging assigns one POS tag to each word in a Chinese sentence. However, since words are not demarcated in a Chinese sentence, Chinese POS tagging req...
Hwee Tou Ng, Jin Kiat Low
EMNLP
2004
13 years 8 months ago
Automatic Paragraph Identification: A Study across Languages and Domains
In this paper we investigate whether paragraphs can be identified automatically in different languages and domains. We propose a machine learning approach which exploits textual a...
Caroline Sporleder, Mirella Lapata
EMNLP
2004
13 years 8 months ago
NP Bracketing by Maximum Entropy Tagging and SVM Reranking
We perform Noun Phrase Bracketing by using a local, maximum entropy-based tagging model, which produces bracketing hypotheses. These hypotheses are subsequently fed into a reranki...
Hal Daumé III, Daniel Marcu
EMNLP
2004
13 years 8 months ago
Active Learning and the Total Cost of Annotation
Active learning (AL) promises to reduce the cost of annotating labeled datasets for trainable human language technologies. Contrary to expectations, when creating labeled training...
Jason Baldridge, Miles Osborne