Sciweavers

COLING
2008
13 years 9 months ago
Source Language Markers in EUROPARL Translations
This paper shows that it is very often possible to identify the source language of medium-length speeches in the EUROPARL corpus on the basis of frequency counts of word n-grams (...
Hans van Halteren
COLING
2008
13 years 9 months ago
A Classification of Dialogue Actions in Tutorial Dialogue
In this paper we present a taxonomy of dialogue moves which describe the actions that students and tutors perform in tutorial dialogue. We are motivated by the need for a categori...
Mark Buckley, Magdalena Wolska
COLING
2008
13 years 9 months ago
Statistical Anaphora Resolution in Biomedical Texts
This paper presents a probabilistic model for resolution of non-pronominal anaphora in biomedical texts. The model seeks to find the antecedents of anaphoric expressions, both cor...
Caroline Gasperin, Ted Briscoe
COLING
2008
13 years 9 months ago
Toward a Psycholinguistically-Motivated Model of Language Processing
Psycholinguistic studies suggest a model of human language processing that 1) performs incremental interpretation of spoken utterances or written text, 2) preserves ambiguity by m...
William Schuler, Samir AbdelRahman, Tim Miller, La...
COLING
2008
13 years 9 months ago
Chinese Dependency Parsing with Large Scale Automatically Constructed Case Structures
This paper proposes an approach using large scale case structures, which are automatically constructed from both a small tagged corpus and a large raw corpus, to improve Chinese d...
Kun Yu, Daisuke Kawahara, Sadao Kurohashi
COLING
2008
13 years 9 months ago
Recent Advances in a Feature-Rich Framework for Treebank Annotation
This paper presents recent advances in an established treebank annotation framework comprising an abstract XML-based data format, fully customizable editor of tree-based annotat...
Petr Pajas, Jan Stepánek
COLING
2008
13 years 9 months ago
Training Conditional Random Fields Using Incomplete Annotations
We address corpus-building situations in which complete annotation of the whole corpus is time-consuming and unrealistic. Thus, annotation is done only on crucial part of sentences...
Yuta Tsuboi, Hisashi Kashima, Shinsuke Mori, Hirok...
COLING
2008
13 years 9 months ago
Exploiting Graph Structure for Accelerating the Calculation of Shortest Paths in Wordnets
This paper presents an approach for substantially reducing the time needed to calculate the shortest paths between all concepts in a wordnet. The algorithm exploits the unique "...
Holger Wunsch
COLING
2008
13 years 9 months ago
CollabRank: Towards a Collaborative Approach to Single-Document Keyphrase Extraction
Previous methods usually conduct the keyphrase extraction task for single documents separately without interactions for each document, under the assumption that the documents are ...
Xiaojun Wan, Jianguo Xiao
COLING
2008
13 years 9 months ago
Detecting Multiple Facets of an Event using Graph-Based Unsupervised Methods
We propose a new unsupervised method for topic detection that automatically identifies the different facets of an event. We use pointwise Kullback-Leibler divergence along with th...
Pradeep Muthukrishnan, Joshua Gerrish, Dragomir R....