Sciweavers

COLING
2010
13 years 2 months ago
Utilizing Variability of Time and Term Content, within and across Users in Session Detection
In this paper, we describe a SVM classification framework of session detection task on both Chinese and English query logs. With eight features on the aspects of temporal and cont...
Shu-Qi Sun, Sheng Li, Muyun Yang, Haoliang Qi, Tie...
COLING
2010
13 years 2 months ago
Expressing OWL axioms by English sentences: dubious in theory, feasible in practice
With OWL (Web Ontology Language) established as a standard for encoding ontologies on the Semantic Web, interest has begun to focus on the task of verbalising OWL code in controll...
Richard Power, Allan Third
COLING
2010
13 years 2 months ago
Acquisition of Unknown Word Paradigms for Large-Scale Grammars
Unknown words are a major issue for large-scale grammars of natural language. We propose a machine learning based algorithm for acquiring lexical entries for all forms in the para...
Kostadin Cholakov, Gertjan van Noord
COLING
2010
13 years 2 months ago
Varro: An Algorithm and Toolkit for Regular Structure Discovery in Treebanks
The Varro toolkit is a system for identifying and counting a major class of regularity in treebanks and annotated natural language data in the form of treestructures: frequently r...
Scott Martens
COLING
2010
13 years 2 months ago
Global topology of word co-occurrence networks: Beyond the two-regime power-law
Word co-occurrence networks are one of the most common linguistic networks studied in the past and they are known to exhibit several interesting topological characteristics. In th...
Monojit Choudhury, Diptesh Chatterjee, Animesh Muk...
COLING
2010
13 years 2 months ago
Opinion Summarization with Integer Linear Programming Formulation for Sentence Extraction and Ordering
In this paper we propose a novel algorithm for opinion summarization that takes account of content and coherence, simultaneously. We consider a summary as a sequence of sentences ...
Hitoshi Nishikawa, Takaaki Hasegawa, Yoshihiro Mat...
COLING
2010
13 years 2 months ago
Bilingual lexicon extraction from comparable corpora using in-domain terms
Many existing methods for bilingual lexicon learning from comparable corpora are based on similarity of context vectors. These methods suffer from noisy vectors that greatly affec...
Azniah Ismail, Suresh Manandhar
COLING
2010
13 years 2 months ago
Streaming Cross Document Entity Coreference Resolution
Previous research in cross-document entity coreference has generally been restricted to the offline scenario where the set of documents is provided in advance. As a consequence, t...
Delip Rao, Paul McNamee, Mark Dredze
COLING
2010
13 years 2 months ago
Automatic Persian WordNet Construction
In this paper, an automatic method for Persian WordNet construction based on Prenceton WordNet 2.1 (PWN) is introduced. The proposed approach uses Persian and English corpora as w...
Mortaza Montazery, Feshaam Faili
COLING
2010
13 years 2 months ago
Fast-Champollion: A Fast and Robust Sentence Alignment Algorithm
Sentence-level aligned parallel texts are important resources for a number of natural language processing (NLP) tasks and applications such as statistical machine translation and ...
Peng Li, Maosong Sun, Ping Xue