Sciweavers

EMNLP
2009
13 years 7 months ago
Discovery of Term Variation in Japanese Web Search Queries
In this paper we address the problem of identifying a broad range of term variations in Japanese web search queries, where these variations pose a particularly thorny problem due ...
Hisami Suzuki, Xiao Li, Jianfeng Gao
EMNLP
2009
13 years 7 months ago
Large-Scale Verb Entailment Acquisition from the Web
Textual entailment recognition plays a fundamental role in tasks that require indepth natural language understanding. In order to use entailment recognition technologies for real-...
Chikara Hashimoto, Kentaro Torisawa, Kow Kuroda, S...
EMNLP
2009
13 years 7 months ago
Stream-based Randomised Language Models for SMT
Randomised techniques allow very big language models to be represented succinctly. However, being batch-based they are unsuitable for modelling an unbounded stream of language whi...
Abby Levenberg, Miles Osborne
EMNLP
2009
13 years 7 months ago
Person Cross Document Coreference with Name Perplexity Estimates
The Person Cross Document Coreference systems depend on the context for making decisions on the possible coreferences between person name mentions. The amount of context required ...
Octavian Popescu
EMNLP
2009
13 years 7 months ago
Multilingual Spectral Clustering Using Document Similarity Propagation
We present a novel approach for multilingual document clustering using only comparable corpora to achieve cross-lingual semantic interoperability. The method models document colle...
Dani Yogatama, Kumiko Tanaka-Ishii
EMNLP
2009
13 years 7 months ago
Acquiring Translation Equivalences of Multiword Expressions by Normalized Correlation Frequencies
In this paper, we present an algorithm for extracting translations of any given multiword expression from parallel corpora. Given a multiword expression to be translated, the meth...
Ming-Hong Bai, Jia-Ming You, Keh-Jiann Chen, Jason...
EMNLP
2009
13 years 7 months ago
Efficient kernels for sentence pair classification
In this paper, we propose a novel class of graphs, the tripartite directed acyclic graphs (tDAGs), to model first-order rule feature spaces for sentence pair classification. We in...
Fabio Massimo Zanzotto, Lorenzo Dell'Arciprete
EMNLP
2009
13 years 7 months ago
Finding Short Definitions of Terms on Web Pages
We present a system that finds short definitions of terms on Web pages. It employs a Maximum Entropy classifier, but it is trained on automatically generated examples; hence, it i...
Gerasimos Lampouras, Ion Androutsopoulos
EMNLP
2009
13 years 7 months ago
Learning Term-weighting Functions for Similarity Measures
Measuring the similarity between two texts is a fundamental problem in many NLP and IR applications. Among the existing approaches, the cosine measure of the term vectors represen...
Wen-tau Yih
EMNLP
2009
13 years 7 months ago
Wikipedia as Frame Information Repository
In this paper, we address the issue of automatic extending lexical resources by exploiting existing knowledge repositories. In particular, we deal with the new task of linking Fra...
Sara Tonelli, Claudio Giuliano