Search Sciweavers | Sciweavers

182

COLING
2010

108views Computational Linguistics» more COLING 2010»

Large Scale Parallel Document Mining for Machine Translation

15 years 2 months ago

A distributed system is described that reliably mines parallel text from large corpora. The approach can be regarded as cross-language near-duplicate detection, enabled by an init...

Jakob Uszkoreit, Jay Ponte, Ashok C. Popat, Moshe ...

claim paper

Read More »

159

click to vote

COLING
2010

117views Computational Linguistics» more COLING 2010»

Corpus-based Semantic Class Mining: Distributional vs. Pattern-Based Approaches

15 years 2 months ago

Download research.microsoft.com

Main approaches to corpus-based semantic class mining include distributional similarity (DS) and pattern-based (PB). In this paper, we perform an empirical comparison of them, bas...

Shuming Shi, Huibin Zhang, Xiaojie Yuan, Ji-Rong W...

claim paper

Read More »

191

click to vote

ACL
2007

123views Computational Linguistics» more ACL 2007»

PageRanking WordNet Synsets: An Application to Opinion Mining

15 years 9 months ago

Download acl.ldc.upenn.edu

This paper presents an application of PageRank, a random-walk model originally devised for ranking Web search results, to ranking WordNet synsets in terms of how strongly they pos...

Andrea Esuli, Fabrizio Sebastiani

claim paper

Read More »

222

click to vote

IJPRAI
2002

142views more IJPRAI 2002»

Improving Encarta Search Engine Performance by Mining User Logs

15 years 7 months ago

Download research.microsoft.com

We propose a data-mining approach that produces generalized query patterns (with generalized keywords) from the raw user logs of the Microsoft Encarta search engine (http://encart...

Charles X. Ling, Jianfeng Gao, Huajie Zhang, Weini...

claim paper

Read More »

202

click to vote

EMNLP
2009

154views Natural Language Processing» more EMNLP 2009»

Mining Search Engine Clickthrough Log for Matching N-gram Features

15 years 5 months ago

Download www.aclweb.org

User clicks on a URL in response to a query are extremely useful predictors of the URL's relevance to that query. Exact match click features tend to suffer from severe data s...

Huihsin Tseng, Longbin Chen, Fan Li, Ziming Zhuang...

claim paper

Read More »

Sciweavers

Explore & Download

Productivity Tools

Sciweavers