Sciweavers

829 search results - page 98 / 166
» Minimal document set retrieval
AIRWEB
2005
Springer
14 years 1 month ago
Blocking Blog Spam with Language Model Disagreement
We present an approach for detecting link spam common in blog comments by comparing the language models used in the blog post, the comment, and pages linked by the comments. In co...
Gilad Mishne, David Carmel, Ronny Lempel
DAS
2010
Springer
13 years 11 months ago
Towards more effective distance functions for word image matching
Matching word images has many applications in document recognition and retrieval systems. Dynamic Time Warping (DTW) is popularly used to estimate the similarity between word imag...
Raman Jain, C. V. Jawahar
GFKL
2006
Springer
13 years 11 months ago
Putting Successor Variety Stemming to Work
Stemming algorithms find canonical forms for inflected words, e.g. for declined nouns or conjugated verbs. Since such a unification of words with respect to gender, number, time, ...
Benno Stein, Martin Potthast
LREC
2008
13 years 9 months ago
Producing a Test Collection for Patent Machine Translation in the Seventh NTCIR Workshop
In aiming at research and development on machine translation, we produced a test collection for Japanese-English machine translation in the seventh NTCIR Workshop. This paper desc...
Atsushi Fujii, Masao Utiyama, Mikio Yamamoto, Take...
CLEF
2010
Springer
13 years 7 months ago
Creating a Persian-English Comparable Corpus
Multilingual corpora are valuable resources for cross-language information retrieval and are available in many language pairs. However, the Persian language does not have rich multi...
Homa Baradaran Hashemi, Azadeh Shakery, Heshaam Fe...