Sciweavers

69 search results - page 12 / 14
» Statistical Ranking in Tactical Generation
Sort
View
WWW
2006
ACM
14 years 4 months ago
Do not crawl in the DUST: different URLs with similar text
We consider the problem of dust: Different URLs with Similar Text. Such duplicate URLs are prevalent in web sites, as web server software often uses aliases and redirections, and...
Uri Schonfeld, Ziv Bar-Yossef, Idit Keidar
LREC
2008
124views Education» more  LREC 2008»
14 years 9 days ago
Acquiring a Poor Man's Inflectional Lexicon for German
Many NLP modules and applications require the availability of a module for wide-coverage inflectional analysis. One way to obtain such analyses is to use an morphological analyser...
Peter Adolphs
SIGIR
2002
ACM
13 years 10 months ago
Term-specific smoothing for the language modeling approach to information retrieval: the importance of a query term
This paper follows a formal approach to information retrieval based on statistical language models. By introducing some simple reformulations of the basic language modeling approa...
Djoerd Hiemstra
EMNLP
2009
13 years 8 months ago
Collocation Extraction Using Monolingual Word Alignment Method
Statistical bilingual word alignment has been well studied in the context of machine translation. This paper adapts the bilingual word alignment algorithm to monolingual scenario ...
Zhan-yi Liu, Haifeng Wang, Hua Wu, Sheng Li
SIGMOD
2008
ACM
164views Database» more  SIGMOD 2008»
14 years 11 months ago
Finding frequent items in probabilistic data
Computing statistical information on probabilistic data has attracted a lot of attention recently, as the data generated from a wide range of data sources are inherently fuzzy or ...
Qin Zhang, Feifei Li, Ke Yi