Sciweavers

Free Online Productivity Tools i2Speak i2Symbol i2OCR iTex2Img iWeb2Print iWeb2Shot i2Type iPdf2Split iPdf2Merge i2Bopomofo i2Arabic i2Style i2Image i2PDF iLatex2Rtf Sci2ools

171

CIKM
2005
Springer

130views Information Technology» more CIKM 2005»

Web-centric language models

16 years 4 days ago

Web-centric language models

Download staff.science.uva.nl

We investigates language models for informational and navigational web search. Retrieval on the web is a task that diﬀers substantially from ordinary ad hoc retrieval. We perform an analysis of prior probability of relevance for a wide range of non-content features, shedding further light on the importance of non-content features for web retrieval. Language models can naturally incorporate multiple document representations, as well as non-content information. For the former, we employ mixture language models based on document full-text, incoming anchor-text, and document titles. For the latter, we study a range of priors based on document length, URL structure, and link topology. We look at three types of topics—distillation, home page, and named page— as well as for a mixed query set. We ﬁnd that the mixture models lead to considerable improvement of retrieval eﬀectiveness for all topic types. The web-centric priors generally lead to further improvement of retrieval eﬀect...

Jaap Kamps

Real-time Traffic

Ad Hoc Retrieval | CIKM 2005 | Language Models | Retrieval Eﬀectiveness |

claim paper

Related Content

» Comparison of Different Modeling Units for Language Model Adaptation for Inflected Languag...

» Modeling the Internet

» Joint MorphologicalLexical Language Modeling for Processing Morphologically Rich Languages...

» Neural network based language models for highly inflective languages

» Language and Task Independent Text Categorization with Simple Language Models

» Enhanced Suffix Arrays as Language Models Virtual kTestable Languages

» On Using Written Language Training Data for Spoken Language Modeling

» UB at CLEF2004 Cross Language Information Retrieval Using Statistical Language Models

» Deciphering Foreign Language by Combining Language Models and Context Vectors

» MultiClass Composite Ngram Language Model for Spoken Language Processing Using Multiple Wo...

Post Info
More Details (n/a)

Added	26 Jun 2010
Updated	26 Jun 2010
Type	Conference
Year	2005
Where	CIKM
Authors	Jaap Kamps

Comments (0)