Information Technology

SIGIR
2008
ACM

Semi-supervised spam filtering: does it work?

15 years 7 months ago

The results of the 2006 ECML/PKDD Discovery Challenge suggest that semi-supervised learning methods work well for spam filtering when the source of available labeled examples diff...

Mona Mojdeh, Gordon V. Cormack

SIGIR
2008
ACM

Selecting good expansion terms for pseudo-relevance feedback

15 years 7 months ago

Download www.iro.umontreal.ca

Pseudo-relevance feedback assumes that most frequent terms in the pseudo-feedback documents are useful for the retrieval. In this study, we re-examine this assumption and show tha...

Guihong Cao, Jian-Yun Nie, Jianfeng Gao, Stephen R...

SIGIR
2008
ACM

XML-aided phrase indexing for hypertext documents

15 years 7 months ago

Download www.cs.helsinki.fi

We combine techniques of XML Mining and Text Mining for the benefit of Information Retrieval. By manipulating the word sequence according to the XML structure of the marked-up tex...

Miro Lehtonen, Antoine Doucet

SIGIR
2008
ACM

Relevance assessment: are judges exchangeable and does it matter

15 years 7 months ago

Download es.csiro.au

We investigate to what extent people making relevance judgements for a reusable IR test collection are exchangeable. We consider three classes of judge: "gold standard" ...

Peter Bailey, Nick Craswell, Ian Soboroff, Paul Th...

SIGIR
2008
ACM

Query-drift prevention for robust query expansion

15 years 7 months ago

Download www.technion.ac.il

Pseudo-feedback-based automatic query expansion yields effective retrieval performance on average, but results in performance inferior to that of using the original query for many...

Liron Zighelnic, Oren Kurland

SIGIR
2008
ACM

Score standardization for inter-collection comparison of retrieval systems

15 years 7 months ago

Download ww2.cs.mu.oz.au

The goal of system evaluation in information retrieval has always been to determine which of a set of systems is superior on a given collection. The tool used to determine system ...

William Webber, Alistair Moffat, Justin Zobel

SIGIR
2008
ACM

A user browsing model to predict search engine click data from past observations

15 years 7 months ago

Download www.bpiwowar.net

Search engine click logs provide an invaluable source of relevance information but this information is biased because we ignore which documents from the result list the users have...

Georges Dupret, Benjamin Piwowarski

SIGIR
2008
ACM

Detecting synonyms in social tagging systems to improve content retrieval

15 years 7 months ago

Download www.cs.vu.nl

Collaborative tagging used in online social content systems is naturally characterized by many synonyms, causing low precision retrieval. We propose a mechanism based on user pref...

Maarten Clements, Arjen P. de Vries, Marcel J. T. ...

SIGIR
2008
ACM

A study of query length

15 years 7 months ago

Download staff.science.uva.nl

We analyse query length, and fit power-law and Poisson distributions to four different query sets. We provide a practical model for query length, based on the truncation of a Pois...

Avi Arampatzis, Jaap Kamps

SIGIR
2008
ACM

Real-time automatic tag recommendation

15 years 7 months ago