SIGIR 2006 | Sciweavers

SIGIR
2006
ACM

Building a test collection for complex document information processing

Research and development of information access technology for scanned paper documents has been hampered by the lack of public test collections of realistic scope and complexity. A...

David D. Lewis, Gady Agam, Shlomo Argamon, Ophir F...

Is XML retrieval meaningful to users?: searcher preferences for full documents vs. elements

The aim of this study is to investigate whether element retrieval (as opposed to full-text retrieval) is meaningful and useful for searchers when carrying out information-seeking ...

Birger Larsen, Anastasios Tombros, Saadia Malik

User modeling for full-text federated search in peer-to-peer networks

User modeling for information retrieval has mostly been studied to improve the effectiveness of information access in centralized repositories. In this paper we explore user model...

Jie Lu, James P. Callan

Learning to advertise

Content-targeted advertising, the task of automatically associating ads to a Web page, constitutes a key Web monetization strategy nowadays. Further, it introduces new challenging...

Anísio Lacerda, Marco Cristo, Marcos André...

Respect my authority!: HITS without hyperlinks, utilizing cluster-based language models

We present an approach to improving the precision of an initial document ranking wherein we utilize cluster information within a graph-based framework. The main idea is to perform...

Oren Kurland, Lillian Lee

Text clustering with extended user feedback

Text clustering is most commonly treated as a fully automated task without user feedback. However, a variety of researchers have explored mixed-initiative clustering methods which...

Yifen Huang, Tom M. Mitchell

Information retrieval with commonsense knowledge

This paper employs ConceptNet, which covers a rich set of commonsense concepts, to retrieve images with text descriptions by focusing on spatial relationships. Evaluation on test ...

Ming-Hung Hsu, Hsin-Hsi Chen

Identifying comparative sentences in text documents

This paper studies the problem of identifying comparative sentences in text documents. The problem is related to but quite different from sentiment/opinion sentence identification...

Nitin Jindal, Bing Liu

A framework to predict the quality of answers with non-textual features

New types of document collections are being developed by various web services. The service providers keep track of non-textual features such as click counts. In this paper, we pre...

Jiwoon Jeon, W. Bruce Croft, Joon Ho Lee, Soyeon P...

Finding near-duplicate web pages: a large-scale evaluation of algorithms

Broder et al.’s [3] shingling algorithm and Charikar’s [4] random projection based approach are considered “state-of-the-art” algorithms for finding near-duplicate web pag...

Monika Rauch Henzinger
