Sciweavers

WWW
2008
ACM
15 years 6 days ago
Generating diverse and representative image search results for landmarks
Can we leverage the community-contributed collections of rich media on the web to automatically generate representative and diverse views of the world's landmarks? We use a c...
Lyndon S. Kennedy, Mor Naaman
WWW
2008
ACM
15 years 6 days ago
A larger scale study of robots.txt
A website can regulate search engine crawler access to its content using the robots exclusion protocol, specified in its robots.txt file. The rules in the protocol enable the site...
Santanu Kolay
WWW
2008
ACM
15 years 6 days ago
GSP-ExR: GSP protocol with an exclusive right for keyword auctions
We propose a keyword auction protocol called the Generalized Second Price with an Exclusive Right (GSP-ExR). In existing keyword auctions, the number of displayed advertisements i...
Yuko Sakurai, Atsushi Iwasaki, Yasumasa Saito, Mak...
WWW
2008
ACM
15 years 6 days ago
Fourth international workshop on adversarial information retrieval on the web (AIRWeb 2008)
Adversarial IR in general, and search engine spam, in particular, are engaging research topics with a real-world impact for Web users, advertisers and publishers. The AIRWeb works...
Carlos Castillo, Kumar Chellapilla, Dennis Fetterl...
WWW
2008
ACM
15 years 6 days ago
Which "Apple" are you talking about?
In a higher level task such as clustering of web results or word sense disambiguation, knowledge of all possible distinct concepts in which an ambiguous word can be expressed woul...
Mandar Rahurkar, Dan Roth, Thomas S. Huang
WWW
2008
ACM
15 years 6 days ago
As we may perceive: finding the boundaries of compound documents on the web
This paper considers the problem of identifying on the Web compound documents (cDocs), groups of web pages that in aggregate constitute semantically coherent information entities...
Pavel Dmitriev
WWW
2008
ACM
15 years 6 days ago
Online learning from click data for sponsored search
Sponsored search is one of the enabling technologies for today's Web search engines. It corresponds to matching and showing ads related to the user query on the search engine...
Massimiliano Ciaramita, Vanessa Murdock, Vassilis ...
WWW
2008
ACM
15 years 6 days ago
An initial investigation on evaluating semantic web instance data
Many emerging semantic web applications include ontologies from one set of authors and instance data from another (often much larger) set of authors. Often ontologies are reused a...
Li Ding, Jiao Tao, Deborah L. McGuinness
WWW
2008
ACM
15 years 6 days ago
Personalized web exploration with task models
Personalized Web search has emerged as one of the hottest topics for both the Web industry and academic researchers. However, the majority of studies on personalized search focuse...
Jae-wook Ahn, Peter Brusilovsky, Daqing He, Jonath...
WWW
2008
ACM
15 years 6 days ago
Extracting spam blogs with co-citation clusters
This paper reports the estimated number of spam blogs in order to assess their current state in the blogosphere. To extract spam blogs, I developed a traversal method among co-cit...
Kazunari Ishida