Sciweavers

WWW
2008
ACM
14 years 9 months ago
Detecting image spam using visual features and near duplicate detection
Email spam is a much-studied topic, but even though current email spam-detection software has been gaining a competitive edge against text-based email spam, new advances in spam g...
Bhaskar Mehta, Saurabh Nangia, Manish Gupta 0002, ...
WWW
2008
ACM
14 years 9 months ago
Exploring social annotations for information retrieval
Social annotation has gained increasing popularity in many Web-based applications, leading to an emerging research area in text analysis and information retrieval. This paper is c...
Ding Zhou, Jiang Bian, Shuyi Zheng, Hongyuan Zha, ...
WWW
2008
ACM
14 years 9 months ago
What do they think?: aggregating local views about news events and topics
The web has become an important medium for news delivery and consumption. Fresh content about a variety of topics and events is constantly being created and published on the web b...
Jiahui Liu, Larry Birnbaum
WWW
2008
ACM
14 years 9 months ago
Recrawl scheduling based on information longevity
It is crucial for a web crawler to distinguish between ephemeral and persistent content. Ephemeral content (e.g., quote of the day) is usually not worth crawling, because by the t...
Christopher Olston, Sandeep Pandey
WWW
2008
ACM
14 years 9 months ago
CM-PMI: improved web-based association measure with contextual label matching
WebPMI is a popular web-based association measure to evaluate the semantic similarity between two queries (i.e., words or entities) by leveraging search results returned by search ...
Xiaojun Wan