Sciweavers

2469 search results for "Self-Protection of Web Content" (page 448 / 494)
KDD
2010
ACM
14 years 28 days ago
Connecting the dots between news articles
The process of extracting useful knowledge from large datasets has become one of the most pressing problems in today’s society. The problem spans entire sectors, from scientists...
Dafna Shahaf, Carlos Guestrin
SIGIR
2010
ACM
14 years 27 days ago
Query log analysis in the context of information retrieval for children
In this paper we analyze queries and sessions intended to satisfy children’s information needs using a large-scale query log. The aim of this analysis is twofold: i) To identify...
Sergio Duarte Torres, Djoerd Hiemstra, Pavel Serdy...
SIGIR
2010
ACM
14 years 27 days ago
Adaptive near-duplicate detection via similarity learning
In this paper, we present a novel near-duplicate document detection method that can easily be tuned for a particular domain. Our method represents each document as a real-valued s...
Hannaneh Hajishirzi, Wen-tau Yih, Aleksander Kolcz
ACMDIS
2006
ACM
14 years 23 days ago
Randomness as a resource for design
Randomness is being harnessed in the design of some interactive systems. This is observed in random blogs, random web searching, and in particular Apple's iPod Shuffle. Yet t...
Tuck Wah Leong, Frank Vetere, Steve Howard
APSEC
2004
IEEE
14 years 23 days ago
MUDABlue: An Automatic Categorization System for Open Source Repositories
Open Source communities typically use a software repository to archive various software projects with their source code, mailing list discussions, documentation, bug reports, and ...
Shinji Kawaguchi, Pankaj K. Garg, Makoto Matsushit...