Sciweavers

262 search results - page 19 / 53
» Three-Tier Clustering: An Online Citation Clustering System
Sort
View
ICDM
2009
IEEE
148views Data Mining» more  ICDM 2009»
14 years 2 months ago
Online System Problem Detection by Mining Patterns of Console Logs
Abstract—We describe a novel application of using data mining and statistical learning methods to automatically monitor and detect abnormal execution traces from console logs in ...
Wei Xu, Ling Huang, Armando Fox, David Patterson, ...
EUROSYS
2006
ACM
14 years 4 months ago
Database replication policies for dynamic content applications
The database tier of dynamic content servers at large Internet sites is typically hosted on centralized and expensive hardware. Recently, research prototypes have proposed using d...
Gokul Soundararajan, Cristiana Amza, Ashvin Goel
MM
2009
ACM
197views Multimedia» more  MM 2009»
14 years 2 months ago
Visual summaries of popular landmarks from community photo collections
We present a novel data-driven algorithm that leverages online image repositories such as Flickr for automatically generating tourist maps. Our hypothesis is that, given a large e...
Wei-Chao Chen, Agathe Battestini, Natasha Gelfand,...
SIGMOD
1998
ACM
271views Database» more  SIGMOD 1998»
13 years 7 months ago
Towards On-Line Analytical Mining in Large Databases
Great e orts have been paid in the Intelligent Database Systems Research Lab for the research and development of e cient data mining methods and construction of on-line analytical...
Jiawei Han
KDD
2008
ACM
156views Data Mining» more  KDD 2008»
14 years 8 months ago
Unsupervised deduplication using cross-field dependencies
Recent work in deduplication has shown that collective deduplication of different attribute types can improve performance. But although these techniques cluster the attributes col...
Robert Hall, Charles A. Sutton, Andrew McCallum