Sciweavers

245 search results - page 26 / 49
» A New Data Cube for Integrating Data Mining and OLAP
KDD
2008
ACM
176 views, Data Mining
14 years 9 months ago
Febrl: an open source data cleaning, deduplication and record linkage system with a graphical user interface
Matching records that refer to the same entity across databases is becoming an increasingly important part of many data mining projects, as often data from multiple sources needs ...
Peter Christen
KDD
2004
ACM
142 views, Data Mining
14 years 9 months ago
Meta-classification of Multi-type Cancer Gene Expression Data
Massive publicly available gene expression data consisting of different experimental conditions and microarray platforms introduce new challenges in data mining when integrating m...
Benny Y. M. Fung, Vincent T. Y. Ng
AIR
2000
91 views
13 years 8 months ago
PlanMine: Predicting Plan Failures Using Sequence Mining
This paper presents the PLANMINE sequence mining algorithm to extract patterns of events that predict failures in databases of plan executions. New techniques were needed because p...
Mohammed Javeed Zaki, Neal Lesh, Mitsunori Ogihara
ICDE
2006
IEEE
144 views, Database
14 years 10 months ago
Super-Scalar RAM-CPU Cache Compression
High-performance data-intensive query processing tasks like OLAP, data mining or scientific data analysis can be severely I/O bound, even when high-end RAID storage systems are us...
Marcin Zukowski, Niels Nes, Peter A. Boncz, Sá...
SIGSOFT
2010
ACM
13 years 6 months ago
Software is data too
Software systems are designed and engineered to process data. However, software is data too. The size and variety of today's software artifacts and the multitude of stakehold...
Andrian Marcus, Tim Menzies