Sciweavers

2423 search results - page 33 / 485
» Outlier detection in performance data of parallel applicatio...
Sort
View
SIGMOD
2010
ACM
211views Database» more  SIGMOD 2010»
14 years 11 days ago
ERACER: a database approach for statistical inference and data cleaning
Real-world databases often contain syntactic and semantic errors, in spite of integrity constraints and other safety measures incorporated into modern DBMSs. We present ERACER, an...
Chris Mayfield, Jennifer Neville, Sunil Prabhakar
AMW
2010
13 years 9 months ago
Robust Clustering of Data Streams using Incremental Optimization
Discovering the patterns in evolving data streams is a very important and challenging task. In many applications, it is useful to detect the dierent patterns evolving over time and...
Basheer Hawwash, Olfa Nasraoui
KAIS
2007
112views more  KAIS 2007»
13 years 7 months ago
The pairwise attribute noise detection algorithm
Analyzing the quality of data prior to constructing data mining models is emerging as an important issue. Algorithms for identifying noise in a given data set can provide a good me...
Jason Van Hulse, Taghi M. Khoshgoftaar, Haiying Hu...
CSB
2004
IEEE
136views Bioinformatics» more  CSB 2004»
13 years 11 months ago
Minimum Entropy Clustering and Applications to Gene Expression Analysis
Clustering is a common methodology for analyzing the gene expression data. In this paper, we present a new clustering algorithm from an information-theoretic point of view. First,...
Haifeng Li, Keshu Zhang, Tao Jiang
CGO
2007
IEEE
14 years 1 months ago
SuperPin: Parallelizing Dynamic Instrumentation for Real-Time Performance
Dynamic instrumentation systems have proven to be extremely valuable for program introspection, architectural simulation, and bug detection. Yet a major drawback of modern instrum...
Steven Wallace, Kim M. Hazelwood