Sciweavers

775 search results - page 77 / 155
» Email data cleaning
Sort
View
ICDM
2008
IEEE
97views Data Mining» more  ICDM 2008»
14 years 4 months ago
Semi-supervised Learning from General Unlabeled Data
We consider the problem of Semi-supervised Learning (SSL) from general unlabeled data, which may contain irrelevant samples. Within the binary setting, our model manages to better...
Kaizhu Huang, Zenglin Xu, Irwin King, Michael R. L...
KDD
1997
ACM
128views Data Mining» more  KDD 1997»
14 years 2 months ago
An Interactive Visualization Environment for Data Exploration
Exploratory data analysis is a process of sifting through data in search of interesting information or patterns. Analysts’ current tools for exploring data include database mana...
Mark Derthick, John Kolojejchick, Steven F. Roth
SOFTWARE
2002
13 years 9 months ago
Temporal Probabilistic Concepts from Heterogeneous Data Sequences
We consider the problem of characterisation of sequences of heterogeneous symbolic data that arise from a common underlying temporal pattern. The data, which are subject to impreci...
Sally I. McClean, Bryan W. Scotney, Fiona Palmer
ICDE
2012
IEEE
277views Database» more  ICDE 2012»
12 years 13 days ago
Aggregate Query Answering on Possibilistic Data with Cardinality Constraints
— Uncertainties in data arise for a number of reasons: when the data set is incomplete, contains conflicting information or has been deliberately perturbed or coarsened to remov...
Graham Cormode, Divesh Srivastava, Entong Shen, Ti...
DATAMINE
2006
130views more  DATAMINE 2006»
13 years 10 months ago
Mining Adaptive Ratio Rules from Distributed Data Sources
Different from traditional association-rule mining, a new paradigm called Ratio Rule (RR) was proposed recently. Ratio rules are aimed at capturing the quantitative association kno...
Jun Yan, Ning Liu, Qiang Yang, Benyu Zhang, QianSh...