Improving Mining Quality by Exploiting Data Dependency

16 years 16 hour ago

Download www.cs.ucla.edu

The usefulness of the results produced by data mining methods can be critically impaired by several factors such as (1) low quality of data, including errors due to contamination, or incompleteness due to limited bandwidth for data acquisition, and (2) inadequacy of the data model for capturing complex probabilistic relationships in data. Fortunately, a wide spectrum of applications exhibit strong dependencies between data samples. For example, the readings of nearby sensors are generally correlated, and proteins interact with each other when performing crucial functions. Therefore, dependencies among data can be successfully exploited to remedy the problems mentioned above. In this paper, we propose a uniﬁed approach to improving mining quality using Markov networks as the data model to exploit local dependencies. Belief propagation is used to efﬁciently compute the marginal or maximum posterior probabilities, so as to clean the data, to infer missing values, or to improve the min...

Fang Chu, Yizhou Wang, Carlo Zaniolo, Douglas Stot

Real-time Traffic

Data Acquisition | Data Mining | Data Mining Methods | Data Model | PAKDD 2005 |

claim paper

» Improving Quality of Training Data for Learning to Rank Using ClickThrough Data

» Improving Scalability in a Scientific Discovery System by Exploiting Parallelism

» Exploiting Unlabeled Data for Improving Accuracy of Predictive Data Mining

» Get another label improving data quality and data mining using multiple noisy labelers

» Conditional Dependencies A Principled Approach to Improving Data Quality

» Exploiting Network Structure for Active Inference in Collective Classification

» Exploiting relationships for object consolidation

» UMiner A Data Mining System Handling Uncertainty and Quality

Post Info
More Details (n/a)

Added	28 Jun 2010
Updated	28 Jun 2010
Type	Conference
Year	2005
Where	PAKDD
Authors	Fang Chu, Yizhou Wang, Carlo Zaniolo, Douglas Stott Parker Jr.

Comments (0)

Sciweavers

Improving Mining Quality by Exploiting Data Dependency

Data Acquisition | Data Mining | Data Mining Methods | Data Model | PAKDD 2005 |

Explore & Download

Productivity Tools

Sciweavers