Exploiting relationships for object consolidation

14 years 8 months ago

Download www.itr-rescue.org

Researchers in the data mining area frequently have to spend signiﬁcant portion of their time on preprocessing the data in order to apply their algorithms to real-world datasets. Many real-world datasets are not perfect: they contain missing, erroneous, duplicate data and other problems. It is a well established fact that, in general, if such problems with data are not corrected, applying data mining algorithm can lead to wrong results (“garbage in, garbage out” principle). Therefore data cleaning techniques should be applied in-advance to the data to ensure high quality of the results. In this paper we address a data cleaning challenge called object consolidation. This challenge arises because often objects in datasets are represented via descriptions (a set of instantiated attributes) which alone might not always uniquely identify the object. The goal of object consolidation is to correctly consolidate (i.e., to group/determine) all the representations of the same object, for ...

Zhaoqi Chen, Dmitri V. Kalashnikov, Sharad Mehrotr

Real-time Traffic

Data Cleaning | Data Mining | Information System | IQIS 2005 | Object Consolidation |

claim paper

Post Info
More Details (n/a)

Added	26 Jun 2010
Updated	26 Jun 2010
Type	Conference
Year	2005
Where	IQIS
Authors	Zhaoqi Chen, Dmitri V. Kalashnikov, Sharad Mehrotra

Comments (0)

Sciweavers

Exploiting relationships for object consolidation

Data Cleaning | Data Mining | Information System | IQIS 2005 | Object Consolidation |

Explore & Download

Productivity Tools

Document Tools

Image Tools

Sciweavers