Reconciling Attribute Values from Multiple Data Sources

15 years 8 months ago

Download www.bus.iastate.edu

Because of the heterogeneous nature of multiple data sources, data integration is often one of the most challenging tasks of today's information systems. While the existing literature has focused on problems such as schema integration and entity identification, our current study attempts to answer a basic question: When an attribute value for a real-world entity is recorded differently in two databases, how should the "best" value be chosen from the set of possible values? We first show how probabilities for attribute values can be derived, and then propose a framework for deciding the cost-minimizing value based on the total cost of type I, type II, and misrepresentation errors.

Zhengrui Jiang, Sumit Sarkar, Prabuddha De, Debabr

Real-time Traffic

Attribute Values | ICIS 2004 | Information Technology | Multiple Data Sources | Today's Information Systems |

claim paper

» Reconciling Inconsistent Data in Probabilistic XML Data Integration

» Creating Relational Data from Unstructured and Ungrammatical Data Sources

» Global Detection of Complex Copying Relationships Between Sources

» Phoebus A System for Extracting and Integrating Data from Unstructured and Ungrammatical S...

» Selectivity Estimation Without the Attribute Value Independence Assumption

» Hierarchical Bitmap Index An Efficient and Scalable Indexing Technique for SetValued Attri...

» Concurrent Viewing of Multiple AttributeSpecific Subspaces

» Extraction Techniques for Mining Services from Web Sources

Post Info
More Details (n/a)

Added	31 Oct 2010
Updated	31 Oct 2010
Type	Conference
Year	2004
Where	ICIS
Authors	Zhengrui Jiang, Sumit Sarkar, Prabuddha De, Debabrata Dey

Comments (0)

Sciweavers

Reconciling Attribute Values from Multiple Data Sources

Attribute Values | ICIS 2004 | Information Technology | Multiple Data Sources | Today's Information Systems |

Explore & Download

Productivity Tools

Sciweavers