Sciweavers

26 search results - page 2 / 6
» Dealing with Predictive-but-Unpredictable Attributes in Nois...
SIGMOD
2010
ACM
174 views, Database
13 years 11 months ago
Sampling dirty data for matching attributes
We investigate the problem of creating and analyzing samples of relational databases to find relationships between string-valued attributes. Our focus is on identifying attribute...
Henning Köhler, Xiaofang Zhou, Shazia Wasim S...
SEBD
2007
114 views, Database
13 years 8 months ago
A New Type of Metadata for Querying Data Integration Systems
Research on data integration has provided languages and systems able to guarantee an integrated intensional representation of a given set of data sources. A significant limitation...
Sonia Bergamaschi, Francesco Guerra, Mirko Orsini,...
ER
2004
Springer
246 views, Database
14 years 8 days ago
Data Mapping Diagrams for Data Warehouse Design with UML
In Data Warehouse (DW) scenarios, ETL (Extraction, Transformation, Loading) processes are responsible for the extraction of data from heterogeneous operational data sourc...
Sergio Luján-Mora, Panos Vassiliadis, Juan ...
CIS
2007
Springer
14 years 1 months ago
Mining with Noise Knowledge: Error Aware Data Mining
Real-world data mining deals with noisy information sources where data collection inaccuracy, device limitations, data transmission and discretization errors, or man-made pertur...
Xindong Wu
IJDE
2007
123 views
13 years 6 months ago
Identifying Authorship by Byte-Level N-Grams: The Source Code Author Profile (SCAP) Method
Source code author identification deals with identifying the most likely author of a computer program, given a set of predefined author candidates. There are several scenarios whe...
Georgia Frantzeskou, Efstathios Stamatatos, Stefan...