Sciweavers

775 search results - page 64 / 155
» Email data cleaning
Sort
View
CONEXT
2008
ACM
13 years 11 months ago
Towards a new generation of information-oriented internetworking architectures
In response to the limitations of the Internet architecture when used for applications for which it was not originally designed, a series of clean slate efforts have emerged to sh...
Christian Esteve, Fábio Luciano Verdi, Maur...
LREC
2010
217views Education» more  LREC 2010»
13 years 11 months ago
Building a Web Corpus of Czech
Large corpora are essential to modern methods of computational linguistics and natural language processing. In this paper, we describe an ongoing project whose aim is to build a l...
Drahomíra "johanka" Spoustová, Miros...
ICDM
2005
IEEE
187views Data Mining» more  ICDM 2005»
14 years 3 months ago
Parallel Algorithms for Distance-Based and Density-Based Outliers
An outlier is an observation that deviates so much from other observations as to arouse suspicion that it was generated by a different mechanism. Outlier detection has many applic...
Elio Lozano, Edgar Acuña
DAWAK
2005
Springer
14 years 3 months ago
Graph-Based Modeling of ETL Activities with Multi-level Transformations and Updates
Extract-Transform-Load (ETL) workflows are data centric workflows responsible for transferring, cleaning, and loading data from their respective sources to the warehouse. Previous ...
Alkis Simitsis, Panos Vassiliadis, Manolis Terrovi...
SBBD
2007
149views Database» more  SBBD 2007»
13 years 11 months ago
Embedding Similarity Joins into Native XML Databases
Similarity joins in databases can be used for several important tasks such as data cleaning and instance-based data integration. In this paper, we explore ways how to support such ...
Leonardo Ribeiro, Theo Härder