Sciweavers

» Email data cleaning
KDD
2008
ACM
Febrl: an open source data cleaning, deduplication and record linkage system with a graphical user interface
Matching records that refer to the same entity across databases is becoming an increasingly important part of many data mining projects, as often data from multiple sources needs ...
Peter Christen
ICDE
2006
IEEE
A Pipelined Framework for Online Cleaning of Sensor Data Streams
Shawn R. Jeffery, Gustavo Alonso, Michael J. Frank...
ACSAC
2005
IEEE
Securing Email Archives through User Modeling
Online email archives are an under-protected yet extremely sensitive information resource. Email archives can store years' worth of personal and business email in an easy-to-access...
Yiru Li, Anil Somayaji
IDA
2003
Springer
A Logical Formalisation of the Fellegi-Holt Method of Data Cleaning
The Fellegi-Holt method automatically “corrects” data that fail some predefined requirements. Computer implementations of the method were used in many national statistics bure...
Agnes Boskovitz, Rajeev Goré, Markus Heglan...
SIGMOD
2010
ACM
ERACER: a database approach for statistical inference and data cleaning
Real-world databases often contain syntactic and semantic errors, in spite of integrity constraints and other safety measures incorporated into modern DBMSs. We present ERACER, an...
Chris Mayfield, Jennifer Neville, Sunil Prabhakar