Sciweavers

» Privacy-Preserving Record Linkage
AAAI
2006
Phoebus: A System for Extracting and Integrating Data from Unstructured and Ungrammatical Sources
With the proliferation of online classifieds and auctions comes a new need to meaningfully search and organize the items for sale. However, since the seller's item descriptio...
Matthew Michelson, Craig A. Knoblock
IJCAI
2003
A Comparison of String Distance Metrics for Name-Matching Tasks
Using an open-source, Java toolkit of name-matching methods, we experimentally compare string distance metrics on the task of matching entity names. We investigate a number of dif...
William W. Cohen, Pradeep D. Ravikumar, Stephen E....
AMW
2009
T3: On Mapping Text To Time Series
We investigate if the mapping between text and time series data is feasible such that relevant data mining problems in text can find their counterparts in time series (and vice ver...
Tao Yang, Dongwon Lee
KDD
2003
ACM
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
SIGMOD
2007
ACM
Auditing disclosure by relevance ranking
Numerous widely publicized cases of theft and misuse of private information underscore the need for audit technology to identify the sources of unauthorized disclosure. We present...
Rakesh Agrawal, Alexandre V. Evfimievski, Jerry Ki...