Sciweavers

971 search results - page 143 / 195
» Mining Several Data Bases with an Ensemble of Classifiers
DMIN
2009
13 years 7 months ago
Efficient Record Linkage using a Double Embedding Scheme
Record linkage is the problem of identifying similar records across different data sources. The similarity between two records is defined based on domain-specific similarity functi...
Noha Adly
CIKM
2008
Springer
13 years 11 months ago
SHOPSMART: product recommendations through technical specifications and user reviews
This paper describes a new method for providing recommendations tailored to a user's preferences using text mining techniques and online technical specifications of products....
Alexander Yates, James Joseph, Ana-Maria Popescu, ...
CAISE
2007
Springer
14 years 3 months ago
Declarative XML Data Cleaning with XClean
Data cleaning is the process of correcting anomalies in a data source that may, for instance, be due to typographical errors or duplicate representations of an entity. It is a cruc...
Melanie Weis, Ioana Manolescu
PKDD
2007
Springer
14 years 3 months ago
Privacy Preserving Market Basket Data Analysis
Randomized Response techniques have been empirically investigated in privacy preserving association rule mining. However, previous research on privacy preserving market basket data...
Ling Guo, Songtao Guo, Xintao Wu
KDD
2002
ACM
14 years 9 months ago
SyMP: an efficient clustering approach to identify clusters of arbitrary shapes in large data sets
We propose a new clustering algorithm, called SyMP, which is based on synchronization of pulse-coupled oscillators. SyMP represents each data point by an Integrate-and-Fire oscill...
Hichem Frigui