Sciweavers

249 search results - page 18 / 50
» MALEF: Framework for distributed machine learning and data m...
Sort
View
KDD
2003
ACM
214views Data Mining» more  KDD 2003»
14 years 7 months ago
Adaptive duplicate detection using learnable string similarity measures
The problem of identifying approximately duplicate records in databases is an essential step for data cleaning and data integration processes. Most existing approaches have relied...
Mikhail Bilenko, Raymond J. Mooney
ICML
2007
IEEE
14 years 8 months ago
Multi-task learning for sequential data via iHMMs and the nested Dirichlet process
A new hierarchical nonparametric Bayesian model is proposed for the problem of multitask learning (MTL) with sequential data. Sequential data are typically modeled with a hidden M...
Kai Ni, Lawrence Carin, David B. Dunson
PKDD
2005
Springer
109views Data Mining» more  PKDD 2005»
14 years 26 days ago
An Imbalanced Data Rule Learner
Imbalanced data learning has recently begun to receive much attention from research and industrial communities as traditional machine learners no longer give satisfactory results. ...
Canh Hao Nguyen, Tu Bao Ho
ECML
2007
Springer
14 years 1 months ago
Scale-Space Based Weak Regressors for Boosting
Boosting is a simple yet powerful modeling technique that is used in many machine learning and data mining related applications. In this paper, we propose a novel scale-space based...
Jin Hyeong Park, Chandan K. Reddy
ISCIS
2009
Springer
13 years 12 months ago
PopulusLog: People information database
—Information about individuals on publicly available web sites stands as a valuable, yet unorganized, data source. Turning such an enormous data source into a “database” is h...
Ali Cakmak, Mustafa Kirac, Gultekin Özsoyoglu