Sciweavers

363 search results - page 4 / 73
» Probabilistic Data Generation for Deduplication and Data Lin...
Sort
View
AUSDM
2007
Springer
102views Data Mining» more  AUSDM 2007»
13 years 11 months ago
A Two-Step Classification Approach to Unsupervised Record Linkage
Linking or matching databases is becoming increasingly important in many data mining projects, as linked data can contain information that is not available otherwise, or that woul...
Peter Christen
RECOMB
2003
Springer
14 years 7 months ago
Optimizing exact genetic linkage computations
Genetic linkage analysis is a challenging application which requires Bayesian networks consisting of thousands of vertices. Consequently, computing the likelihood of data, which i...
Dan Geiger, Maáyan Fishelson
PAKDD
2009
ACM
225views Data Mining» more  PAKDD 2009»
14 years 6 hour ago
Accurate Synthetic Generation of Realistic Personal Information
A large proportion of the massive amounts of data that are being collected by many organisations today is about people, and often contains identifying information like names, addre...
Peter Christen, Agus Pudjijono
ICDM
2008
IEEE
104views Data Mining» more  ICDM 2008»
14 years 1 months ago
A Generative Probabilistic Model for Multi-label Classification
Hongning Wang, Minlie Huang, Xiaoyan Zhu
BMCBI
2007
153views more  BMCBI 2007»
13 years 7 months ago
Estimating genealogies from linked marker data: a Bayesian approach
Background: Answers to several fundamental questions in statistical genetics would ideally require knowledge of the ancestral pedigree and of the gene flow therein. A few examples...
Dario Gasbarra, Matti Pirinen, Mikko J. Sillanp&au...