Sciweavers

249 search results - page 43 / 50
» MALEF: Framework for distributed machine learning and data m...
Sort
View
DMKD
2003
ACM
110views Data Mining» more  DMKD 2003»
13 years 12 months ago
Weave amino acid sequences for protein secondary structure prediction
Given a known protein sequence, predicting its secondary structure can help understand its three-dimensional (tertiary) structure, i.e., the folding. In this paper, we present an ...
Xiaochun Yang, Bin Wang
ITNG
2010
IEEE
13 years 11 months ago
A Fast and Stable Incremental Clustering Algorithm
— Clustering is a pivotal building block in many data mining applications and in machine learning in general. Most clustering algorithms in the literature pertain to off-line (or...
Steven Young, Itamar Arel, Thomas P. Karnowski, De...
SIGMOD
2009
ACM
177views Database» more  SIGMOD 2009»
14 years 7 months ago
Exploiting context analysis for combining multiple entity resolution systems
Entity Resolution (ER) is an important real world problem that has attracted significant research interest over the past few years. It deals with determining which object descript...
Zhaoqi Chen, Dmitri V. Kalashnikov, Sharad Mehrotr...
SDM
2003
SIAM
129views Data Mining» more  SDM 2003»
13 years 8 months ago
Approximate Query Answering by Model Averaging
In earlier work we have introduced and explored a variety of different probabilistic models for the problem of answering selectivity queries posed to large sparse binary data set...
Dmitry Pavlov, Padhraic Smyth
AUSDM
2006
Springer
112views Data Mining» more  AUSDM 2006»
13 years 10 months ago
Accuracy Estimation With Clustered Dataset
If the dataset available to machine learning results from cluster sampling (e.g. patients from a sample of hospital wards), the usual cross-validation error rate estimate can lead...
Ricco Rakotomalala, Jean-Hugues Chauchat, Fran&cce...