Sciweavers

1443 search results - page 197 / 289
» Similarity Measures for Categorical Data: A Comparative Eval...
Sort
View
GRID
2005
Springer
14 years 2 months ago
Toward seamless grid data access: design and implementation of GridFTP on .NET
— To date, only Linux-/UNIX-based hosts have been participants in the Grid vision for seamless data access, because the necessary Grid data access protocols have not been impleme...
Jun Feng, Lingling Cui, Glenn S. Wasson, Marty Hum...
BMCBI
2004
180views more  BMCBI 2004»
13 years 8 months ago
Noise filtering and nonparametric analysis of microarray data underscores discriminating markers of oral, prostate, lung, ovaria
Background: A major goal of cancer research is to identify discrete biomarkers that specifically characterize a given malignancy. These markers are useful in diagnosis, may identi...
Virginie M. Aris, Michael J. Cody, Jeff Cheng, Jam...
SIGMOD
2009
ACM
136views Database» more  SIGMOD 2009»
14 years 9 months ago
A comparison of approaches to large-scale data analysis
There is currently considerable enthusiasm around the MapReduce (MR) paradigm for large-scale data analysis [17]. Although the basic control flow of this framework has existed in ...
Andrew Pavlo, Erik Paulson, Alexander Rasin, Danie...
SIGMOD
2010
ACM
211views Database» more  SIGMOD 2010»
14 years 1 months ago
ERACER: a database approach for statistical inference and data cleaning
Real-world databases often contain syntactic and semantic errors, in spite of integrity constraints and other safety measures incorporated into modern DBMSs. We present ERACER, an...
Chris Mayfield, Jennifer Neville, Sunil Prabhakar
ICML
2007
IEEE
14 years 9 months ago
Revisiting probabilistic models for clustering with pair-wise constraints
We revisit recently proposed algorithms for probabilistic clustering with pair-wise constraints between data points. We evaluate and compare existing techniques in terms of robust...
Blaine Nelson, Ira Cohen