Sciweavers

926 search results - page 102 / 186
» Large Scale Data Mining: Challenges and Responses
DATASCIENCE
2007
15 years 6 months ago
Open Data for Global Science
The digital revolution has transformed the accumulation of properly curated public research data into an essential upstream resource whose value increases with use. The potential...
Paul F. Uhlir, Peter Schröder
CSB
2005
IEEE
15 years 11 months ago
A Robust Meta-classification Strategy for Cancer Diagnosis from Gene Expression Data
One of the major challenges in cancer diagnosis from microarray data is to develop robust classification models which are independent of the analysis techniques used and can combi...
Gabriela Alexe, Gyan Bhanot, Babu Venkataraghavan,...
DOLAP
2004
ACM
15 years 11 months ago
Experimental evidence on partitioning in parallel data warehouses
Parallelism can be used for major performance improvement in large data warehouses (DW) with performance and scalability challenges. A simple low-cost shared-nothing architecture ...
Pedro Furtado
OSDI
2008
ACM
15 years 8 months ago
DryadLINQ: A System for General-Purpose Distributed Data-Parallel Computing Using a High-Level Language
DryadLINQ is a system and a set of language extensions that enable a new programming model for large scale distributed computing. It generalizes previous execution environments su...
Yuan Yu, Michael Isard, Dennis Fetterly, Mihai Bud...
IPPS
2010
IEEE
15 years 4 months ago
Large-scale multi-dimensional document clustering on GPU clusters
Document clustering plays an important role in data mining systems. Recently, a flocking-based document clustering algorithm has been proposed to solve the problem through simulat...
Yongpeng Zhang, Frank Mueller, Xiaohui Cui, Thomas...