Sciweavers

1702 search results - page 272 / 341
» Using Nondeterminism to Design Efficient Deterministic Algor...
Sort
View
KDD
2009
ACM
198views Data Mining» more  KDD 2009»
14 years 9 months ago
Pervasive parallelism in data mining: dataflow solution to co-clustering large and sparse Netflix data
All Netflix Prize algorithms proposed so far are prohibitively costly for large-scale production systems. In this paper, we describe an efficient dataflow implementation of a coll...
Srivatsava Daruru, Nena M. Marin, Matt Walker, Joy...
KDD
2004
ACM
151views Data Mining» more  KDD 2004»
14 years 9 months ago
Feature selection in scientific applications
Numerous applications of data mining to scientific data involve the induction of a classification model. In many cases, the collection of data is not performed with this task in m...
Erick Cantú-Paz, Shawn Newsam, Chandrika Ka...
MIR
2003
ACM
391views Multimedia» more  MIR 2003»
14 years 2 months ago
Generic sign board detection in images
Sign board detection is important for such computer vision applications as video surveillance and content based visual information retrieval. Previous researches on this topic foc...
Hua Shen, Xiaoou Tang
HPDC
1998
IEEE
14 years 1 months ago
Matchmaking: Distributed Resource Management for High Throughput Computing
Conventional resource management systems use a system model to describe resources and a centralized scheduler to control their allocation. We argue that this paradigm does not ada...
Rajesh Raman, Miron Livny, Marvin H. Solomon
DOLAP
2008
ACM
13 years 10 months ago
Data mining-based fragmentation of XML data warehouses
With the multiplication of XML data sources, many XML data warehouse models have been proposed to handle data heterogeneity and complexity in a way relational data warehouses fail...
Hadj Mahboubi, Jérôme Darmont