Promoting performance and separation of concerns for data mining applications on the grid


Grid computing brought the promise of making high-performance computing cheaper and more easily available than traditional supercomputing platforms. This promise was very well received by the data mining (DM) community, as DM applications typically process very large datasets and are thus very resource intensive. However, since the Grid is very dynamic and parallel data mining is prone to load imbalance, obtaining good data mining performance on the Grid is hard. It typically requires the scheduler to understand the inner workings of the application, which brings two related problems. First, good Grid schedulers tend to be highly specialized for the application they target. Second, changing the application may require changing the scheduler, which may be especially challenging when there is no clear separation between the application and the scheduler code. We propose and evaluate a knowledge-based approach that provides abstractions to the DM developer and optimizes at runtime the DM applic...

Dm Application | FGCS 2007 | Grid | Traditional Supercomputing Platforms |

Added	19 Dec 2010
Updated	19 Dec 2010
Type	Journal
Year	2007
Where	FGCS
Authors	Vasco Furtado, Francisco Flávio de Souza, Walfredo Cirne
