Sciweavers

FGCS
2007

Distributed data mining in grid computing environments

13 years 11 months ago
Distributed data mining in grid computing environments
The computing-intensive data mining for inherently Internet-wide distributed data, referred to as Distributed Data Mining (DDM), calls for the support of a powerful Grid with an effective scheduling framework. DDM often shares the computing paradigm of local processing and global synthesizing. It involves every phase of Data Mining (DM) processes, which makes the workflow of DDM very complex and can be modelled only by a Directed Acyclic Graph (DAG) with multiple data entries. Motivated by the need for a practical solution of the Grid scheduling problem for the DDM workflow, this paper proposes a novel two-phase scheduling framework, including External Scheduling and Internal Scheduling, on a twolevel Grid architecture (InterGrid, IntraGrid). Currently a DM IntraGrid, named DMGCE (Data Mining Grid Computing Environment), has been developed with a dynamic scheduling framework for competitive DAGs in a heterogeneous computing environment. This system is implemented in an established M...
Ping Luo, Kevin Lü, Zhongzhi Shi, Qing He
Added 14 Dec 2010
Updated 14 Dec 2010
Type Journal
Year 2007
Where FGCS
Authors Ping Luo, Kevin Lü, Zhongzhi Shi, Qing He
Comments (0)