Data Management on Grid Filesystem for Data-Intensive Computing



In parallel computing environments such as HPC clusters and the Grid, data-intensive applications incur large overhead costs because file accesses concentrate on common nodes. To avoid this problem in traditional distributed filesystems, users have to distribute file access manually. However, such a solution is difficult for users in a Grid environment. We propose a data management mechanism for data-intensive computing on a Grid filesystem. Our technique improves file access performance by automatically scheduling file access and data management on the filesystem. The filesystem is based on dynamically configured node groups that correspond to the network topology. Using this configuration, it monitors file access to detect concentration, creates file replicas, and schedules their placement and access. We applied the proposed technique to Gfarm, a filesystem that scales to the Grid. We emulate real application workloa...
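The monitor, replicate, and schedule loop described in the abstract can be illustrated with a toy model. The sketch below is not the authors' implementation and does not use any Gfarm API. The class name `ReplicaScheduler`, the fixed access-count threshold, and the "same group first, then least loaded" placement rule are all assumptions made purely for illustration.

```python
from collections import Counter, defaultdict


class ReplicaScheduler:
    """Toy model (hypothetical): replicate a file once it has been accessed
    `threshold` times, and serve each request from the closest, least-loaded copy."""

    def __init__(self, node_groups, threshold):
        # node_groups: group name -> list of node names (assumed static here;
        # the paper describes groups that are configured dynamically).
        self.node_groups = node_groups
        self.threshold = threshold
        self.replicas = defaultdict(set)  # file name -> nodes holding a copy
        self.load = Counter()             # node -> number of accesses served
        self.hits = Counter()             # file name -> accesses since last replication

    def add_file(self, name, node):
        self.replicas[name].add(node)

    def _group_of(self, node):
        for group, nodes in self.node_groups.items():
            if node in nodes:
                return group
        raise KeyError(node)

    def _rank(self, node, preferred_group):
        # Lower is better: same group first, then lower load, then name for determinism.
        return (self._group_of(node) != preferred_group, self.load[node], node)

    def access(self, name, client):
        """Record one access and return the node chosen to serve it."""
        holders = self.replicas.get(name)
        if not holders:
            raise KeyError(name)
        group = self._group_of(client)
        server = min(holders, key=lambda n: self._rank(n, group))
        self.load[server] += 1
        self.hits[name] += 1
        if self.hits[name] >= self.threshold:
            # Concentration detected: place one more replica near the client.
            self._replicate(name, group)
            self.hits[name] = 0
        return server

    def _replicate(self, name, preferred_group):
        candidates = [n for nodes in self.node_groups.values() for n in nodes
                      if n not in self.replicas[name]]
        if candidates:
            target = min(candidates, key=lambda n: self._rank(n, preferred_group))
            self.replicas[name].add(target)


# Hypothetical two-site topology; node and file names are made up.
groups = {"siteA": ["a1", "a2"], "siteB": ["b1", "b2"]}
sched = ReplicaScheduler(groups, threshold=3)
sched.add_file("data.bin", "a1")
served = [sched.access("data.bin", "b1") for _ in range(3)]
```

In this sketch the first three requests from `siteB` are all served by the single copy on `a1`; the third request crosses the threshold, so a replica is placed inside `siteB` and later requests from that site are routed to it. A real system would measure load over time windows and account for transfer cost, which this model ignores.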

Hitoshi Sato, Satoshi Matsuoka


Data Management | Internet Technology | SAINT 2007 | File Access | File Access Performance


Post Info

Added	04 Jun 2010
Updated	04 Jun 2010
Type	Conference
Year	2007
Where	SAINT
Authors	Hitoshi Sato, Satoshi Matsuoka
