Sciweavers

CCGRID
2010
IEEE

File-Access Characteristics of Data-Intensive Workflow Applications

13 years 9 months ago
File-Access Characteristics of Data-Intensive Workflow Applications
This paper studies five real-world data intensive workflow applications in the fields of natural language processing, astronomy image analysis, and web data analysis. Data intensive workflows are increasingly becoming important applications for cluster and Grid environments. They open new challenges to various components of workflow execution environments including job dispatchers, schedulers, file systems, and file staging tools. Their impacts on real workloads are largely unknown. Understanding characteristics of real-world workflow applications is a required step to promote research in this area. To this end, we analyse real-world workflow applications focusing on their file access patterns and summarize their implications to schedulers and file system/staging designs.
Takeshi Shibata, SungJun Choi, Kenjiro Taura
Added 28 Feb 2011
Updated 28 Feb 2011
Type Journal
Year 2010
Where CCGRID
Authors Takeshi Shibata, SungJun Choi, Kenjiro Taura
Comments (0)