Abstract. Workflow is an important approach for the specification and management of complex processing tasks. This approach is especially powerful for utilizing distributed service...
The Hadoop filesystem is a large scale distributed filesystem used to manage and quickly process extremely large data sets. We want to utilize Hadoop to assist with dataintensive ...
1 -- As the scale and complexity of data-driven computational science grows, so grows the burden on the scientists and students in managing the data products used and generated dur...
Yiming Sun, Scott Jensen, Sangmi Lee Pallickara, B...
This paper describes XPeer, a zero-administration system for sharing and querying XML data. The system allows users to share XML data without significant human intervention, and t...
Carlo Sartiani, Paolo Manghi, Giorgio Ghelli, Giov...
Grid computing is about allocating distributed collections of resources including computers, storage systems, networks and instruments to form a coherent system devoted to a “vir...
Dennis Gannon, Beth Plale, Marcus Christie, Liang ...