Abstract—Modern scientific experiments often involve multiple storage and computing platforms, software tools, and analysis scripts. The resulting heterogeneous environments make data management operations challenging; the significant number of events and the absence of data integration make it difficult to track data provenance, manage sophisticated analysis processes, and recover from unexpected situations. Current approaches often require costly human intervention and are inherently error-prone. The difficulties inherent in managing and manipulating such large and highly distributed datasets also limit automated sharing and collaboration. We study a real-world e-Science application involving terabytes of data that uses three different analysis and storage platforms and a number of applications and analysis processes. We demonstrate that using a specialized data life cycle and programming model—Active Data—we can easily implement global progress monitoring, and sharing; rec...
Anthony Simonet, Kyle Chard, Gilles Fedak, Ian T. Foster