Sciweavers

SIGSOFT
2008
ACM

Experience in using a process language to define scientific workflow and generate dataset provenance

15 years 1 months ago
Experience in using a process language to define scientific workflow and generate dataset provenance
This paper describes our experiences in exploring the applicability of software engineering approaches to scientific data management problems. Specifically, this paper describes how process definition languages can be used to expedite production of scientific datasets as well as to generate documentation of their provenance. Our approach uses a process definition language that incorporates powerful semantics to encode scientific processes in the form of a Process Definition Graph (PDG). The paper describes how execution of the PDG-defined process can generate Dataset Derivation Graphs (DDGs), metadata that document how the scientific process developed each of its product datasets. The paper uses an example to show that scientific processes may be complex and to illustrate why some of the more powerful semantic features of the process definition language are useful in supporting clarity and conciseness in representing such processes. This work is similar in goals to work generally refe...
Leon J. Osterweil, Lori A. Clarke, Aaron M. Elliso
Added 20 Nov 2009
Updated 20 Nov 2009
Type Conference
Year 2008
Where SIGSOFT
Authors Leon J. Osterweil, Lori A. Clarke, Aaron M. Ellison, Rodion M. Podorozhny, Alexander E. Wise, Emery R. Boose, Julian L. Hadley
Comments (0)