Lookup NU author(s): Dr Paolo Missier
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0).
Provenance generated by different workflow systems is generally ex- pressed using different formats. This is not an issue when scientists analyze provenance graphs in isolation, or when they use the same workflow system. However, when analyzing heterogeneous provenance graphs from multiple systems poses a challenge. To address this problem we adopt ProvONE as an integration model, and show how different provenance databases can be con- verted to a global ProvONE schema. Scientists can then query this integrated database, exploring and linking provenance across several different workflows that may represent different implementations of the same experiment. To illus- trate the feasibility of our approach, we developed conceptual mappings be- tween the provenance databases of two workflow systems (e-Science Central and SciCumulus). We provide cartridges that implement these mappings and generate an integrated provenance database expressed as Prolog facts. To demonstrate its usage, we have developed Prolog rules that enable scientists to query the integrated database.
Author(s): Oliveira W, Missier P, Ocana K, Oliveira D, Braganholo V
Editor(s): Marta Mattoso and Boris Glavic
Publication type: Conference Proceedings (inc. Abstract)
Publication status: Published
Conference Name: Provenance and Annotation of Data and Processes: 6th International Provenance and Annotation Workshop (IPAW 2016)
Year of Conference: 2016
Print publication date: 06/06/2016
Online publication date: 04/06/2016
Acceptance date: 13/04/2016
Date deposited: 03/05/2016
Library holdings: Search Newcastle University Library for this item
Series Title: Lecture Notes in Computer Science