ABSTRACT
We describe a corpus of provenance traces that we have collected by executing 120 real world scientific workflows. The workflows are from two different workflow systems: Taverna [5] and Wings [3], and 12 different application domains (see Figure 1). Table 1 provides a summary of this PROV-corpus.
- Khalid Belhajjame, Oscar Corcho, Daniel Garijo, Jun Zhao, Paolo Missier, David Newman, Raul Palma, Sean Bechhofer, Esteban Garcia-Cuesta, Jose-Manuel Gomez-Perez, Graham Klyne, Kevin Page, Marco Roos, Jose Enrique Ruiz, Stian Soiland-Reyes, Lourdes Verdes-Montenegro, David De Roure, and Carole Goble. Workflow-centric research objects: First class citizens in scholarly discourse. In Proceedings of Sepublica2012, pages 1--12, Hersonissos, 2012.Google Scholar
- D. Garijo and Y. Gil. A new approach for publishing workflows: Abstractions, standards, and linked data. In Proceedings of the 6th workshop on Workflows in support of large-scale science, pages 47--56, Seattle, 2011. ACM. Google ScholarDigital Library
- Y. Gil, V. Ratnakar, J. Kim, et al. Wings: Intelligent workflow-based design of computational experiments. IEEE Intelligent Systems, 26(1):62--72, 2011. Google ScholarDigital Library
- T. Lebo, S. Sahoo, D. McGuinness, K. Belhajjame, J. Cheney, D. Corsar, D. Garijo, S. Soiland-Reyes, S. Zednik, and J. Zhao. Prov-o: The prov ontology. Technical report, 2012.Google Scholar
- P. Missier, S. Soiland-Reyes, S. Owen, W. Tan, A. Nenadic, I. Dunlop, A. Williams, T. Oinn, and C. Goble. Taverna, reloaded. In M Gertz, T Hey, and B Ludaescher, editors, Procs. SSDBM 2010, Heidelberg, Germany, 2010. Google ScholarDigital Library
Index Terms
- A workflow PROV-corpus based on taverna and wings
Recommendations
Taverna: lessons in creating a workflow environment for the life sciences: Research Articles
Workflow in Grid SystemsLife sciences research is based on individuals, often with diverse skills, assembled into research groups. These groups use their specialist expertise to address scientific problems. The in silico experiments undertaken by these research groups can be ...
Taverna: a tool for the composition and enactment of bioinformatics workflows
Motivation: In silico experiments in bioinformatics involve the co-ordinated use of computational tools and information repositories. A growing number of these resources are being made available with programmatic access in the form of Web services. ...
Metadata Management in the Taverna Workflow System
CCGRID '08: Proceedings of the 2008 Eighth IEEE International Symposium on Cluster Computing and the GridThere seems to be a general consensus on the crucial role metadata can play for enhancing the functionalities of scientific workflows systems, e.g., workflow and service discovery, composition and provenance browsing, among others. However, in most ...
Comments