Article code | Journal code | Publication year | English article | Full-text version |
---|---|---|---|---|
424616 | 685612 | 2013 | 12-page PDF | Free download |

• Usage of provenance to characterize workflow-based activity on an e-infrastructure.
• Analysis of usage and error patterns at workflow and task levels using provenance.
• Use of the explanatory power of classification algorithms to find component attributes related to failure.
• Description of a generic approach to workflow failure analysis.
• Analysis of 8 months of workflow activity.
Grid computing and workflow management systems emerged as solutions to the challenges arising from processing and storing the sheer volume of data generated by modern simulations and data acquisition devices. Workflow management systems usually document the process of workflow execution either as structured provenance information or as log files. Provenance is recognized as an important feature of workflow management systems; however, there are still few reports on its usage in practical cases. In this paper we present the provenance system implemented in our platform, and then use the information captured by this system during 8 months of platform operation to analyze platform usage and to perform multilevel error pattern analysis. We exploit the large amount of structured data, using the explanatory potential of statistical approaches, to find properties of workflows, jobs and resources that are related to workflow failure. Such an analysis enables us to characterize workflow executions on the infrastructure and to understand workflow failures. The approach is generic and can be applied to other e-infrastructures to gain insight into operational incidents.
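
As a rough illustration of how structured provenance records might be mined for attributes related to failure, the sketch below ranks attributes by information gain, the splitting criterion used by many decision-tree classifiers. The abstract does not say which classification algorithms the authors used, so this is only one plausible instance of the idea. The record fields (`site`, `app`, `failed`), their values, and all function names are hypothetical and are not taken from the paper's data.

```python
from collections import Counter, defaultdict
from math import log2

# Hypothetical task-level provenance records. Field names and values are
# invented for illustration; they do not come from the paper's dataset.
RECORDS = [
    {"site": "A", "app": "align", "failed": False},
    {"site": "A", "app": "segment", "failed": False},
    {"site": "A", "app": "align", "failed": False},
    {"site": "A", "app": "segment", "failed": False},
    {"site": "B", "app": "align", "failed": True},
    {"site": "B", "app": "segment", "failed": True},
    {"site": "B", "app": "align", "failed": True},
    {"site": "B", "app": "segment", "failed": True},
]


def entropy(labels):
    """Shannon entropy (in bits) of a list of outcome labels."""
    total = len(labels)
    counts = Counter(labels)
    return -sum((n / total) * log2(n / total) for n in counts.values())


def information_gain(records, attribute, label="failed"):
    """Reduction in outcome entropy obtained by splitting on one attribute."""
    base = entropy([r[label] for r in records])
    groups = defaultdict(list)
    for r in records:
        groups[r[attribute]].append(r[label])
    remainder = sum(
        (len(g) / len(records)) * entropy(g) for g in groups.values()
    )
    return base - remainder


def rank_attributes(records, attributes, label="failed"):
    """Order attributes from most to least informative about the outcome."""
    return sorted(
        attributes,
        key=lambda a: information_gain(records, a, label),
        reverse=True,
    )


def failure_rate_by_value(records, attribute, label="failed"):
    """Fraction of failed records for each value of the given attribute."""
    totals = Counter(r[attribute] for r in records)
    failures = Counter(r[attribute] for r in records if r[label])
    return {value: failures[value] / totals[value] for value in totals}


if __name__ == "__main__":
    for name in rank_attributes(RECORDS, ["site", "app"]):
        print(name, round(information_gain(RECORDS, name), 3))
    print(failure_rate_by_value(RECORDS, "site"))
```

In this toy data the execution site fully separates failed from successful tasks while the application carries no information, so `site` ranks first. On real provenance data the same ranking would only point to attributes worth inspecting; it would not establish the cause of a failure.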
Journal: Future Generation Computer Systems - Volume 29, Issue 8, October 2013, Pages 1931–1942