Article ID Journal Published Year Pages File Type
508778 Computers in Industry 2015 14 Pages PDF
Abstract

•We introduce a logical representation of event logs using first-order predicates.•We use logical decision trees to classify process instances according to duration.•Possible causes of delay are induced by the classifier and appear in the decision tree.•The approach is applied in two case studies involving real-world event logs.

In real-world business processes it is often difficult to explain why some process instances take longer than usual to complete. With process mining techniques, it is possible to do an a posteriori analysis of a large number of process instances and detect the occurrence of delays, but discovering the actual cause of such delays is a different problem. For example, it may be the case that when a certain activity is performed or a certain user (or combination of users) participates in the process, the process suffers a delay. In this work, we show that it is possible to retrieve possible causes of delay based on the information recorded in an event log. The approach consists in translating the event log into a logical representation, and then applying decision tree induction to classify process instances according to duration. Besides splitting those instances into several subsets, each path in the tree yields a rule that explains why a given subset has an average duration that is higher or lower than other subsets of instances. The approach is applied in two case studies involving real-world event logs, where it succeeds in discovering meaningful causes of delay, some of which having been pointed out by domain experts.

Related Topics
Physical Sciences and Engineering Computer Science Computer Science Applications
Authors
, ,