Page 144 -

P. 144

126 5 Process Discovery: An Introduction

3 2
Fig. 5.1 WF-net N 1 discovered for L 1 =[ a,b,c,d , a,c,b,d , a,e,d ]

ments. To make things more concrete, we deﬁne the target to be a Petri net model.
Moreover, we use a simple event log as input (cf. Deﬁnition 4.4). A simple event
∗
log L is a multi-set of traces over some set of activities A , i.e., L ∈ B(A ).For
example,
3 2

L 1 = a,b,c,d , a,c,b,d , a,e,d
L 1 is a simple log describing the history of six cases. The goal is now to discover a
Petri net that can “replay” event log L 1 . Ideally, the Petri net is a sound WF-net as
deﬁned in Sect. 2.2.3. Based on these choices, we reformulate the process discovery
problem and make it more concrete.
Deﬁnition 5.2 (Speciﬁc process discovery problem) A process discovery algorithm
is a function γ that maps a log L ∈ B(A ) onto a marked Petri net γ(L) = (N,M).
∗
Ideally, N is a sound WF-net and all traces in L correspond to possible ﬁring se-
quences of (N,M).

Function γ deﬁnes a so-called “Play-in” technique as described in Chap. 1. Based
on L 1 , a process discovery algorithm γ could discover the WF-net shown in Fig. 5.1,
i.e., γ(L 1 ) = (N 1 ,[start]). Each trace in L 1 corresponds to a possible ﬁring se-
quence of WF-net N 1 shown in Fig. 5.1. Therefore, it is easy to see that the WF-net
can indeed replay all traces in the event log. In fact, each of the three possible ﬁring
sequences of WF-net N 1 appears in L 1 .
Let us now consider another event log:
3 4 2
L 2 = a,b,c,d , a,c,b,d , a,b,c,e,f,b,c,d , a,b,c,e,f,c,b,d ,
2
a,c,b,e,f,b,c,d , a,c,b,e,f,b,c,e,f,c,b,d
L 2 is a simple event log consisting of 13 cases represented by 6 different traces.
Based on event log L 2 ,some γ could discover WF-net N 2 showninFig. 5.2.This
WF-net can indeed replay all traces in the log. However, not all ﬁring sequences of
N 2 correspond to traces in L 2 . For example, the ﬁring sequence a,c,b,e,f,c,b,d
does not appear in L 2 . In fact, there are inﬁnitely many ﬁring sequences because of
the loop construct in N 2 . Clearly, these cannot all appear in the event log. Therefore,
Deﬁnition 5.2 does not require all ﬁring sequences of (N,M) to be traces in L.

139 140 141 142 143 144 145 146 147 148 149