Page 195 -
P. 195

6.4 Region-Based Mining                                         177

            Fig. 6.15 Transition system
            TS L 1 ,l 5 state ()  derived from L 1
            using
                      1
                         k
            l 5 state (σ,k) = tl (hd (σ))





            labeled with the last activity executed. For the initial state, this results in the empty
            sequence. TS L 1 ,l state ()  allows for the traces in the event log, but also traces such as
                         5
                                                      3
             a,b,c,b,c,d . Another example is l state (σ,k) = hd (tl |σ|−k (σ)), i.e., the state is
                                          6
            determined by the next three events.
              Thus far, we only considered a simple event log as input. Real-life event logs
            contain much more information as was shown in Chap. 4 (cf. Definition 4.3 and the
            XES format). Information about resources and data can also be taken into account
            when constructing a transition system. This information can be used to identify
            states and to label transitions. For example, states may encode whether the customer
            being served is a gold or silver customer. Transitions can be labeled with resource
            names rather than activity names. See [111] for a systematic treatment of the topic.
              A transition system defines a “low-level” process model. Unfortunately, such
            models cannot express higher level constructs and suffer from the “state explosion”
            problem. As indicated before, a simple process with 10 parallel activities already
            results in a transition system with 2 10  = 1024 states and 10 × 2 10−1  = 5120 tran-
            sitions. Fortunately, state-based regions can be used to synthesize a more compact
            model from such transition systems.



            6.4.2 Process Discovery Using State-Based Regions


            After transforming an event log into a low-level transition system, we can synthesize
            a Petri net from it. In turn, this Petri net can be used to construct a process model in
            some other high-level notation (e.g., BPMN, UML activity diagrams, YAWL, and
            EPCs). The challenge is to fold a large transition system into a smaller Petri net
            by detecting concurrency. The core idea is to discover regions that correspond to
            places. A region is a set of states such that all activities in the transition system
            “agree” on the region.

            Definition 6.3 (State-based region) Let TS = (S,A,T ) be a transition system and
            R ⊆ S be a subset of states. R is a region if for each activity a ∈ A one of the
            following conditions hold:

            1. All transitions (s 1 ,a,s 2 ) ∈ T enter R, i.e., s 1 /∈ R and s 2 ∈ R.
            2. All transitions (s 1 ,a,s 2 ) ∈ T exit R, i.e., s 1 ∈ R and s 2 /∈ R.
            3. All transitions (s 1 ,a,s 2 ) ∈ T do not cross R, i.e., s 1 ,s 2 ∈ R or s 1 ,s 2 /∈ R.
   190   191   192   193   194   195   196   197   198   199   200