Page 241 -
P. 241

8.3 Organizational Mining                                       223

















            Fig. 8.5 A social network consists of nodes representing organizational entities and arcs repre-
            senting relationships. Both nodes and arcs can have weights indicated by “w = ...” and the size of
            the shape


            is a one-to-one correspondence between the resources found in the log and organi-
            zational entities (i.e., nodes). In Fig. 8.5, nodes x, y, and z could refer to persons.
            The nodes in a social network may also correspond to aggregate organizational en-
            tities such as roles, groups, and departments. The arcs in a social network corre-
            spond to relationships between such organizational entities. Arcs and nodes may
            have weights. The weight of an arc or node indicates its importance. For instance,
            node y is more important than x and z as is indicated by its size. The relationship
            between x and y is much stronger than the relationship between z and x as shown
            by the thickness of the arc. The interpretation of “importance” depends on the social
            network. Later, we will give some examples to illustrate the concept.
              Sometimes the term distance is used to refer to the inverse of the weight of an
            arc. An arc connecting two organizational entities has a high weight if the distance
            between both entities is small. If the distance from node x to node y is large, then
            the weight of the corresponding arc is small (or the arc is not present in the social
            network).
              A wide variety of metrics have been defined to analyze social networks and to
            characterize the role of individual nodes in such a diagram [122]. For example, if
            all other nodes are in short distance to a given node and all geodesic paths (i.e.,
            shortest paths in the graph) visit this node, then clearly the node is very central (like
            a spider in the web). There are different metrics for this intuitive notion of centrality.
            The Bavelas–Leavitt index of centrality is a well-known example that is based on
            the geodesic paths in the graph. Let i be an node and let D j,k be the geodesic
            distance from node j to node k. The Bavelas–Leavitt index of centrality is defined as

            BL(i) = (   D j,k )/(  D j,i + D i,k ). The index divides the sum of all geodesic
                      j,k       j,k
            distances by the sum of all geodesic distances from and to node i. Other related
            metrics are closeness (1 divided by the sum of all geodesic distances to a given
            node) and betweenness (a ratio based on the number of geodesic paths visiting a
            given node) [104, 122]. Recall that distance can be seen as the inverse of arc weight.
              Notions such as centrality analyze the position of one organizational entity, say a
            person, in the whole social network. There are also metrics making statements about
            the network as a whole, e.g., the degree of connectedness. Moreover, there are also
   236   237   238   239   240   241   242   243   244   245   246