Page 281 -
P. 281

10.1  Business Intelligence?                                    263

            data, often referred to as “slicing and dicing”, helps to see particular patterns. By
            “slicing” the OLAP cube, the analyst can zoom into a selected slice of the overall
            data, e.g., only looking at sales of the iPod nano. The term “dicing” refers to the re-
            ordering of dimensions, e.g., one can look at the sales of different products within a
            given region or, alternatively, view the sales per region for a given product. One can
            look at an OLAP cube as a dice. Using this metaphor, dicing corresponds to “throw-
            ing the dice” to look at the data from another angle. The results can be viewed in
            tabular form or visualized using various charts. BI products support a broad range
            of charts, e.g., pie charts, bar charts, radar plots, scatter plots, speedometers, Pareto
            charts, box plots, and scorecards. Any of these charts can be part of some predefined
            dashboard or report.
              The mainstream BI products from vendors such as IBM, Oracle, SAP, and Mi-
            crosoft do not support process mining. All of the systems mentioned earlier are
            data-centric and are unaware of the processes the data refers to. For example, BI
            products can analyze an OLAP cube with sales data as shown in Fig. 10.1, but do
            this without considering the underlying process. The sales events are immediately
            aggregated without trying to distill the underlying process. BI products do not show
            the end-to-end process and cannot zoom into selected parts of this process.
              Another problem of mainstream BI products is that the focus is on fancy-looking
            dashboards and rather simple reports, rather than a deeper analysis of the data col-
            lected. This is surprising as the “I” in BI refers to “intelligence”. Unfortunately, the
            business unintelligence market is dominated by large vendors that focus on monitor-
            ing and reporting rather than analytics. Gartner [41] and Forrester [40] report a shift
            from “measurement” to “analysis, forecasting, predictive analytics, optimization”.
            However, no clear definitions of such capabilities are provided and most vendors
            tend to interpret these terms in a rather deceptive manner. Data mining or statisti-
            cal analysis are often added as an afterthought. For instance, IBM acquired SPSS to
            add some “intelligence” to their IBM Cognos Business Intelligence offering. TIBCO
            Spotfire Analytics uses the statistical programming environment called S+ (a close
            relative of the well-known R open-source language for statistical computing and vi-
            sualization). The open-source BI products of Jaspersoft and Pentaho can connect to
            open-source data mining tools such as WEKA (Waikato Environment for Knowledge
            Analysis, weka.wikispaces.com) and R (www.r-project.org).


              Pentaho: An Open-Source BI Suite
              To illustrate the functionality of mainstream BI products, we briefly discuss
              one of the leading open-source BI products. The Pentaho BI Suite can be
              downloaded from www.pentaho.com and consists of software to:
              • Extract, transform, and load data from different sources (CSV files,
                databases, Google Analytics, RSS, etc.)
              • Design reports and dashboards
              • Generate reports
              • Visualize dashboards showing tabular data and a variety of charts
              • View OLAP cubes in an interactive manner
   276   277   278   279   280   281   282   283   284   285   286