Page 262 -
P. 262

Chapter 6 Foundations of Business Intelligence: Databases and Information Management 261


                      INTERACTIVE SESSION: TECHNOLOGY


               BIG DATA, BIG REWARDS

               Today’s companies are dealing with an avalanche of   records, and 33 billion public records. The system’s
               data from social media, search, and sensors as well as   search capabilities allow the NYPD to quickly obtain
               from traditional sources. In 2012, the amount of digital   data from any of these data sources. Information on
               information generated is expected to reach 988 exa-  criminals, such as a suspect’s photo with details of
               bytes, which is the equivalent to a stack of books from   past offenses or addresses with maps, can be visual-
               the sun to the planet Pluto and back. Making sense of   ized in seconds on a video wall or instantly relayed
               “big data” has become one of the primary challenges   to officers at a crime scene.
               for corporations of all shapes and sizes, but it also rep-  Other organizations are using the data to go
               resents new opportunities. How are companies cur-    green, or, in the case of Vestas, to go even greener.
               rently taking advantage of big data opportunities?   Headquartered in Denmark,Vestas is the world’s
                  The British Library had to adapt to handle big data.   largest wind energy company, with over 43,000
               Every year visitors to the British Library Web site   wind turbines across 66 countries. Location data are
               perform over 6 billion searches, and the library is also   important to Vestas so that it can accurately place its
               responsible for preserving British Web sites that no   turbines for optimal wind power generation. Areas
               longer exist but need to be preserved for  historical   without enough wind will not generate the necessary
               purposes, such as the Web sites for past politicians.   power, but areas with too much wind may damage
               Traditional data management methods proved inade-    the turbines. Vestas relies on location-based data to
               quate to archive millions of these Web pages, and leg-  determine the best spots to install their turbines.
               acy analytics tools couldn’t extract useful knowledge   To gather data on prospective turbine locations,
               from such quantities of data. So the British Library   Vestas’s wind library combines data from global
               partnered with IBM to implement a big data  solution   weather systems along with data from existing
               to these challenges. IBM BigSheets is an insight       turbines. The company’s previous wind library
               engine that helps extract, annotate, and visually ana-    provided information in a grid pattern, with each
               lyze vast amounts of unstructured Web data, deliver-  grid measuring 27 x 27 kilometers (17 x 17 miles).
               ing the results via a Web browser. For example, users   Vestas engineers were able to bring the resolution
               can see search results in a pie chart. IBM BigSheets   down to about 10 x 10 meters (32 x 32 feet) to estab-
               is built atop the Hadoop framework, so it can process   lish the exact wind flow pattern at a particular loca-
               large amounts of data quickly and efficiently.       tion. To further increase the accuracy of its turbine
                  State and federal law enforcement agencies are      placement models, Vestas needed to shrink the grid
               analyzing big data to discover hidden patterns in    area even more, and this required 10 times as much
               criminal activity such as correlations between time,   data as the previous system and a more powerful
               opportunity, and organizations, or non-obvious       data management platform.
               relationships (see Chapter 4) between individuals       The company implemented a solution consisting
               and criminal organizations that would be difficult to   of IBM InfoSphere BigInsights software running on
               uncover in smaller data sets. Criminals and criminal   a high-performance IBM System x iDataPlex server.
               organizations are increasingly using the Internet to   (InfoSphere BigInsights is a set of software tools for
               coordinate and perpetrate their crimes. New tools    big data analysis and visualization, and is powered by
               allow agencies to analyze data from a wide array of   Apache Hadoop.) Using these technologies, Vestas in
               sources and apply analytics to predict future crime   creased the size of its wind library and is able  manage
               patterns. This means that law enforcement can        and analyze location and weather data with models
               become more proactive in its efforts to fight crime   that are much more powerful and  precise.
               and stop it before it occurs.                           Vestas’s wind library currently stores 2.8  petabytes
                  In New York City, the Real Time Crime Center      of data and includes approximately 178 parameters,
               data warehouse contains millions of data points on   such as barometric pressure, humidity, wind direc-
               city crime and criminals. IBM and the New York City   tion, temperature, wind velocity, and other company
               Police Department (NYPD) worked together to create   historical data. Vestas plans to add global deforesta-
               the warehouse, which contains data on over 120 mil-  tion metrics, satellite images, geospatial data, and
               lion criminal complaints, 31 million national crime   data on phases of the moon and tides.







   MIS_13_Ch_06 Global.indd   261                                                                             1/17/2013   2:27:44 PM
   257   258   259   260   261   262   263   264   265   266   267