Page 95 - Building Big Data Applications
P. 95
90 Building Big Data Applications
potentially even help us move into the discovery of the elusive Higgs boson particle. The
challenge is significant in the LEP that it has been built in a tunnel, reusing this tunnel
will require us to decommission the LEP and then build the detectors and colliders for
the LHC.
What the rebuild will provide us is an opportunity to align all technologies in the big
data platform layers to collect data that is generated by these mammoth colliders, which
will help us rebuild the collision process and visualize the impact repeatedly. The LEP
collider was having Oracle databases, C programs and Open Source data models and
platforms, Pascal, Fortran, and SAS but had to collect data as needed missing on
foundational areas sometimes, and there were issues in the need for time as collisions
occurred and people recorded as much as possible. To compute better with collection of
large volumes of data and analyze results on demand the data platforms added at LHC
included Hadoop, NoSQL, Spark, and Kubernetes. The next segment is to look at the
technologies and the data.
Big data platform and application
History of LHC at CERN can be summarized in these events:
1982: First studies for the LHC project
1983: Z0/W discovered at SPS proton antiproton collider (SppbarS)
1989: Start of LEP operation (Z/W boson factory)
1994: Approval of the LHC by the CERN Council
1996: Final decision to start the LHC construction
2000: Last year of LEP operation above 100 GeV
2002: LEP equipment removed
2003: Start of LHC installation
2005: Start of LHC hardware commissioning
2008: Start of (short) beam commissioning
2009: Repair, recommissioning, and beam commissioning
2011: Launch of LHC
As LHC was being designed at CERN, the data calculations were being computed for
each device and how much data is needed for use. CERN has approximately 3000
Members and 12,000 users. The Large Hadron Collider and four big Experiments: ATLAS,
CMS, LHCb, and ALICE generate data 24 7 365. CERN OpenLab is a publiceprivate
partnership, through which CERN collaborates with leading ICT companies and research
organizations. The Worldwide LHC Computing Grid (WLCG) is a global collaboration of
more than 170 institutions in 42 countries which provide resources to store, distribute,
and analyze the multiple PBs of LHC Data. Data at CERN 15 PB per month and >250 PB
today at CERN data center.