Page 241 -

P. 241

Chapter 6 The Cloud
240
0GGF 5ECNCDKNKV[

2GCM WUCIG QXGT )DRU

8QNWOG QH &CVC &GNKXGTGF )DRU

Figure 6-1 #/ #/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/ 2/

Example Video Banner #/ #/ 'ZCORNG 8KFGQ DCPPGT CF EWUVQOGT
Ad Customer

ad may appeal more to one nationality than to another. Will 70 percent of those visits arise in the
United States and the rest in Europe? Or will there be millions from Japan? Or Australia? Given this
uncertainty, how does the car manufacturer prepare its computing infrastructure? The car manu-
facturer knows that if it cannot provide very short response time (say, a fraction of a second), it
will lose the benefit of an incredibly expensive ad. On the other hand, if the ad is a flop, preprovi-
sioning of thousands of servers will add to the accumulation of wasted money.
Figure 6-1 shows an example of this situation, based on a real case supported by Amazon.
com’s CloudFront. Suppose Figure 6-1 shows the processing on the car manufacturer’s Web site
during the Academy Awards. Throughout the day, the car manufacturer is delivering less than
10 Gbps of its content to users. However, as soon as its ad runs (2 PM in the Hawaii-Aleutian
time zone where the data was collected), demand increases seven-fold and stays high for half an
hour. After the announcement of Best Picture, when its ad runs again, demand again increases to
30 and 40 Gpbs for an hour and then returns to its base level.
Without an increase in servers, response time will be 3 or 5 seconds or more, which is far too
long to maintain the attention of a charged-up Academy Awards viewer. However, the car manu-
facturer has contracted with its cloud vendor to add servers, wherever needed worldwide, to keep
response time to less than 0.5 seconds. Using cloud technology, the cloud vendor will program-
matically increase its servers to keep response time below the 0.5-second threshold. As demand
falls after the ad runs a second time, it will release the excess servers and reallocate them at the
end of the awards.
In this way, the car manufacturer need not build or contract for infrastructure that supports
maximum demand. Had it done so, the vast majority of its servers would have been idle for most
of the evening. And, as you’ll learn, the cloud vendor can provision servers worldwide using the
cloud; if a good portion of the excess demand is in Singapore, for example, it can provision extra
servers in Asia and reduce wait time due to global transmission delays.

Pooled
The second key in the definition of cloud is pooled. Cloud resources are pooled because many
different organizations use the same physical hardware; they share that hardware through
virtualization. Cloud vendors dynamically allocate virtual machines to physical hardware as cus-
tomer needs increase or decrease. Thus, servers that advertisers need for the Academy Awards
can be reallocated to CPA firms that need them later that same day, to textbook publishers who
need them for online student activity on Monday, or to the hotel industry that needs them later
the next week.
An easy way to understand the essence of this development is to consider electrical power.
In the very earliest days of electric power generation, organizations operated their own genera-
tors to create power for their company’s needs. Over time, as the power grid expanded, it became

236 237 238 239 240 241 242 243 244 245 246