Page 87 - Applied statistics and probability for engineers
P. 87

3











                                                                    Discrete Random



                                                                    Variables and


                                                                    Probability


                                                                    Distributions





                                                                    A redundant array of independent disks (RAID) uses multiple
                     Chapter Outline                                physical disk drives as one logical unit in a computer system.
                                                                    The array can increase performance and robustness to a disk
                     3-1 Discrete Random Variables                  failure. Data copies can be written simultaneously to multiple
                                                                    drives (known as mirroring) to provide immediate backup and
                     3-2  Probability Distributions and             the ability to recover from failures but with less storage capac-
                         Probability Mass Functions                 ity than would otherwise be available. Alternatively, to increase
                                                                    performance, the data can be distributed among multiple disks
                     3-3  Cumulative Distribution Functions
                                                                    with only a fraction of the data on each one (known as striping).
                     3-4  Mean and Variance of a Discrete           But a failure to even a single disk can lead to loss of data. An
                         Random Variable                            intermediate design is to distribute the source data along with
                                                                    additional data (known as parity data) across multiple disks.
                     3-5  Discrete Uniform Distribution             With the parity data, the source data can be recovered even with
                                                                    disk failures. In particular, a RAID 5 design uses striping and
                     3-6 Binomial Distribution                      parity to be able to recover the source data if one disk in the
                                                                    array fails, and a RAID 6 design allows for data recovery even
                     3-7  Geometric and Negative Binomial           if two disks fail. Disk failures due to hardware malfunction are
                         Distributions
                                                                    often assumed to be independent with constant probability. With
                     3-8 Hypergeometric Distribution                a large number of disks in an array, the risk of data loss and the
                                                                    appropriate array design to meet the system performance, avail-
                     3-9 Poisson Distribution                       ability, and cost criteria are important. The number of failed
                                                                    drives can be modeled as a discrete random variable, and the
                                                                    risk of data loss in a redundant system is only one example of
                                                                    the use of the topics in this chapter.









                                                                                                                  65
   82   83   84   85   86   87   88   89   90   91   92