Page 175 - Applied Probability
P. 175

8. The Polygenic Model
                              160
                                    (b) Omitting irrelevant constants, the Q(γ | γ n ) function of the EM
                                        algorithm is
                                                Q(γ | γ n )
                                                  1  r−1     2   1     −1        t  −1

                                            = −       {m ln σ +  2  [tr(Γ k  Υ nk )+ ν Γ k  ν nk ]}
                                                                                 nk
                                                             k
                                                  2             σ
                                                   k=1           k
                                                  m    2    1                    t
                                                −   ln σ −    [tr(Υ nr )+ (ν nr − Aµ) (ν nr − Aµ)],
                                                       r
                                                  2        2σ r 2
                                        where ν nk is the conditional mean vector
                                                                      2
                                                       =1 {k=r} Aµ + σ Γ k Ω −1 [y − Aµ]
                                                   ν k
                                                                      k
                                        and Υ nk is the conditional covariance matrix
                                                               2
                                                                      2
                                                          = σ Γ k − σ Γ k Ω −1 2
                                                      Υ k                    σ Γ k
                                                               k      k       k
                                            k
                                        of X given Y = y evaluated at the current iterate γ n . (Hint:
                                                                                      k
                                        Consider the concatenated random normal vector  X  and use
                                                                                     Y
                                        fact (e) proved in the text.)
                                    (c) The solution of the M step is
                                                        t
                                                              t
                                           µ n+1  =(A A)   −1 A ν nr
                                                      1     −1        t  −1
                                           2
                                          σ       =     [tr(Γ  Υ nk )+ ν Γ  ν nk ], 1 ≤ k ≤ r − 1
                                           n+1,k            k         nk  k
                                                      m
                                                      1                       t
                                           2
                                          σ       =     [tr(Υ nr )+ (ν nr − Aµ n+1 ) (ν nr − Aµ n+1 )].
                                           n+1,r
                                                      m
                                        In the above update, µ n+1 is the next iterate of the mean vector
                                        µ and not a component of µ.
                                 6. Continuing Problem 5, show that σ 2  ≥ 0 holds for all k and n if
                                                                   nk
                                   σ 2  ≥ 0 holds initially for all k.If all σ 2  > 0, show that all σ 2  > 0.
                                     1k                              1k                  nk
                                 7. Continuing Problem 5, suppose that one or more of the covariance
                                   matrices Γ k is singular. For instance, in modeling common household
                                   effects, the corresponding missing data X k  can be represented as
                                     k
                                                k
                                   X = σ k M k W , where M k is a constant m×s matrix having exactly
                                   one entry 1 and the remaining entries 0 in each row, and where W k
                                   has s independent, standard normal components. Each component of
                                      k
                                   W corresponds to a different household; each row of M k chooses the
                                   correct household for a given person. It follows from this description
                                   that
                                                         2              k
                                                        σ Γ k  = Var(X )
                                                         k
                                                                   2
                                                                         t
                                                              = σ M k M .
                                                                   k
                                                                         k
   170   171   172   173   174   175   176   177   178   179   180