Page 235 - Applied Probability
P. 235

10. Molecular Phylogeny

                                                                                              is
                                                                   e
                                                                    θ 0 c i
                                                                         min{r,m−i} θ j 1 {c i =c i+j }
                                                                         j=1
                              so forth until a single array holding L c i
                              formed. Multiplication of this array against a i−1 (c i ,...,c min{i−1+r,m} ) can
                              be carried out simultaneously with addition over the index c i . When the
                                               is brought into play, its obvious symmetries can be ex-
                              array e
                                     θ j 1 {c i =c i+j }
                              ploited to reduce the overall computational burden. The computational
                              complexity of the forward algorithm scales linearly in m.  e  221
                                These technical details fail to specify how we distinguish between slow
                              and fast-evolving codon sites. One obvious choice is to again modulate
                              the rate of evolution through the acceptance probabilities. Slow evolution
                              can be distinguished from fast evolution by introducing a multiplicative
                              parameter η ∈ (0, 1) and replacing each acceptance probability ρ i by ηρ i .
                              For synonymous codon changes, it makes sense to retain the acceptance
                              probability of 1.
                                                               root

                                                             9


                                                         7

                                                                     8
                                                                           10
                                                    6
                                                             5

                                                1       2         3        4



                                              rabbit    rat   goat lemur  man  opossum
                                  FIGURE 10.8. Standard Tree for the Evolution of Six Mammalian Taxa


                              10.10 Illustration of the Codon and Rate Models

                              To illustrate the flexibility of the codon substitution and rate variation
                              models, we now apply them to six aligned β-hemoglobin genes from the
                              mammalian taxa: opossum, goat, rat, rabbit, monkey, and human [7]. The
                              globin gene superfamily is well understood, both structurally and evolution-
                              arily. The β-hemoglobin sequence used in this study stretches over 444 nu-
                              cleotides and represents 148 codons. In goats, the second and third codons
                              are missing; we set these to unobserved in the subsequent data analysis.
                              The β-hemoglobin secondary structure includes 11 α-helices encompassing
                              111 codons. These helices are conserved, but less so than the non-helical
   230   231   232   233   234   235   236   237   238   239   240