Page 402 - Numerical Methods for Chemical Engineering
P. 402

Selecting a prior for single-response data                          391



                  Although the posterior density above appears to be of a simple functional form, for a
                  general nonlinear model, S(θ) by (8.58) may depend in a very complicated manner upon θ.
                  Therefore, we approximate S(θ) in the local neighborhood of θ LS by a quadratic expansion
                  (for a linear model, this expansion is exact),
                                                            2
                                                         T
                                                1
                                 S(θ) − S(θ LS ) ≈ (θ − θ LS ) [∇ S(θ LS )](θ − θ LS )  (8.95)
                                                2
                    2
                  ∇ S(θ LS ) is the Hessian of S(θ), evaluated at θ LS . Following our numerical treatment of
                  nonlinear least squares, we define the matrix H(θ LS )as
                                                      2    1
                                            H(θ LS ) =∇   S(θ LS )                   (8.96)
                                                        2
                  with the elements
                                                               )            *
                                            N                      2
                                 T         	    [k]      [k]      ∂ f
                                      )
                     H ab (θ LS ) = (X X| θ LS ab −  y  − f x ; θ LS                 (8.97)

                                                                        [k]
                                           k=1                   ∂θ a ∂θ b x ;θ LS
                  The linearized design matrix is again given by (8.80). It is consistent with a quadratic
                  expansion of S(θ) to neglect the second contribution to H ab (θ LS ), so that
                                                T
                                              X X| θ LS  ≈ H(θ LS )                  (8.98)
                  Again, for a linear model, this approximation is exact. The quadratic expansion for S(θ),
                  using  1  2                T    , is then
                       2  ∇ S(θ LS ) = H(θ LS ) ≈ X X| θ LS
                                                        T   T
                                  S(θ) − S(θ LS ) ≈ (θ − θ LS ) [X X| θ LS ](θ − θ LS )  (8.99)
                  Thus, an approximate “conditional posterior” for θ given y and σ is
                                                                     1
                                         1           T  T
                        π(θ|y,σ) ∝ exp −     θ − θ LS [X X| θ LS ](θ − θ LS ) p(θ)  (8.100)
                                        2σ  2
                  The approximate “conditional likelihood function”
                                                                          1

                                                        T
                                              1             T
                         l approx (θ|y,σ) ∝ exp −  θ − θ LS  X X   θ − θ LS         (8.101)
                                             2σ 2              θ LS
                  is of the form of a multivariate normal distribution
                                                     1      T  −1

                                    p(x|µ, ) ∝ exp − (x − µ)   (x − µ)              (8.102)
                                                     2
                  with
                                                         2  T     −1
                                        µ = θ LS     = σ (X X| θ LS )               (8.103)
                  For a linear model, the design matrix X is completely specified at the time that we choose the
                                                            T
                  predictor values in each experiment, and thus so is X X. From one trial set of experiments
                  to the next, the only quantity that varies significantly is θ LS , and this quantity merely sets the
                  center of the distribution, but does not affect its shape. Therefore, the likelihood function
                  (8.101) is said to be translated by the data,or data-translated. For a linear model, where
                  X does not vary, this data-translation property is exact. For a nonlinear model, this data-
                  translation property is only approximate.
   397   398   399   400   401   402   403   404   405   406   407