Page 159 - Macromolecular Crystallography
P. 159

148  MACROMOLECULAR CRYS TALLOGRAPHY

         (a)                                         probability distributions: (1) real space restraints
                                                     give distributions for modified structure factors,
                                                     while (2) the phasing experiments give partially
                                                     independent phase probability distributions. Com-
                                                     bining distributions is easy: we just multiply them,
                                                     provided we know they are independent. However,
                                                     here consecutive distributions are clearly not inde-
                                                     pendent and treating them as independent would
                                                     inevitably lead to an undesirable bias towards the
         (b)
                                                     very first map with which we started the Fourier
                                                     cycling. How do we deal with this situation? We
                                                     separate out the dependent component, and multiply
                                                     the independent components. In order to explain
                                                     how this is done in practice, we give a more
                                                     quantitative explanation of the reason why Fourier
                                                     cycling and phase recombination works, first for
                                                     solvent flattening and subsequently for NCS and
         (c)                                         histogram matching.

                                                     10.7 Why does Fourier cycling improve
                                                     phases in solvent flattening?
                                                     Before we can flatten the solvent, we need to know
                                                     where it is. One of the implementations to obtain a
                                                     good approximation of the solvent mask computes
                                                     the variance of the electron density within a small
        Figure 10.2 The procedure of convolution, represented graphically.
        (a) A one-dimensional centrosymmetric structure. (b) A Gaussian  sphere throughout the entire unit cell. Regions in
        distribution, which could potentially be an atomic shape function.  the unit cell where a low variance is found then are
        (c) The convolution of the function in (a) and (b).  considered to be solvent, whereas a high variance
                                                     indicates protein. Most density modification pro-
                                                     grams use a binary solvent mask, with one value
        weight or lower fall-off as the phase refinement  representing the protein region and the other value
        converges.                                   representing the solvent region. Some programs
          In practice, recombination of structure factors  have reported good results by extending this and
        involvesfirstweightingofthephasesofthemodified  using real valued numbers between 0 and 1, where
        structure factors in a resolution dependent fashion,  the value of the grid point indicates the probability
        according to their estimated accuracy or probability.  of being in a protein region (Terwilliger, 2003).
        Every phase also has an experimental probability  Now return to Eq. 3, which describes the process
        (determined by experimental phasing techniques  of solvent flattening. As a restraint, it can be written
        and/or molecular replacement). The two distribu-  down as follows:
        tions are combined by multiplication, and the new
        phase is calculated from this combined probability  ρ mod (x) = ρ init (x)g(x) + ρ solvent ˆ g(x)  (8)
        distribution. The measured associated structure fac-
                                                     where:
        tor amplitude is then scaled by the probability of the
        phase, and we have our set of recombined structure  g(x) is a mask function which is equal to one in the
        factors.                                      protein region and is zero in the solvent region.
          However, there is a problem with the phase rec-  ˆ g(x) is a mask function that is zero in the protein
        ombination approach. Essentially we’re combining  region and one in the solvent region.
   154   155   156   157   158   159   160   161   162   163   164