Page 147 - Macromolecular Crystallography
P. 147

136  MACROMOLECULAR CRYS TALLOGRAPHY

        indicationthatasolution(s)exists. Confirmationthat  direct-methods-based substructure determination
        this is true for Trial 5 in this example can be obtained  process terminated once a solution has been found,
        by inspecting a trace of its R MIN value as a func-  and protein phasing initiated. Since the ranges of
        tion of refinement cycle (Fig. 9.4). Solutions usually  figure-of-merit values for solutions are not the same
        show an abrupt decrease in value over a few cycles  for all structures and data sets, a careful analysis is
        followed by stability at the lower value. Further  required to avoid false solutions and to minimize
        confirmation can be obtained by looking for corrob-  the number of missed solutions. In automatic mode,
        oration from the other figures of merit as illustrated  BnP looks for a trial having R MIN and R CRYST val-
        in Table 9.5 where low R MIN values are correlated  ues significantly less than their respective means. In
        with low R CRYST values and high CC values.  this context, ‘significant’ means differing by more
          For high-throughput operation, solutions need  than three standard deviations. The standard devi-
        to be recognized quickly and automatically, the  ations are computed based on the figure-of-merit
                                                     values for a minimum of five trials which have been
                                                     preliminarily classified as non-solutions because
                                                     their R CRYST values are greater than 0.32. As a
           20
                                                     result, BnP always processes at least six trials. As
           16                                        shown by the statistics in Table 9.6, Trial 5 in the
          No. of trials  12 8                        1JC4 example is a solution that can be recognized
                                                     automatically.


           4                                         9.5 Choosing correct sites
           0                                         For each job involving an N-site substructure, the
              0.17  0.19  0.21  0.23  0.25  0.27  0.29  0.31
                                                     SnB component of BnP provides an output file of
                             R MIN
                                                     1.5N peak positions for the best trial (based on its
        Figure 9.3 A histogram of the final R MIN values for 100 trial  final R MIN value) sorted in descending order accord-
        structures for 1JC4 was made by dividing the range of the values  ing to the corresponding electron density. It is then
        (0.168–0.316) into 15 intervals (buckets) and counting the number  necessary to decide which, and how many, of these
        of trials whose R MIN values fell into each bucket. The 29 trials in the
        two left buckets with the lowest R MIN values are solutions, and the  peaks correspond to actual atoms. The first N peaks
        trials in the buckets at the right are non-solutions.  have the highest probability of being correct, and in



               (a)                                  (b)
                  0.47                                 0.37
                             Trial 5 (a solution)                Trial 6 (a non-solution)
                                                       0.35
                  0.37
                 R MIN                               R MIN  0.33

                  0.27
                                                       0.31


                  0.17                                 0.29
                     0   10   20   30  40   50  60        0   10   20   30  40   50  60
                           Shake-and-bake cycle                 Shake-and-bake cycle
        Figure 9.4 Plots of the minimal-function value over 56 Shake-and-Bake cycles for (a) a solution and (b) a non-solution. Trial 5 illustrates the
        characteristic features of a solution – a small initial decrease in R MIN value followed by a plateau, a sharp decrease over a few cycles, and a
        second plateau at a lower value. The first plateau may be shortened significantly if convergence to a solution occurs quickly.
   142   143   144   145   146   147   148   149   150   151   152