Page 73 - Hardware Implementation of Finite-Field Arithmetic
P. 73

56     Cha pte r  T w o


                    n      k      FFs   LUTs    Mult   Slices  Period
                    16     8      23    80      1      44      8.1
                    24     8      32    88      1      50      8.1
                    64     32     74    319     4      172     20.8
                    128    64     138   897     16     518     26.4
                    256    128    268   2,757   64     1,630   43.6
                    512    256                         ∞

                  Note: ∞ slices means that the circuit does not fit within the device.
                                                      k
                  TABLE 2.5  Cost and Period of Reducers mod 2 − a


                  In order to get an estimation of the computation time, 1,000 pairs
               (x, m) have been generated for several values of k, and the correspond-
               ing numbers of cycles have been observed by simulation. In this case
               the  Start/Done communication-protocol cycles have been included.
               Then, the minimum (MinCycles), maximum (MaxCycles), and aver-
               age (AverCycles) numbers of cycles have been obtained. By multiply-
               ing the average number of cycles by the period, an estimation of the
               average computation time (AverTime) can be computed (Table 2.6).



                k     Period  MinCycles  MaxCycles  AverCycles  AverTime
                8     8.1    6          15          10.33      83.7
                32    20.8   7          12          11.24      233.8
                64    26.4   7          11          8.92       235.5
                128   43.6   7          11          8.94       389.6

               TABLE 2.6  Average Delay of Reducers mod 2 − a
                                                 k


                  If m is a constant, then a = 2  − m is also a constant and it is no
                                           k
               longer necessary to use multipliers for computing  qa. Specific
               combinational circuits can be used. As an example, a 384-bit to 192-bit
                         64
               mod (2  − 2  − 1) reducer has been implemented. The implementation
                     192
               results are the following:

          FFs  LUTs    Slices  Period  MinCycles  MaxCycles  AverCycles  AverTime
          686  38,560  19,978  50.6  7       11        8.96      453.6
   68   69   70   71   72   73   74   75   76   77   78