Page 341 - DSP Integrated Circuits
P. 341

326                                             Chapter 7 DSP System Design

































                   Figure 7.55 ASAP schedule of the inner loop




            Obviously, this memory access schedule is
        unacceptable since there are four concurrent read
        or write operations. This schedule therefore
        requires four memories. Fortunately, the memory
        accesses are not evenly distributed within the
        folded schedule. We can therefore reschedule the
        processes so that the memory accesses are distrib-
        uted more evenly in time and still meet the memory
        constraint. This can be done by introducing extra
        slack time (shimming delay), as shown in Figure
        7.57. The corresponding folded schedule is shown in
        Figure 7.58.
            The increased latency will increase the storage
        requirement. The Address must be stored for either
        15 memory cycles or only four cycles if the address
        is recomputed. However, this requires extra hard-
        ware. The WP processes (WP) must be stored for two
        memory cycles or be scheduled to be computed two
        cycles later.
            The inputs and outputs of the butterfly opera-
        tions must also be stored as shown in Figure 7.59.
        Thus, only four registers, one for each complex data
        word to and from a butterfly, are required in the
        corresponding architecture.                     Figure 7.56 Folded schedule
   336   337   338   339   340   341   342   343   344   345   346