Page 380 - DSP Integrated Circuits
P. 380

8.4 Ideal DSP Architectures                                          365


        faces are provided. The on-chip memory consists of two independent memories,
        each containing a 512 x 32-bit data RAM, a 512 x 32-bit data ROM, and a 512 x 32-
        bit program RAM. The two ROMs are used for two-quadrant sine and cosine wave
        tables. The DSP96001 is available in a 163-pin pin-grid array package.
            The DSP96002 is a dual-port version of the DSP96001. The two expansion
        ports effectively double the off-chip bus bandwidth [18]. The size of the on-chip
        program RAM has been increased from 512 to 1024 words. The DSP96002 can per-
        form up to 10 operations simultaneously. The peak performance is 200 MOPS and
        50 MFLOP @ a 40-MHz clock frequency. Typical power consumption is less than 1
        W. The DSP96002 is available in a 223-pin pin-grid array package. Typical perfor-
        mance of the DSP96001 is as follows: An N-tap FIR filter requires N+7 clock cycles
        while N second-order sections in direct form II require 5N+5 clock cycles. A 91-tap
        FIR filter can be executed in 4.9 us and a sixth-order filter in 1.0 us. A 1024-point
        FFT, without bit-reversal, takes 1.04 ms, while the DSP96002 requires less than
        0.8 ms.


        8.4 IDEAL DSP ARCHITECTURES

        At the architectural level, the main interest is the overall organization of the sys-
        tem which is described in terms of the high-level primitives: processors, memories,
        communication channels, and control [13]. These primitives are characterized by
        parameters such as memory capacity, access time, word length, transmission
        rates, and processor execution time. At this level, the designer is also interested in
        the global data flow between the high-level primitives in the system. The objective
        is to estimate system performance and identify potential bottlenecks in data
        transmission within the system. At the end of the architectural design phase, tim-
        ing relations between the hardware components are of more interest. Also test pat-
        tern generation and fault simulation are a concern at this stage of the system
        design. Figure 8.9 illustrates the main architectural components.
            The fact that most
        DSP algorithms lack
        data-dependent branch-
        ing operations facili-
        tates perfect scheduling
        of the operations. The
        scheduling step is fol-
        lowed by the resource
        allocation and assign-
        ment steps, as dis-
        cussed in Chapter 7. For
        simplicity we assume            Figure 8.9 Architectural components
        that the schedule for the
        subsystem is fixed. In
        practice, the schedule may have to be iterated for different sets of basic operations,
        processors, and architectures. The design decisions taken in these steps determine
        the major parameters of the architecture. For example, the operation schedule
        determines the number of processors, their type, and performance. The variable
        assignment determines the required amount of memory and its organization. Both
   375   376   377   378   379   380   381   382   383   384   385