# HSP43881 January 1994 **Digital Filter** #### Features - Eight Filter Cells - 0 to 30MHz Sample Rate - · 8-Bit Coefficients and Signal Data - · 26-Bit Accumulator Per Stage - Filter Lengths Over 1000 Taps - Expandable Coefficient Size, Data Size and Filter Length - Decimation by 2, 3 or 4 #### **Applications** - . 1-D and 2-D FIR Filters - · Radar/Sonar - Adaptive Filters - Echo Cancellation - Complex Multiply-Add - Sample Rate Converters #### Ordering Information | PART NUMBER | TEMPERATURE<br>RANGE | PACKAGE | |---------------|----------------------|--------------| | HSP43881JC-20 | 0°C to +70°C | 84 Lead PLCC | | HSP43881JC-25 | 0°C to +70°C | 84 Lead PLCC | | HSP43881JC-30 | 0°C to +70°C | 84 Lead PLCC | | HSP43881GC-20 | 0°C to +70°C | 85 Lead PGA | | HSP43881GC-25 | 0°C to +70°C | 85 Lead PGA | | HSP43881GC-30 | 0°C to +70°C | 85 Lead PGA | #### Description The HSP43881 is a video speed Digital Filter (DF) designed to efficiently implement vector operations such as FIR digital filters. It is comprised of eight filter cells cascaded internally and a shift and add output stage, all in a single integrated circuit. Each filter cell contains a 8 x 8-bit multiplier, three decimation registers and a 26-bit accumulator. The output stage contains an additional 26-bit accumulator which can add the contents of any filter cell accumulator to the output stage accumulator shifted right by 8-bits. The HSP43881 has a maximum sample rate of 30MHz. The effective multiply accumulate (mac) rate is 240MHz. The HSP43881 DF can be configured to process expanded coefficient and word sizes. Multiple DFs can be cascaded for larger filter lengths without degrading the sample rate or a single DF can process larger filter lengths at less than 30MHz with multiple passes. The architecture permits processing filter lengths of over 1000 taps with the guarantee of no overflows. In practice, most filter coefficients are less than 1.0, making even larger filter lengths possible. The DF provides for 8-bit unsigned or two's complement arithmetic, independently selectable for coefficients and signal data. Each DF filter cell contains three resampling or decimation registers which permit output sample rate reduction at rates of $^{1}/_{2}$ , $^{1}/_{3}$ or $^{1}/_{4}$ the input sample rate. These registers also provide the capability to perform 2-D operations such as matrix multiplication and NxN spatial correlations/convolutions for image processing applications. # HSP43881 # Pin Description | SYMBOL | PIN<br>NUMBER | TYPE | NAME AND FUNCTION | |-----------------|---------------------------------------------|------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | VCC | A3, A10, B1,<br>D11, F10, J1,<br>K4, L7 | | +5V Power Supply Input | | V <sub>SS</sub> | A1, A11, E2, F1,<br>E11, H11, K3,<br>K6, L9 | | Power Supply Ground Input | | CLK | G3 | ı | The CLK input provides the DF system sample clock. The maximum clock frequency is is 30MHz. | | DINO-7 | A5-8, B6-7,<br>C6-7 | - | These eight inputs are the data sample input bus. Eight bit data samples are synchronously loaded through these pins to the X register of each filter cell simultaneously. The DIENB signal enables loading, which is synchronous on the rising edge of the clock signal. | | TCS | B5 | 1 | The TCS input determines the number system interpretation of the data input samples on pins DIN0-7 as follows: TCS = Low → Unsigned Arithmetic | | | | | TCS = High → Two's Complement Arithmetic | | | | | The TCS signal is synchronously loaded into the X register in the same way as the DINO-7 inputs. | | DIENB | C5 | I | A low on this enables the data sample input bus (DINO-7) to all the filter cells. A rising edge of the CLK signal occurring while DIENB is low will load the X register of every filter cell with the 8 bit value present on DINO-7. A high on this input forces all the bits of the data sample input bus to zero; a rising CLK edge when DIENB is high will load the X register of every filter cell with all zeros. This signal is latched inside the DF, delaying its effect by one clock internal to the DF. Therefore, it must be low during the clock cycle immediately preceding presentation of the desired data on the DINO-7 inputs. Detailed operation is shown in later timing diagrams. | | CINO-7 | B9-11, C10-11,<br>D10, E9-10 | 1 | These eight inputs are used to input the 8 bit coefficients. The coefficients are synchronously synchronously loaded into the C register of filter CELL 0 if a rising edge of CLK occurs while CIENB is low. The CIENB signal is delayed by one clock as discussed below. | | TCCI | A9 | ı | The TCCI input determines the number system interpretation of the coefficient inputs on pins CINO-7 as follows: | | | | | TCCI = LOW → Unsigned Arithmetic | | | | | TCCI = HIGH → Two's Complement Arithmetic | | | | | The TCCI signal is synchronously loaded into the C register in the same way as the CINO-7 inputs. | | CIENB | B8 | I | A low on this input enable the C register of every filter cell and the D registers (decimation) of every filter cell according to the state of the DCMO-1 inputs. A rising edge of the CLK signal occurring while CIENB is low will load the C register and appropriate D registers with the coefficient data present at their inputs. This privides the mechanism for shifting the coefficients from cell to cell through the device. A high on this input freezes the contents of the C register and the D registers, ignoring the CLK signal. This signal is latched and delayed by one clock internal to the DF. Therefore, it must be low during the clock cycle immediately preceding presentation of the desired coefficient on the CINO-7 inputs. Detailed operation is shown in the Timing Diagrams section. | | COUTO-7 | B2, C1-2, D1-2<br>E1, E3, F2 | 0 | These eight three-state outputs are used to output the 8 bit coefficients from filter cell 7. These outputs are enabled by the COENB signal low. These outputs may be tied to the CINO-7 inputs of the same DF to recirculate the coefficients, or they may be tied to the CINO-7 inputs of another DF to cascade DFs for longer filter lengths. | | TCCO | В3 | 0 | The TCCO three-state output determines the number system representation of the coefficients output on COUTO-7. It tracks the TCCI signal to this same DF, It should be tied to the TCCI input of the next DF in a cascade of DFs for increased filter lengths. This signal is enabled by COENB low. | | COENB | A2 | 1 | A low on the COENB input enables the COUTO-7 and the TCCO output. A high on this input places all these outputs in their high impedance state. | # Pin Description (Continued) | SYMBOL | PIN<br>NUMBER | TYPE | NAME FUNCTION | |-----------|--------------------------------------------------------|------|--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------| | DCM0-1 | G2, L1 | 1 | These two inputs determine the use of the internal decimation registers as follows: DCM1 | | | | | 1 0 Two decimation registers are used 1 1 Three decimation registers are used The coefficients pass from cell to cell at a rate determined by the number of decimation registers used. When no decimation registers are used, coefficients move from cell to cell on each clock. When one decimation register is used, coefficients move from cell to cell on every other clock, etc. These signals are latched and delayed by one clock internal to the DF. | | SUM0-25 | J2, J5-8,<br>J10, K2,<br>K5-11,<br>L2-6, L8,<br>L10-11 | 0 | These 26 three-state outputs are used to output the results of the internal filter cell computations. Individual filter cell results or the result of the shift-and-add output stage can be output. If an individual filter cell result is to be output, the ADRO-2 signals select the filter cell result. The SHADD signal determines whether the selected filter cell result or the output stage adder result is output. The signals SENBH and SENBL enable the most significant and least significant bits of the SUMO-25 result, respectively. Both SENBH and SENBL may be enabled simultaneously if the system has a 26 bit or larger bus. However, individual enables are provided to facilitate use with a 16 bit bus. | | SENBH | K1 | i | A low on this input enables result bits SUM16-25. A high on this input places these bits in their high impedance state. | | SENBL | E11 | I | A low on this input enables result bits SUM0-15. A high on this input places these bits in their high impedance state. | | ADR0-2 | G1, H1-2 | 1 | These inputs select the one cell whose accumulator will be read through the output bus (SUMO-25) or added to the output stage accumulator. They also determine which accumulator will be cleared when ERASE is low. For selection of which accumulator to read through the output bus (SUMO-25) or which to add of the output stage accumulator, these inputs are latched in the DF and delayed by one clock internal to the device. If the ADRO-2 lines remain at the same address for more than one clock, the output at SUMO-25 will not change to reflect any subsequent accumulator updates in the addressed cell. Only the result available during the first clock, when ADRO-1 selects the cell, will be output. This does not hinder normal operation since the ADRO-1 lines are changed sequentially. This feature facilitates the interface with slow memories where the output is required to be fixed for more than one clock. | | SHADD | F3 | I | The SHADD input controls the activation of the shift-and-add operation in the output stage. This signal is latched in the DF and delayed by one clock internal to the device. A detailed explanation is given in the DF Output Stage section. | | RESET | A4 | 1 | A low on this input synchronously clears all the internal registers, except the cell accumulators. It can be used with ERASE to also clear all the accumulators simultaneously. This signal is latched in the DF and delayed by one clock internal to the DF. | | ERASE | B4 | ì | A low on this input synchronously clears the cell accumulator selected by the ADR0-1 signals. If RESET is also low simultaneously, all cell accumulators are cleared. | | ALIGN PIN | C3 | | Used for aligning chip in socket or printed circuit board. Must be left as a no connect in circuit. | #### Functional Description The Digital Filter Processor (DF) is composed of eight filter cells cascaded together and an output stage for combining or selecting filter cell outputs (See Block Diagram). Each filter cell contains a multiplier-accumulator and several registers (Figure 1). Each 8-bit coefficient is multiplied by a 8- bit data sample, with the result added to the 26-bit accumulator contents. The coefficient output of each cell is cascaded to the coefficient input of the next cell to its right. #### DF Filter Cell A 8-bit coefficient (CINO-7) enters each cell through the C register on the left and exits the cell on the right as signals COUTO-7. With no decimation, the coefficient moves directly from the C register to the output, and is valid on the clock following its entrance. When decimation is selected the coefficient exit is delayed by 1, 2 or 3 clocks by passing through one or more decimation registers (D1, D2 or D3). The combination of D registers through which the coefficient passes is determined by the state of DCM0 and DCM1. The output signals (COUTO-7) are connected to the CINO-7 inputs of the next cell to its right. The COENB input signal enables the COUTO-7 outputs of the right most cell to the COUTO-7 pins of the device. The C and D registers are enabled for loading by $\overline{\text{CIENB}}$ . Loading is synchronous with CLK when $\overline{\text{CIENB}}$ is low. Note that $\overline{\text{CIENB}}$ is latched internally. It enables the register for loading after the next CLK following the onset of $\overline{\text{CIENB}}$ low. Actual loading occurs on the second CLK following the onset of $\overline{\text{CIENB}}$ low. Therefore $\overline{\text{CIENB}}$ must be low during the clock cycle immediately preceding presentation of the coefficient on the CINO-7 inputs. In most basic FIR operations, $\overline{\text{CIENB}}$ will be low throughout the process, so this latching and delay sequence is only important during the initialization phase. When $\overline{\text{CIENB}}$ is high, the coefficients are frozen. These registers are cleared synchronously under control of RESET, which is latched and delayed exactly like CIENB. The output of the C register (CO-8) is one input to 8 x 8 multiplier. The other input to the 8x8 multiplier comes from the output of the X register. This register is loaded with a data sample from the device input signals DIN0-7 discussed above. The X register is enabled for loading by DIENB. Loading is synchronous with CLK when DIENB is low. Note that DIENB is latched internally. It enables the register for loading after the next CLK following the onset of DIENB low, Actual loading occurs on the second CLK following the onset of DIENB low; therefore, DIENB must be low during the clock cycle immediately preceding presentation of the data sample on the DIN0-7 inputs. In most basic FIR operations, DIENB will be low throughout the process, so this latching and delay sequence is only important during the initialization phase. When DIENB is high, the X register is loaded with all zeros. The multiplier is pipelined and is modeled as a multiplier core followed by two pipeline registers, MREGO and MREG1 (Figure 1). The multiplier output is sign extended and Input as one operand of the 26-bit adder. The other adder operand is the output of the 26-bit accumulator. The adder output is loaded synchronously into both the accumulator and the TREG. The TREG loading is disabled by the cell select signal, CELLn, where n is the cell number. The cell select is decoded from the ADRO-2 signals to generate the TREG load enable. The cell select is inverted and applied as the load enable to the TREG. Operation is such that the TREG is loaded whenever the cell is not selected. Therefore, TREG is loaded every clock except the clock following cell selection. The purpose of the TREG is to hold the result of a sum-of-products calculation during the clock when the accumulator is cleared to prepare for the next sum-of-products calculation. This allows continuous accumulation without wasting clocks. The accumulator is loaded with the adder output every clock unless it is cleared. It is cleared synchronously in two ways. When RESET and ERASE are both low, the accumulator is cleared along with all other registers on the device. Since ERASE and RESET are latched and delayed one clock internally, clearing occurs on the second CLK following the onset of both ERASE and RESET low. The second accumulator clearing mechanism clears a single accumulator in a selected cell. The cell select signal, CELLn, decoded from ADR0-2 and the ERASE signal enable clearing of the accumulator on the next CLK. The ERASE and RESET signals clear the DF internal registers and states as follows: | ERASE | RESET | CLEARING EFFECT | |-------|-------|--------------------------------------------------------------------------------------------------------------| | 1 | 1 | No clearing occurs, internal state remains same | | 1 | 0 | RESET only active, all registers except accumulators are cleared, including the internal pipeline registers. | | 0 | 1 | ERASE only active, the accumulator whose address is given by the ADR0-2 inputs is cleared. | | 0 | 0 | Both RESET and ERASE active, all accumulators as well as all other registers are cleared. | ### The DF Output Stage The output stage consists of a 26-bit adder, 26-bit register, feedback multiplexer from the register to the adder, an output multiplexer and a 26-bit three-state driver stage (Figure 2). The 26-bit output adder can add any filter cell accumulator result to the 18 most significant bits of the output buffer. This result is stored back in the output buffer. This operation takes place in one clock period. The eight LSBs of the output buffer are lost. The filter cell accumulator is selected by the ADRO-2 inputs. The 18 MSBs of the output buffer actually pass through the zero mux on their way to the output adder input. The zero mux is controlled by the SHADD input signal and selects either the output buffer 18 MSBs or all zeros for the adder input. A low on the SHADD input selects zero. A high on the SHADD input selects the output buffer MSBs, thus activating the shift-and-add operation. The SHADD signal is latched and delayed by one clock internally. FIGURE 2. HSP43881 DF OUTPUT STAGE The 26 least significant bits (LSBs) from either a cell accumulator or the output buffer are output on the SUMO-25 bus. The output mux determines whether the cell accumulator selected by ADRO-2 or the output buffer is output to the bus. This mux is controlled by the SHADD input signal. Control is based on the state of the SHADD during two successive clocks; in other words, the output mux selection contains memory. If SHADD is low during a clock cycle and was low during the previous clock, the output mux selects the contents of the filter cell accumulator addressed by ADRO-2. Otherwise the output mux selects the contents of the output mux selects the contents of the output mux selects the contents of the output mux selects the contents of the output mux selects the contents of the output buffer. If the ADRO-2 lines remain at the same address for more than one clock, the output at SUMO-25 will not change to reflect any subsequent accumulator updates in the addressed cell. Only the result available during the first clock when ADRO-2 selects the cell will be output. This does not hinder normal FIR operation since the ADRO-2 lines are changed sequentially. This feature facilitates the interface with slow memories where the output is required to be fixed for more than one clock. The SUM0-25 output bus is controlled by the SENBH and SENBL signals. A low on SENBL enables bits SUM0-15. A low on SENBH enables bits SUM16-25. Thus all 26 bits can be output simultaneously if the external system has a 26-bit or larger bus. If the external system bus is only 16 bits, the bits can be enabled in two groups of 16 and 10 bits (sign extended). #### **DF Arithmetic** Both data samples and coefficients can be represented as either unsigned or two's complement numbers. The TCS and TCCI inputs determine the type of arithmetic representation. Internally all values are represented by a 9-bit two's complement number. The value of the additional ninth bit depends on the arithmetic representation selected. For two's complement arithmetic, the sign is extended into the ninth bit. For unsigned arithmetic, bit 9 is 0. The multiplier output is 18 bits and the accumulator is 26 bits. The accumulator width determines the maximum possible number of terms in the sum of products without overflow. The maximum number of terms depends also on the number system and the distribution of the coefficient and data values. Then maximum numbers of terms in the sum products are: | NUMBER SYSTEM | MAX #<br>OF TERMS | |-------------------------------------------------------------------------------------------------------------------|----------------------| | Two unsigned vectors | 1032 | | Two two's complement: Two positive vectors Negative vectors One positive and one negative vector | 2080<br>2047<br>2064 | | One unsigned and one two's complement vector: Positive two's complement vector Negative two's complement vector | 1036<br>1028 | For practical FIR filters, the coefficients are never all near maximum value, so even larger vectors are possible in practice. #### Basic FIR Operation A simple, 30MHz 8-tap filter example serves to illustrate more clearly the operation of the DF. The sequence table (Table 1) shows the results of the multiply accumulate in each cell after each clock. The coefficient sequence, Cn, enters the DF on the left and moves from left to right through the cells. The data sample sequence, Xn, enters the DF from the top, with each cell receiving the same sample simultaneously. Each cell accumulates the sum of products for one output point. Eight sums of products are calculated simultaneously, but staggered in time so that a new output is available every system clock. | CLK | CELL 0 | CELL 1 | CELL 2 | CELL 3 | CELL 4 | CELL 5 | CELL 6 | CELL 7 | SUM/CLR | |-----|-----------------------------------|-----------------------------------|----------------------------------|-----------|-----------------------|-----------|-----------|-----------------------------------|--------------| | 0 | C7 x X0 | 0 | 0 | 0 | | | | | _ | | 1 1 | +C6 x X1 | C7 x X1 | 0 | 0 | | | | | - | | 2 | +C5 x X2 | +C6 x X2 | C7 x X2 | 0 | | ļ | | | - | | 3 | +C4 x X3 | +C5 x X3 | +C6×X3 | C7 x X3 | | į | | | - | | 4 | +C3 x X4 | +C4 x X4 | +C5 x X4 | +C6 x X4 | C7 x X4 | ] | | | - | | 5 | +C2 x X5 | C3×X5 | +C4 x X5 | +C5 x X5 | +C6 x X5 | C7×X5 | | 1 | - | | 6 | +C1 x X6 | +C2 x X6 | +C3 x X6 | +C4 x X6 | +C5 × X6 | +C6 x X6 | C7×X6 | 1 | - | | 7 | +C <sub>0</sub> x X <sub>7</sub> | +C <sub>1</sub> x X <sub>7</sub> | +C2 x X7 | +C3 x X7 | +C4 x X7 | +C5 x X7 | +C6 x X7 | C7 x X7 | Cell 0 (Y7) | | 8 | C7 x X8 | +C <sub>0</sub> x X <sub>8</sub> | +C1 x X8 | +C2 x X8 | +C3 x X8 | +C4×X8 | +C5 x X8 | +C6 x X8 | Cell 1 (Y8) | | 9 | +C6 x X9 | C <sub>7</sub> x X <sub>9</sub> | +C <sub>0</sub> x X <sub>9</sub> | +C1 x X9 | +C2 x X9 | +C3×X9 | +C4 x X9 | +C5 x X9 | Cell 2 (Y9) | | 10 | +C5 × X10 | +C6 x X10 | C7 x X10 | +C0×X10 | +C1 x X10 | +C2 × X10 | +C3 × X10 | +C4 x X10 | Cell 3 (Y10) | | 11 | +C4 x X11 | +C5 x X11 | +C6×X11 | C7 x X11 | +C0 × X11 | +C1 x X11 | +C2 x X11 | +C3 × X11 | Cell 4 (Y11) | | 12 | +C3 x X12 | +C4 x X12 | +C5 x X12 | +C6 x X12 | C7 x X12 | +C0 × X12 | +C1 x X12 | +C2 × X12 | Cell 5 (Y12) | | 13 | +C2 x X13 | +C3 × X13 | +C4 × X13 | +C5 × X13 | +C6 × X13 | C7 x X13 | +C0 × X13 | +C1 × X13 | Cell 6 (Y13) | | 14 | +C1 x X14 | +C2 x X14 | +C3 x X14 | +C4 × X14 | +C5 x X14 | +C6 × X14 | +C7 x X14 | +C <sub>0</sub> × X <sub>14</sub> | Cell 7 (Y14) | | 15 | +C <sub>0</sub> × X <sub>15</sub> | +C <sub>1</sub> x X <sub>15</sub> | +C2 x X15 | +C3 x X15 | +C4 x X <sub>15</sub> | +C5 x X15 | +C6 x X15 | C7 x X15 | Cell 0 (Y15) | Detailed operation of the DF to perform a basic 8-tap, 8-bit coefficient, 8-bit data, 30MHz FIR filter is best understood by observing the schematic (Figure 3) and timing diagram (Figure 4). The internal pipeline length of the DF is four (4) clock cycles, corresponding to the register levels CREG (or XREG), MREGO, MREG1, and TREG (Figures 1 and 2). Therefore the delay from presentation of data and coefficients at the DINO-7 and CINO-7 inputs to a sum appearing at the SUMO-25 output is: k + Tc where k = filter length Td = 4, the internal pipeline delay of DF After the pipeline has filled, a new output sample is available every clock. The delay to last sample output from last sample input is Td. The output sums, Yn, shown in the timing diagram are derived from the sum-of-products equation: $Y(n) = C(0) \times X(n) + C(1) \times X(n-1) + C(2) \times X(n-2) + C(3) \times X(n-3) + C(4) \times X(n-4) + C(5) \times X(n-5) + C(6) \times X(n-6) + C(7) \times X(n-7)$ #### Extended FIR Filter Length Filter lengths greater that eight taps can be created by either cascading together multiple DF devices or "reusing" a single device. Using multiple devices, an FIR filter of over 1000 taps can be constructed to operate at a 30MHz sample rate. Using a single device clocked at 30MHz, an FIR filter of over 1000 taps can be constructed to operate at less than a 30MHz sample rate. Combinations of these two techniques are also possible. FIGURE 4. HSP43881 30MHz, 8 TAP FIR FILTER TIMING #### Cascade Configuration To design a filter length L>8, L/8 DFs are cascaded by connecting the COUTO-7 outputs of the (i)th DF to the CINO-7 inputs of the (i+1)th DF. The DINO-7 inputs and SUMO-25 outputs of all the DFs are also tied together. A specific example of two cascaded DFs illustrates the technique (Figure 5). Timing (Figure 6) is similar to the simple 8-tap FIR, except the ERASE and SENBL/SENBH signals must be enabled independently for the two DFs in order to clear the correct accumulators and enable the SUM0-25 output signals at the proper times. # Single DF Configuration Using a single DF, a filter of length L>8 can be constructed by processing in L/8 passes as illustrated in the following table (Table 2) for a 16-tap FIR. Each pass is composed of TABLE 2. HSP43881 16-TAP FIR FILTER SEQUENCE USING A SINGLE DF | CLK | CELL 0 | CELL 1 | CELL 2 | CELL 3 | CELL 4 | CELL 5 | CELL 6 | CELL 7 | SUM/CLR | |----------|------------------------------------------------------------------------|----------------------------------|-----------------------------------|------------------------------------------------------------------------|-----------------------------------|-----------------------------------|-----------------------------------|------------------------------------|------------------------------| | 6 | C <sub>15</sub> × X <sub>0</sub> | 0 | 0 | 0 | | | | | _ | | 7 | +C14 × X1 | C <sub>15</sub> x X <sub>1</sub> | 0 | 0 | | | | 1 1 | _ | | 8 | +C <sub>13</sub> x X <sub>2</sub> | | C <sub>15</sub> x X <sub>2</sub> | 0 | 1 | | | | _ | | 9 | +C <sub>12</sub> x X <sub>3</sub> | | l ı | C <sub>15</sub> × X <sub>3</sub> | C | | | 1 | _ | | 111 | +C <sub>11</sub> × X <sub>4</sub><br>+C <sub>10</sub> × X <sub>5</sub> | | 1 | +C <sub>14</sub> × X <sub>4</sub><br>+C <sub>13</sub> × X <sub>5</sub> | C <sub>15</sub> x X <sub>4</sub> | C <sub>15</sub> x X <sub>5</sub> | | | _ | | 12 | +C9 x X6 | | 1 | +C <sub>12</sub> x X <sub>6</sub> | | 015 7 75 | C <sub>15</sub> x X <sub>6</sub> | | _ | | 13 | +C8 × X7 | | l l | +C <sub>11</sub> x X <sub>7</sub> | | | V15^^6 | C <sub>15</sub> × X <sub>7</sub> | | | 14 | +C7 × X8 | | | +C <sub>10</sub> x X <sub>8</sub> | | | | +C <sub>14</sub> × X <sub>8</sub> | _ | | 15 | +C6 x X9 | 1 | 1 1 | +Cg x Xg | l i | 1 1 | l l | | _ | | 16 | +C5 × X10 | | l i | | | 1 1 | i į | +C <sub>13</sub> × X <sub>9</sub> | <del>-</del> | | 17 | +C4 × X11 | | | +C8 x X10 | | | l i | +C <sub>12</sub> × X <sub>10</sub> | _ | | 18 | +C3 × X12 | | | +C7 x X11 | | | | +C <sub>11</sub> x X <sub>11</sub> | _ | | 19 | +C2 × X13 | | | +C6 x X12 | | | | +C <sub>10</sub> × X <sub>12</sub> | <del>-</del> | | 20 | +C1 x X14 | | | +C5 x X13 | | | | +C9 × X13 | _ | | 21 | +C <sub>0</sub> × X <sub>15</sub> | 1 | | +C4 x X14 | | | | +C8 × X14 | -<br>OFI : 0 ()(4.5) | | 22 | 0 | C <sub>0</sub> × X <sub>16</sub> | ↓ ↓ | +C3 x X <sub>15</sub><br>+C <sub>2</sub> x X <sub>16</sub> | | | | +C7 × X15<br>+C6 × X16 | CELL 0 (Y15)<br>CELL 1 (Y16) | | 23 | Ö | 0 | C <sub>0</sub> × X <sub>17</sub> | +C1 x X17 | | | l I | +C5 X X17 | CELL 2 (Y17) | | 24 | 0 | Ö | 0 '' | +C0 x X18 | ↓ | | | +C4 x X18 | CELL 3 (Y18) | | 25 | 0 | 0 | 0 | 0 | C <sub>0</sub> x X <sub>19</sub> | ₩ | | +C3 x X19 | CELL 4 (Y19) | | 26 | 0 | 0 | 0 | 0 | 0 | C <sub>0</sub> x X <sub>20</sub> | . ♥. | +C <sub>2</sub> x X <sub>20</sub> | CELL 5 (Y20) | | 27<br>28 | 0 | 0 | 0 | 0 | 0 | 0 | C <sub>0</sub> x X <sub>21</sub> | +C1 x X21 | CELL 6 (Y21) | | 29 | C <sub>15</sub> x X <sub>8</sub> | 0 | 0 | 0 | 0 | 0 | 0 | +C <sub>0</sub> × X <sub>22</sub> | CELL 7 (Y22) | | 30 | +C <sub>14</sub> x X <sub>9</sub> | _ | _ | _ | 0 | 0 | 0 | 0 | _ | | 31 | +C <sub>13</sub> × X <sub>10</sub> | C <sub>15</sub> x X <sub>9</sub> | 0,, | 0 | 0 | 0 | 0 | 0 | - | | 32 | +C <sub>12</sub> x X <sub>11</sub> | | C <sub>15</sub> × X <sub>10</sub> | 1 - | 0 | 0 | 0 | 0 | - | | 33 | +C <sub>11</sub> × X <sub>12</sub> | | ! i | C <sub>15</sub> × X <sub>11</sub> | 0 | 0 | 0 | 0 | - | | 34 | +C <sub>10</sub> × X <sub>13</sub> | | 1 | | C <sub>15</sub> × X <sub>12</sub> | 0 | 0 | 0 | - | | 35 | +C9 × X14 | | 1 1 | 1 | | C <sub>15</sub> × X <sub>13</sub> | 0 | 0 | - | | 36 | +C8 × X15 | | 1 1 | | | | C <sub>15</sub> × X <sub>14</sub> | 0 | - | | 37 | +C7 × X16 | | | | | | | C <sub>15</sub> x X <sub>15</sub> | <del>-</del> | | 38 | +C7 × × 16<br>+C6 × × 17 | | 1 1 | | | | | +C <sub>14</sub> × X <sub>16</sub> | - | | 39 | | | | | | | | +C <sub>13</sub> × X <sub>17</sub> | - | | 40 | +C5 x X18 | | | | | | | +C <sub>12</sub> × X <sub>18</sub> | - | | 41 | +C4 x X19 | | 1 | | | | | +C <sub>11</sub> × X <sub>19</sub> | - | | 42 | +C3 × X20 | | | | | | | +C <sub>10</sub> x X <sub>20</sub> | _ | | 42 | +C2 x X21 | | | | | | | +C9 x X21 | ~ | | 43 | +C <sub>1</sub> x X <sub>22</sub> | | | | | | | +C8 x X22 | - | | | +C <sub>0</sub> ×X <sub>23</sub> | <b>♥</b> | | | | | | +C7 x X23 | CELL 0 (Y23) | | 45<br>46 | 0 | C <sub>0</sub> x X <sub>24</sub> | <b>*</b> | | | | | +C6 x X24 | CELL 1 (Y24) | | 46 | 0 | 0 | C <sub>0</sub> × X <sub>25</sub> | ♦ | | | | +C5 x X25 | CELL 2 (Y25) | | 47 | 0 | 0 | 0 | C <sub>0</sub> × X <sub>26</sub> | <b>♦</b> | | | +C4 x X26 | CELL 3 (Y26) | | 48 | 0 | 0 | _ | 0 | C <sub>0</sub> x X <sub>27</sub> | | <b>*</b> | +C3 x X27 | CELL 4 (Y27) | Tp = 7 + L cycles and computes eight output samples. In pass i, the sample with indices i\*8 to i\*8 +(L-1) enter the DINO-7 inputs. The coefficients $C_0$ - $C_L$ - 1 enter the CINO-7 inputs, followed by seven zeros. As these zeros are entered, the result samples are output and the accumulators reset. Initial filing of the pipeline is not shown in this sequence table. Filter outputs can be put through a FIFO to even out the sample rate. ## Extended Coefficient and Data Sample Word Size The sample and coefficient word size can be extended by utilizing several DFs in parallel to get the maximum sample rate or a single DF with resulting lower sample rates. The technique is to compute partial products of 8x8 and combine these partial products by shifting and adding to obtain the final result. The shifting and adding can be accomplished with external adders (at full speed) or with the DF's shift-and-add mechanism contained in its output stage (at reduced speed). #### Decimation/Resampling The HSP43881 DF provides a mechanism for decimating by factors of 2, 3, or 4. From the DF filter cell block diagram (Figure 1), note the three D registers and two multiplexers in the coefficient path through the cell. These allow the coefficients to be delayed by 1, 2, or 3 clocks through the cell. The sequence table (Table 3) for a decimate-by-two-filter illustrates the technique (internal cell pipelining ignored for simplicity). Detailed timing for a 30MHz input sample rate, 15MHz output sample rate (i.e., decimate-by-two), 16-tap FIR filter, including pipelining, is shown in Figure 7. This filter requires only a single HSP43881 DF. | | CLK | CELL 0 | CELL 1 | CELL 2 | CELL 3 | CELL 4 | CELL 5 | CELL 6 | CELL 7 | SUM/CLR | |-----|----------|------------------------------------------------------------------------|----------------------------------|----------------------------------|----------------------------------|----------------------------------|------------------------------------|-----------------------------------|------------------------------------|--------------| | | 6 | C <sub>15</sub> x X <sub>0</sub> | 0 | 0 | 0 | 0 | 0 | 0 | 0 | | | - 1 | 7 | +C <sub>14</sub> × X <sub>1</sub> | 0 | 0 | 0 | 0 | 0 | 0 | 0 | - | | - 1 | 8 | +C <sub>13</sub> x X <sub>2</sub> | C <sub>15</sub> x X <sub>2</sub> | 0 | 0 | 0 | 0 | 0 | 0 | - | | - 1 | 9 | +C <sub>12</sub> x X <sub>3</sub> | | 0 | 0 | 0 | 0 | 0 | 0 1 | - | | - 1 | 10 | +C <sub>11</sub> x X <sub>4</sub> | | C <sub>15</sub> x X <sub>4</sub> | 0 | 0 | 0 | 0 | 0 | - | | | 11<br>12 | +C <sub>10</sub> x X <sub>5</sub> | | | 1 - | 0 | 0 | 0 | 0 | - | | | 13 | +C9 x X6 | | | C <sub>15</sub> × X <sub>6</sub> | 0 | 0 | 0 | 0 | - | | | | +C8 x X7 | | | | 0 | 0 | 0 | 0 | - | | | 14 | +C7 x X8 | | | | C <sub>15</sub> × X <sub>8</sub> | 0 | 0 | 0 | - | | | 15 | +C6 x X9 | | | | | 0 | 0 | 0 | - | | | 16 | +C5 x X <sub>10</sub> | | | | | C <sub>15</sub> x X <sub>10</sub> | 0 | 0 | - | | | 17 | +C4 x X11 | | | ŀ | | 1 | 0 | 0 | - ] | | | 18 | +C3 x X12 | | | | | | C <sub>15</sub> x X <sub>12</sub> | 0 | - 1 | | | 19 | +C2 x X13 | | | | | | l "1 | o | - | | | 20 | +C1 x X14 | | | | | 1 1 | | C <sub>15</sub> × X <sub>14</sub> | - | | | 21 | +C0 × X15 | | | | | 1 | | +C <sub>14</sub> x X <sub>15</sub> | CELL 0 (Y15) | | | 22 | C <sub>15</sub> x X <sub>16</sub> | | | | | | | +C <sub>13</sub> × X <sub>16</sub> | - | | | 23 | +C14 x X17 | | | | | | | +C <sub>12</sub> x X <sub>17</sub> | CELL 1 (Y17) | | | 24 | +C <sub>13</sub> x X <sub>18</sub> | | | | | | | +C <sub>11</sub> x X <sub>18</sub> | - | | | 25 | +C <sub>12</sub> x X <sub>19</sub> | | | | | | | +C <sub>10</sub> x X <sub>19</sub> | CELL 2 (Y19) | | | 26 | +C <sub>11</sub> × X <sub>20</sub> | | | | | | | +C <sub>9</sub> x X <sub>20</sub> | <del>-</del> | | | 27 | +C <sub>10</sub> x X <sub>21</sub> | | | | | 1 | | +C8 x X21 | CELL 3 (Y21) | | | 28<br>29 | +C <sub>9</sub> x X <sub>22</sub><br>+C <sub>8</sub> x X <sub>23</sub> | | | | | | | +C <sub>7</sub> x X <sub>22</sub> | - | | | 30 | | | | | | | | +C6 x X23 | CELL 4 (Y23) | | | | +C7 x X24 | | | | | | | +C5 x X24 | - | | | 31 | +C <sub>6</sub> x X <sub>25</sub> | | | l i | | 1 1 | | +C4 x X25 | CELL 5 (Y25) | | | 32 | +C <sub>5</sub> x X <sub>26</sub> | | | | | | | +C3 x X26 | - ] | | | 33 | +C <sub>4</sub> x X <sub>27</sub> | | | | | | | +C2 x X27 | CELL 6 (Y27) | | | 34 | +C3 x X28 | | | | | i l | | +C <sub>1</sub> x X <sub>28</sub> | - 1 | | | 35 | +C2 x X29 | | 1 1 | | 1 | | | +C0 x X29 | CELL 7 (Y29) | | | 36 | +C <sub>1</sub> x X <sub>30</sub> | ↓ | | | I ↓ | I ↓ | ↓ | C <sub>15</sub> x X <sub>30</sub> | - | | | 37 | +C <sub>0</sub> x X <sub>31</sub> | +C14 x X31 | +C14 x X31 | +C14 X X31 | +C14 x X21 | +C <sub>14</sub> x X <sub>31</sub> | +C14 X X21 | +C <sub>14</sub> x X <sub>31</sub> | CELL 8 (Y31) | | | 1 | 1 1 | l 'i " | l 'i " | ''i '' | l 'i '' | l 'i " | l 'i '' | 'i " | i` | | | 1 | | 1 | | | | | | | | | ı | | ▼ | ▼ | <u> </u> | ▼ | | ▼ | ▼ | _ ▼ | ▼ | # Specifications HSP43881 # **Absolute Maximum Ratings** | upply Voltage | Vo.8+ | |-------------------------------------------|---------------------------| | put, Output Voltage | | | torage Temperature | 65°C to +150°C | | SD | | | laximum Package Power Dissipation at 70°C | 2.4W (PLCC), 2.88W (PGA) | | c · · · · · · · · · · · · · · · · · · · | | | a · · · · · · · · · · · · · · · · · · · | | | ate Count | | | unction Temperature | 150°C (PLCC), 175°C (PGA) | | ead Temperature (Soldering 10s) | | # **Operating Conditions** | Operating Voltage Range | | |-----------------------------|---| | Operating Temperature Range | 1 | #### **D.C. Electrical Specifications** | SYMBOL | PARAMETER | MIN | MAX | UNITS | TEST CONDITIONS | | |-----------------|------------------------------|-----|----------|----------|----------------------------------------------------------------|--| | ICCOP | Power Supply Current | | 140 | mA | V <sub>CC</sub> = Max<br>CLK Frequency 20MHz<br>Note 1, Note 3 | | | ICCSB | Standby Power Supply Current | - | 500 | μА | V <sub>CC</sub> = Max, Note 3 | | | Ι <sub>1</sub> | Input Leakage Current | -10 | 10 | μА | V <sub>CC</sub> = Max, Input = 0V or V <sub>CC</sub> | | | Ю | Output Leakage Current | -10 | 10 | μА | V <sub>CC</sub> = Max, Input = 0V or V <sub>CC</sub> | | | V <sub>IH</sub> | Logical One Input Voltage | 2.0 | - | V | V <sub>CC</sub> = Max | | | V <sub>IL</sub> | Logical Zero Input Voltage | | 0.8 | V | V <sub>CC</sub> = Min | | | VOH | Logical One Output Voltage | 2.6 | _ | ٧ | I <sub>OH</sub> = -400μA, V <sub>CC</sub> = Min | | | v <sub>OL</sub> | Logical Zero Output Voltage | - | 0.4 | ٧ | I <sub>OL</sub> = 2mA, V <sub>CC</sub> = Min | | | VIHC | Clock Input High | 3.0 | _ | ٧ | V <sub>CC</sub> = Max | | | VILC | Clock Input Low | - | 0.8 | ٧ | V <sub>CC</sub> = Min | | | C <sub>IN</sub> | Input Capacitance PLCC PGA | - | 10<br>15 | pF<br>pF | CLK Frequency 1MHz<br>All measurements referenced | | | COUT | Output Capacitance PLCC PGA | - | 10<br>15 | pF<br>pF | to GND<br>T <sub>A</sub> = +25°C, Note 2 | | #### NOTES: - Operating supply current is proportional to frequency. Typical rating is 7mA/MHz. - Controlled via design or process parameters and not directly tested. Characterized upon initial design and after major process and/or design changes. - 3. Output load per test load circuit and $C_L$ = 40pF. | A.C. Electrical Specifications | $V_{CC} = 5V + 5\%$ . $T_A = 0^{\circ}C$ to $+70^{\circ}C$ | |--------------------------------|------------------------------------------------------------| | | | | | | -20 (20MHz) | | -25 (25.6MHz) | | -30 (30MHz) | | | TEST | |-----------------|------------------------------------|-------------|-----|---------------|-----|-------------|-----|-------|------------| | SYMBOL | PARAMETER | MIN | MAX | MIN | MAX | MIN | MAX | UNITS | CONDITIONS | | T <sub>CP</sub> | Clock Period | 50 | - | 39 | | 33 | - | ns | | | T <sub>CL</sub> | Clock Low | 20 | - | 16 | - | 13 | - | ns | | | TCH | Clock High | 20 | - | 16 | 1 | 13 | - | ns | | | TIS | Input Setup | 16 | - | 14 | - | 13 | - | ns | | | TIH | Input Hold | 0 | - | 0 | - | 0 | - | ns | | | TODC | CLK to Coefficient<br>Output Delay | - | 24 | - | 20 | - | 18 | ns | | | TOED | Output Enable Delay | - | 20 | | 15 | - | 15 | ns | | | TODD | Output Disable Delay | - | 20 | - | 15 | - | 15 | ns | Note 1 | | - Tops | CLK to SUM<br>Output Delay | - | 27 | 1 | 25 | - | 21 | ns | | | TOR | Output Rise | - | 6 | - | 6 | <b>-</b> | 6 | ns | Note 1 | | TOF | Output Fall | - | 6 | - | 6 | _ | 6 | ns | Note 1 | # Test Load Circuit Switch S1 Open for ICCSB and ICCOP Tests NOTE: 1. Controlled by design or process parameters and not directly tested. Characterized upon initial design and after major process and/or design # Waveforms ~~-- 4.0V 2.0V 0.00 CLK - TIS 3.0V TCP 1.5V 1.5V TCH TCL. 0.00 \* Input includes: DINO-7, CINO-7, DIENB, CIENB, ERASE, RESET, DCM0-1, ADR0-2, TCS, TCCI, SHADD 2.0V 2.0V CLK **CLOCK AC PARAMETERS** INPUT SETUP AND HOLD 2.0V TODC TODS 2.0V SUM0 - 25 COUT0 - 7 1.5V 0.80 TCCO TOR \* SUM0-25, COUT0-7, TCCO are assumed not to be in SUM0-25, COUTO-7, TCCO OUTPUT DELAYS **OUTPUT RISE AND FALL TIMES** SENBL SENBH COENB 1.5V 1.5V DEVICE UNDER INPUT 1.5V 1.5V OUTPUT 0.0V TOED TODD SUMO - 25 COUTO - 7 TCCO A.C. Testing: Inputs are driven at 3.0V for Logic "1" and 0.0V for Logic "0". Input and output timing measurements are made at 1.5V for both a Logic "1" HIGH HIGH IMPEDANCE IMPEDANCE and "0". CLK is driven at 4.0V and 0V and measured at 2.0V. **OUTPUT ENABLE, DISABLE TIMING** A.C. TESTING INPUT, OUTPUT WAVEFORM