Title:
CSA 5-3 compressor circuit and carry-save adder circuit using same
Document Type and Number:
Kind Code:
A1

Abstract:
At least two EOR circuits for carry-out which output carry-out bits and the complementary signals thereof are provided in the 5-3 compressor circuits constituted by an EOR circuit group, and dual lanes are employed at least for carry-out. As a result, the number of inverters required can be reduced, increases in delay time can be suppressed, and fast addition operation can be achieved.
Inventors:
Abe, Kazuhiro (Kawasaki, JP)
Application Number:
11/905324
Publication Date:
02/28/2008
Filing Date:
09/28/2007
View Patent Images:
Images are available in PDF form when logged in. To view PDFs, Login  or  Create Account (Free!)
Assignee:
FUJITSU LIMITED (Kawasaki, JP)
Primary Class:
International Classes:
H04L1/00
Attorney, Agent or Firm:
STAAS & HALSEY LLP (SUITE 700, 1201 NEW YORK AVENUE, N.W., WASHINGTON, DC, 20005, US)
Claims:
1. A 5-3 compressor circuit for CSA, which receives four input signals and a lower-digit carry-out input, and computes addition bits, carry bits, and said carry-out bits, comprising: four first-stage EOR circuits, which receive said four input signals and complementary signals of said four input signals, perform EOR computation of two input signals, and output two EOR outputs and complementary signals of said two EOR outputs; two second-stage EOR circuits, which receive the outputs of said four first-stage EOR circuits, perform EOR computation of said two EOR outputs, and output one EOR output and the complementary signal of said EOR output; a third-stage EOR circuit, which receives the outputs of said two second-stage EOR circuits and said carry-out bits, performs EOR computation of said EOR outputs and said carry-out bits, and outputs addition bits; a carry output EOR circuit, which receives the outputs of said two second-stage EOR circuits, said carry-out bits, and one of said four input signals, performs EOR computation of said EOR outputs, said carry-out bits, and said one input signal, and outputs carry bits; and two carry-out EOR circuits, which receive the outputs of said two first-stage EOR circuits and two complementary signals of said input signals, perform EOR computation of said two EOR outputs and said two complementary signals, and output said carry-out bits and the complementary signals thereof.

2. The 5-3 compressor circuit for CSA according to claim 1, wherein said third-stage EOR circuit comprises a pair of EOR circuits, which receive the outputs of said two second-stage EOR circuits and said carry-out bits, perform EOR computation of said EOR outputs and said carry-out bits, and output addition bits and the complementary signals thereof, and wherein said carry output EOR circuit comprises a pair of EOR circuits, which receive the outputs of said two second-stage EOR circuits, said carry-out bits, and one of said four input signals, perform EOR computations of said EOR outputs, said carry-out bits, and said one input signal, and output carry bits and the complementary signals thereof.

3. The 5-3 compressor circuit for CSA according to claim 1, further comprising four inverters which generate the complementary signals of said four input signals.

4. The 5-3 compressor circuit for CSA according to claim 2, wherein said four first-stage EOR circuits receive said four input signals and the complementary signals thereof from a data bus.

5. The 5-3 compressor circuit for CSA according to claim 3, wherein said four first-stage EOR circuits and said four inverters receive said four input signals from a data bus.

6. A carry-save adder circuit comprising: a plurality of 5-3 compressor circuits for CSA, which receive four input signals and a lower-digit carry-out input, and compute addition bits, carry bits, and said carry-out bits, and connected by said carry-out inputs, wherein each of said 5-3 compressor circuits for CSA comprises: four first-stage EOR circuits, which receive four input signals and complementary signals thereof, perform EOR computation of every two input signals, and output two EOR outputs and complementary signals thereof; two second-stage EOR circuits, which receive the outputs of said four first-stage EOR circuits, perform EOR computation of said two EOR outputs, and output one EOR output and the complementary signal thereof; a third-stage EOR circuit, which receives the outputs of said two second-stage EOR circuits and said carry-out bits, performs EOR computation of said EOR outputs and said carry-out bits, and outputs addition bits; a carry output EOR circuit, which receives the outputs of said two second-stage EOR circuits, said carry-out bits, and one of said four input signals, performs EOR computation of said EOR outputs, said carry-out bits, and said one input signal, and outputs carry bits; and two carry-out EOR circuits, which receive the outputs of said two first-stage EOR circuits and two complementary signals of said input signals, perform EOR computation of said two EOR outputs and said two complementary signals, and output said carry-out bits and the complementary signals thereof.

7. The carry-save adder circuit according to claim 6, wherein said third-stage EOR circuit comprises a pair of EOR circuits, which receive the outputs of said two second-stage EOR circuits and said carry-out bits, perform EOR computation of said EOR outputs and said carry-out bits, and output addition bits and the complementary signals thereof, and wherein said carry output EOR circuit comprises a pair of EOR circuits, which receive the outputs of said two second-stage EOR circuits, said carry-out bits, and one of said four input signals, perform EOR computations of said EOR outputs, said carry-out bits, and said one input signal, and output carry bits and the complementary signals thereof.

8. The carry-save adder circuit according to claim 6, further comprising four inverters which generate the complementary signals of said four input signals.

9. The carry-save adder circuit according to claim 7, wherein said four first-stage EOR circuits in each of said 5-3 compressor circuits receives said four input signals and the complementary signals thereof from a data bus.

10. The carry-save adder circuit according to claim 8, wherein said four first-stage EOR circuits and said four inverters in each of said 5-3 compressor circuits receive said four input signals from a data bus.

11. The carry-save adder circuit according to claim 6, wherein said plurality of 5-3 compressor circuits are provided in a plurality of stages, and wherein the addition bits and said carry bits of said plurality of 5-3 compressor circuits of the initial stage are input to said plurality of 5-3 compressor circuits of the next stage.

12. The carry-save adder circuit according to claim 6, wherein said four input signals are signals of each of partial products in multiplication.

Description:

CROSS REFERENCE TO RELATED APPLICATIONS

This application is a continuation of International Application No. PCT/JP2005/006336, filed on Mar. 31, 2005, now pending, herein incorporated by reference.

TECHNICAL FIELD

This invention relates to a 5-3 compressor circuit for a carry-save adder circuit, which adds partial products when performing multiplication, and to a carry-save adder circuit, and in particular relates to a 5-3 compressor circuit for a carry-save adder circuit which performs high-speed addition operations and to a carry-save adder circuit.

BACKGROUND ART

As processing speeds of CPUs (Central Processor Units) have increased, CPU configurations have grown more complex. For example, a CPU comprises a computation unit, an instruction unit, a cache memory, a cache controller unit, and various registers. With advances in integration technology in recent years, there have been trends toward incorporation of CPUs on a single chip, and a need has arisen for faster operation speeds for each of the units in a CPU.

In a computation unit, there exist adders/subtractors, logical computation circuits, shift circuits, and similar. Of these, carry-save adder circuits, which are circuits to add partial products when performing multiplication, are used as adders. FIG. 7 explains multiplication operation; for example, in order to multiply E, consisting of Em through E 0 , and F, consisting of Fm through F 0 , E, that is, (Em˜E 0 ), is multiplied by each of the digits of F, and addition of the multiplication results (called partial products) is performed. Here, if the multiplication result is G, then the multiplication result is obtained from the following equation.
G =( Em˜E 0)× F 0+( Em˜E 0)× F 1+ . . . +( Em˜E 0)× Fm

In FIG. 7, the partial product X 1 (X 1 m ˜X 10 ) is (Em˜E 0 )×F 0 , the partial product X 2 (X 2 m ˜X 20 ) is (Em˜E 0 )×F 1 , and similarly thereafter, the partial product Xn (Xnm˜Xn 0 ) is (Em˜E 0 )×Fm. These partial products are added to obtain the final multiplication result G.

Operation of a carry-save adder which adds these partial products is explained in FIG. 8. In FIG. 8, an example in which eight partial products X 1 to X 8 are added is shown. In the first stage, the carry-save adder (CSA) circuit calculates the addition results S 1 m to S 10 for each digit and the carry bits C 1 m to C 10 for each digit for the partial products X 1 to X 4 , and the addition results S 2 m to S 20 for each digit and the carry bits C 2 m to C 20 for each digit for the partial products X 5 to X 8 . In the second stage, the addition results S 1 m to S 10 , carry bits for each digit C 1 m to C 10 , addition results S 2 m to S 20 , and carry bits for each digit C 2 m to C 20 are added for each digit, to obtain addition results S 3 m to S 30 and carry bits C 3 m to C 30 (see for example Japanese Patent Laid-open Hei No. 2-501242).

This CSA is configured by connecting 5-3 compressor circuits for each digit. FIG. 9 shows the configuration of a conventional CSA using 5-3 compressor circuits. In FIG. 9, the 5-3 compressor circuits 100 - 1 , 100 - 2 , 100 - 3 and 100 - 4 are connected in series. The 5-3 compressor circuit 100 - 2 ( 100 - 1 , 100 - 3 , 100 - 4 ) used in the CSA mainly comprises exclusive-OR circuits (EORs). That is, the exclusive OR of two signals is generated and used.

That is, in order to generate the complementary signals of the inputs (A 1 , A 2 , A 3 , A 4 ), inverter gates 110 , 112 , 114 , 116 are inserted, and pairs of pass transistors 120 , 122 , 124 , 126 compute the EORs. That is, output of the EOR of A 1 and A 2 is the sum of the product of A 1 and the inverse of A 2 with the product of the inverse of A 1 and A 2 , as shown in FIG. 10.

Hence the EOR circuit 122 outputs the EOR of A 1 and A 2 , the EOR circuit 120 outputs the inverse of the EOR of A 1 and A 2 , the EOR circuit 126 outputs the EOR of A 3 and A 4 , and the EOR circuit 124 outputs the inverse of the EOR of A 3 and A 4 .

Similarly, the EOR circuit 142 outputs the EOR of A 1 , A 2 , A 3 and A 4 , and the EOR circuit 140 outputs the inverse of the EOR of A 1 , A 2 , A 3 and A 4 ; the EOR circuit 144 which receives these outputs outputs the EOR of A 1 , A 2 , A 3 , A 4 , and CI as the S (SUM). Similarly, the EOR circuit 146 , which receives A 1 via the inverse gates 134 and 152 , the outputs of the EOR circuits 140 and 142 , and CI via an inverse gate, outputs the carry result C shown in FIG. 10.

Further, the EOR circuit 148 , which receives the output of EOR circuit 124 , the output of EOR circuit 126 , the inverse of A 2 via an inverter gate 124 , and the inverse of A 4 via an inverter gate 116 , outputs the carry-out result CO in FIG. 10 via the inverter gate 154 . This carry-out CO is output as the carry-in CI of the upper-digit 5-3 compressor circuit 100 - 3 .

In this way, the carry-out CO is the carry result to the upper digit, and the carry C is carried to the next stage (see FIG. 8). That is, this adder circuit is a five-input, three-output adder circuit.

However, in the technology of the prior art, complementary signals are required as input for EOR computation, as shown in FIG. 10, so that inverter gates must be inserted, and hence delay times are lengthened and high-speed addition operation is difficult. In particular, in a CSA which performs multiplication of partial products with a plurality of digits, as in FIG. 8, the delay time of one adder circuit accumulates, so that the addition time of partial products is lengthened.

Hence an object of this invention is to provide a 5-3 compressor circuit for a CSA to realize high-speed operation of the adder circuit when performing EOR computations, as well as a carry-save adder circuit using this 5-3 compressor circuit.

A further object of this invention is to provide a 5-3 compressor circuit for a CSA to prevent increases in the delay time necessary for generation of complementary signals for EOR computation, and a carry-save adder circuit using this 5-3 compressor circuit.

Still a further object of this invention is to provide a 5-3 compressor circuit for a CSA to prevent increases in delay time due to inverter gates for EOR computation, and a carry-save adder circuit using this 5-3 compressor circuit.

DISCLOSURE OF THE INVENTION

In order to attain these objects, a 5-3 compressor circuit for a CSA of this invention receives four input signals and a lower-digit carry-out input, and computes the addition bits, a carry bit, and the carry-out bit. This compressor circuit has four first-stage EOR circuits, which receive the four input signals and the complementary signals of these, perform EOR computation of each two input signals, and output two EOR outputs and their complementary signals; two second-stage EOR circuits, which receive the outputs of the four first-stage EOR circuits, perform EOR computations of the two EOR outputs, and output one EOR output and the complementary signal thereof; a third-stage EOR circuit, which receives the outputs of the two second-stage EOR circuits and the carry-out bit, performs EOR computation of the EOR outputs and the carry-out bit, and outputs addition bits; a carry-output EOR circuit, which receives the outputs of the two second-stage EOR circuits, the carry-out bit, and one of the four-input signals, performs EOR computation of the EOR outputs, the carry-out bit, and the one input signal, and outputs a carry bit; and two carry-out EOR circuits, which receive the outputs of two first-stage EOR circuits and two complementary signals of the input signals, perform EOR computation of the two EOR outputs and the two complementary signals, and output the carry-out bit and the complementary signal thereof.

Further, a carry-save adder circuit of this invention connects to the carry-out input a plurality of 5-3 compressor circuits for a CSA, which receive four input signals and a lower-digit carry-out input, and compute addition bits, a carry bit, and the carry-out bit. Each of the 5-3 compressor circuits for a CSA has four first-stage EOR circuits, which receive the four input signals and the complementary signals of these, perform EOR computation of each two input signals, and output two EOR outputs and their complementary signals; two second-stage EOR circuits, which receive the outputs of the four first-stage EOR circuits, perform EOR computations of the two EOR outputs, and output one EOR output and the complementary signal thereof; a third-stage EOR circuit, which receives the outputs of the two second-stage EOR circuits and the carry-out bit, performs EOR computation of the EOR outputs and the carry-out bit, and outputs addition bits; a carry-output EOR circuit, which receives the outputs of the two second-stage EOR circuits, the carry-out bit, and one of the four input signals, performs EOR computation of the EOR outputs, the carry-out bit, and the one input signal, and outputs a carry bit; and two carry-out EOR circuits, which receive the outputs of two first-stage EOR circuits and two complementary signals of the input signals, perform EOR computation of the two EOR outputs and the two complementary signals, and output the carry-out bit and the complementary signal thereof.

Further, in this invention it is preferable that the third-stage EOR circuit have a pair of EOR circuits which receive the outputs from the two second-stage EOR circuits and the carry-out bit, perform EOR computation of the EOR outputs and the carry-out bit, and output the addition bits and the complementary signals thereof, and that the carry-output EOR circuit have a pair of EOR circuits which receive the outputs of the two second-stage EOR circuits, the carry-out bit, and one of the four input signals, perform EOR computation of the EOR outputs, the carry-out bit, and the one input signal, and output a carry bit and the complementary signal thereof.

Further, in this invention it is preferable that four inverters which generate complementary signals of the four input signals be further provided.

Further, in this invention it is preferable that the four first-stage EOR circuits receive the four input signals and the complementary signals thereof from a data bus.

Further, in this invention it is preferable that the four first-stage EOR circuits and the four inverters receive the four input signals from a data bus.

Further, in this invention it is preferable that a plurality of the plurality of 5-3 compressor circuits is provided in a plurality stage, and that the addition bits and carry bits of the plurality of 5-3 compressor circuits of the first stage be input to the plurality of 5-3 compressor circuits of the next stage.

Further, in this invention it is preferable that the four input signals each be partial product signals in multiplication.

In this invention, at least two carry-out EOR circuits which output a carry-out bit and the complementary signal there of are provided in a 5-3 compressor circuit comprising an EOR circuit group, and at least dual lanes for carry-out bits are provided, so that required inverters can be eliminated, increases in delay time can be suppressed, and high-speed adding operation can be achieved.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a circuit diagram of a 5-3 compressor circuit for a CSA in a first embodiment of the invention;

FIG. 2 is a circuit diagram of a carry-save adder using the 5-3 compressor circuit of FIG. 1;

FIG. 3 is a circuit diagram of a 5-3 compressor circuit for a CSA in a second embodiment of the invention;

FIG. 4 is a circuit diagram of a carry-save adder using the 5-3 compressor circuit of FIG. 3;

FIG. 5 explains the numbers of inverter stages and pass transistor stages in each stage of a 5-3 compressor circuit of the prior art, of the first embodiment, and of the second embodiment;

FIG. 6 compares the number of stages of inverters and pass transistors in each stage of a 5-3 compressor circuit of the prior art, of the first embodiment, and of the second embodiment;

FIG. 7 explains partial products in multiplication;

FIG. 8 explains CSA addition operation of partial products;

FIG. 9 is a circuit diagram of a 5-3 compressor circuit of the prior art; and

FIG. 10 explains sum, carry, and carry-out computation operation of the EOR circuit of FIG. 9.

PREFERRED EMBODIMENTS OF THE INVENTION

Below, embodiments of the invention are explained, in the order of a first embodiment, a second embodiment, and other embodiments; however, these embodiments are examples of the invention, and various modifications are not excluded.

First Embodiment

FIG. 1 is a circuit diagram of the 5-3 compressor circuit of the first embodiment of the invention; FIG. 2 is a block diagram of a CSA which uses the 5-3 compressor circuit of FIG. 1.

As shown in FIG. 1, the 5-3 compressor circuits 1 - 0 , 1 - 1 , 1 - 2 , 1 - 3 are connected in series. Each 5-3 compressor circuit 1 - 1 ( 1 - 0 , 1 - 2 , 1 - 3 ) comprises a three-stage EOR circuit group. Each EOR circuit comprises a pair of pass transistors; as shown in FIG. 10, the EOR outputs A 1 and A 2 are used in computing the sum of the product of A 1 and the inverse of A 2 with the product of the inverse of A 1 and A 2 .

In this embodiment, by employing dual lanes which use both complementary signals, high-speed operation is attained. That is, the inputs (A 1 , A 2 , A 3 , A 4 ) and the complementary signals thereof (XA 1 , XA 2 , XA 3 , XA 4 ) are input to each of the EORs 10 , 12 , 14 , 16 in the first stage. Hence the EOR circuit 12 outputs the EOR of A 1 and A 2 , the EOR circuit 10 takes the inverse of the EOR of A 1 and A 2 , the EOR circuit 16 takes the EOR of A 3 and A 4 , and the EOR circuit 14 takes the inverse of the EOR of A 3 and A 4 .

Similarly, the EOR circuit 22 of the second state outputs the EOR of A 1 , A 2 , A 3 and A 4 , the EOR circuit 20 outputs the inverse of the EOR of A 1 , A 2 , A 3 and A 4 , the EOR circuit 32 of the third stage receiving outputs output takes via the inverting gate 47 , as S (SUM), the EOR of A 1 , A 2 , A 3 , A 4 , and CI. Similarly, the EOR circuit 34 receives XA 1 via the inverting gates 41 and 43 , outputs of the EOR circuit outputs 20 , 22 and XCI, then outputs the carrier C shown in FIG. 10 via the inverting gate 48 .

The EOR circuit 24 , which receives the output of EOR circuit 14 , the output of EOR circuit 16 , XA 2 , and XA 4 , outputs the carry-out CO of FIG. 10 via the inverter gate 44 . This carry-out CO is output as the carry-in CI of the upper-digit 5-3 compressor circuit 1 - 2 . In this way, the carry-out CO is carried to the upper digit, and the carry C is the carry output for the next stage.

In order to generate a complementary signal for the output, the third-stage EOR circuit 30 , which receives the output of the second-stage EOR circuit 20 , the output of the EOR circuit 22 , CI, and XCI (the complementary signal of CI), outputs via the inverter gate 46 the inverse XS of the EOR of A 1 , A 2 , A 3 , A 4 , and CI. Similarly, the EOR circuit 36 , which receives XA 1 via the inverter gate 41 , and also receives the outputs of the EOR circuits 20 and 22 and CI, outputs via the inverter gate 49 the complementary signal XC of the carry C shown in FIG. 10.

Further, the EOR circuit 26 , which receives the output of EOR circuit 14 , the output of EOR circuit 16 , A 2 and A 4 , outputs via the inverter gate 45 the complementary signal XCO of the carry-out CO of FIG. 10. This carry-out XCO is also output as the carry-in XCI of the upper-digit 5-3 compressor circuit 1 - 2 .

The other 5-3 compressor circuits 1 - 0 , 1 - 2 , 1 - 3 are similarly configured. In this way, in a circuit which uses numerous exclusive-OR circuits such as the 5-3 compressor circuit, by focusing on the fact that an EOR circuit can reduce to zero the delay time to generate a signal and its inverted signal, and making efficient use of these two signals, high-speed operation can be achieved.

Through use of dual lanes employing both complementary signals, although the number of transistors increases in the amount of the increased output (EOR circuits 30 , 36 , 26 in FIG. 1), because complementary signals are handled, there is no need to generate inverses of signals using an inverter gate. That is, compared with the prior art in FIG. 9, the inverter gates 110 , 112 , 114 , 116 , 150 can be eliminated. Hence to this extent the number of transistors can be reduced, and as a result increases in the number of transistors can be suppressed.

The carry-out output (CO) of each CSA cell 1 - 1 is connected to the carry-in input (CI) of the higher-digit CSA cell 1 - 2 ; the number of path stages, from the input to carry-out, through the higher-digit carry-in, the higher-digit SUM, up to the carry, is one greater than the path not passing through the carry-out and carry-in, and is the critical path. The complementary signal XCO of the carry-out is generated and passed to the higher digit to achieve faster operation of the path. As a result, the critical path speed can be increased, and in terms of stages, the path from input to SUM and carry can be made the same, regardless of whether carry-out is passed.

FIG. 2 is a block diagram of a CSA using this circuit. An example of a configuration which performs the operation to add partial products in FIG. 8 is shown. The first CSA 3 performs addition of each of the digits of the partial products X 1 , X 2 , X 3 , X 4 of FIG. 8, and the second CSA 4 performs addition of each of the digits of the partial products X 5 , X 6 , X 7 , X 8 of FIG. 8. The third CSA 5 performs addition, for each digit, of the addition results S 1 m to S 10 in FIG. 8, carries for each digit C 1 m to C 10 , addition results S 2 m to S 20 , and carries for each digit C 2 m to C 20 , to obtain addition results S 3 m to S 30 and carries C 3 m to C 30 .

The CSA 3 is configured by connecting 5-3 compressor circuits 1 - 0 , 1 - 1 , 1 - 2 , 1 - 3 , 1 - 4 , 1 - 5 , shown in FIG. 1, by the lower-digit carry-outs CO. Each of the 5-3 compressor circuits 1 - 0 , 1 - 1 , 1 - 2 , 1 - 3 , 1 - 4 , 1 - 5 is connected to a bus which transmits the partial products X 1 , X 2 , X 3 , X 4 and the inverted signals XX 1 , XX 2 , XX 3 , XX 4 . Specifically, the first digit X 10 of the partial product X 1 and the complementary signal thereof XX 10 are input to the 5-3 compressor circuit 1 - 0 of the lowermost-digit (first digit). The second digit X 11 of the partial product X 1 , the complementary signal XX 11 thereof, the first digit X 20 of the partial product X 2 , and the complementary signal XX 20 thereof are input to the second-digit 5-3 compressor circuit 1 - 1 .

The third digit X 12 of the partial product X 1 and the complementary signal XX 12 thereof, the second digit X 21 of the partial product X 2 and the complementary signal XX 21 thereof, and the first digit X 30 of the partial product X 3 and the complementary signal XX 30 thereof, are input to the third-digit 5-3 compressor circuit 1 - 2 . And, the fourth digit X 13 of the partial product X 1 and the complementary signal XX 13 thereof, the third digit X 22 of the partial product X 2 and the complementary signal XX 22 thereof, the second digit X 31 of the partial product X 3 and the complementary signal XX 31 thereof, and the first digit X 40 of the partial product X 4 and the complementary signal XX 40 thereof, are input to the fourth-digit 5-3 compressor circuit 1 - 3 .

The fifth digit X 14 of the partial product X 1 and the complementary signal XX 14 thereof, the fourth digit X 23 of the partial product X 2 and the complementary signal XX 23 thereof, the third digit X 32 of the partial product X 3 and the complementary signal XX 32 thereof, and the second digit X 41 of the partial product X 4 and the complementary signal XX 41 thereof, are input to the fifth-digit 5-3 compressor circuit 1 - 4 . And, the sixth digit X 15 of the partial product X 1 and the complementary signal XX 15 thereof, the fifth digit X 24 of the partial product X 2 and the complementary signal XX 24 thereof, the fourth digit X 33 of the partial product X 3 and the complementary signal XX 33 thereof, and the third digit X 42 of the partial product X 4 and the complementary signal XX 42 thereof, are input to the sixth-digit compressor circuit 1 - 5 . Inputs are applied similarly thereafter.

Similarly, the CSA 4 is configured by connecting 5-3 compressor circuits 1 - 10 , 1 - 11 , 1 - 12 , 1 - 13 , . . . , shown in FIG. 1 by the lower-digit carry-outs CO. Each of the 5-3 compressor circuits 1 - 10 , 1 - 11 , 1 - 12 , 1 - 13 , . . . is connected to a bus which transmits the partial products X 5 , X 6 , X 7 , X 8 and the inverse signals XX 5 , XX 6 , XX 7 , XX 8 . Specifically, the first digit X 50 of the partial product X 5 and the complementary signal XX 50 thereof are input to the lowermost-digit (first digit) 5-3 compressor circuit 1 - 10 . The second digit X 51 of the partial product X 5 and the complementary signal XX 51 thereof, and the first digit X 60 of the partial product X 6 and the complementary signal XX 60 thereof are input to the second-digit 5-3 compressor circuit 1 - 11 .

The third digit X 52 of the partial product X 5 and the complementary signal XX 52 thereof, the second digit X 61 of the partial product X 6 and the complementary signal XX 61 thereof, and the first digit X 70 of the partial product X 7 and the complementary signal XX 70 thereof are input to the third-digit 5-3 compressor circuit 1 - 12 . The fourth digit X 53 of the partial product X 5 and the complementary signal XX 53 thereof, the third digit X 62 of the partial product X 6 and the complementary signal XX 62 thereof, the second digit X 71 of the partial product X 7 and the complementary signal XX 71 thereof, and the first digit X 80 of the partial product X 8 and the complementary signal XX 80 thereof are input to the fourth-digit 5-3 compressor circuit 1 - 13 . Inputs are applied similarly thereafter.

Further, the CSA 5 is configured by connecting 5-3 compressor circuits 1 - 20 , 1 - 21 , 1 - 22 , 1 - 23 , 1 - 24 , 1 - 25 , 1 - 26 , 1 - 27 , 1 - 28 , . . . , shown in FIG. 1 by the lower-digit carry-outs CO. The sum S 20 and XS 20 of the 5-3 compressor circuit 1 - 0 are input to the lowermost-digit (first digit) 5-3 compressor circuit 1 - 20 . The carry C 10 and XC 10 of the 5-3 compressor circuit 1 - 0 and sum S 11 and XS 11 of the 5-3 compressor circuit 1 - 1 are input to the second-digit 5-3 compressor circuit 1 - 21 . Thereafter inputs are similarly supplied, to the fifth-digit 5-3 compressor circuit 1 - 24 .

The carry C 14 and XC 14 of the 5-3 compressor circuit 1 - 4 , sum S 15 and XS 15 of the 5-3 compressor circuit 1 - 5 , and sum S 20 and XS 20 of the 5-3 compressor circuit 1 - 10 are input to the sixth-digit 5-3 compressor circuit 1 - 25 . Thereafter inputs are similarly supplied, from the seventh-digit 5-3 compressor circuit 1 - 26 .

In this way, by inputting from a bus line the four inputs and their complementary signals to each of the 5-3 compressor circuits 1 - 0 to 1 - 13 , 5-3 compressor circuits configured as shown in FIG. 1 can be made to operate. In such addition of partial products to perform multiplication, a plurality of the CSA cells are combined in states and used, so that faster operation can be achieved.

Second Embodiment

FIG. 3 is a circuit diagram of the 5-3 compressor circuit of a second embodiment of the invention, and FIG. 4 is a block diagram of a CSA employing the 5-3 compressor circuit of FIG. 3. In FIG. 3 and FIG. 4, portions which are the same as in FIG. 1 and FIG. 2 are assigned the same symbols.

As shown in FIG. 3, the 5-3 compressor circuits 1 - 0 , 1 - 1 , 1 - 2 , 1 - 3 are connected in series. Each of the 5-3 compressor circuits 1 - 1 ( 1 - 0 , 1 - 2 , 1 - 3 ) comprise a three-stage EOR circuit group. Each EOR circuit comprises a pair of pass transistors, and as shown in FIG. 10, the EOR output of A 1 and A 2 is computed as the sum of the product of A 1 with the inverse of A 2 and the product of the inverse of A 1 with A 2 .

In this embodiment, by using dual lanes employing both complementary signals only in the carry-out which is the critical path, high-speed operation is achieved while suppressing increases in chip area. That is, the inputs (A 1 , A 2 , A 3 , A 4 ), and the complementary signals (XA 1 , XA 2 , XA 3 , XA 4 ) obtained by inverting by the inverting gates 50 , 51 , 52 , 53 , are input to the first-stage EORs 10 , 12 , 14 , 16 . Hence the EOR circuit 12 outputs the EOR of A 1 and A 2 , EOR circuit 10 outputs the inverse of the EOR of A 1 and A 2 , EOR circuit 16 outputs the EOR of A 3 and A 4 , and EOR circuit 14 outputs the inverse of the EOR of A 3 and A 4 .

Similarly, the second-stage EOR circuit 22 outputs the EOR of A 1 , A 2 , A 3 and A 4 , and the EOR circuit 20 outputs the inverse of the EOR of A 1 , A 2 , A 3 and A 4 ; the third-stage EOR circuit 32 , on receiving these outputs, outputs as S (SUM), via the inverting gate 47 , the EOR of A 1 , A 2 , A 3 , A 4 , and CI. Similarly, the EOR circuit 34 receives XA 1 via the inverting gates 41 and 43 , the outputs of EOR circuits 20 and 22 , and XCI, and outputs, via the inverting gate 48 , the carry C shown in FIG. 10.

The EOR circuit 24 , which receives the output of EOR circuit 14 , the output of EOR circuit 16 , XA 2 , and XA 4 , outputs the carry-out CO of FIG. 10 via the inverting gate 44 . This carry-out CO is output as the carry-in CI of the upper-digit 5-3 compressor circuit 1 - 2 . In this way, the carry-out CO is the carry to the upper digit, and the carry C is the carry output to the next stage.

In order to generate the complementary signal of the carry-out output, the EOR circuit 26 , which receives the output of EOR circuit 14 , the output of EOR circuit 16 , A 2 , and A 4 , outputs the complementary signal XCO of the carry-out CO of FIG. 10 via the inverting gate 45 . This carry-out XCO is also output as the carry-in XCI of the upper-digit 5-3 compressor circuit 1 - 2 .

The other 5-3 compressor circuits 1 - 0 , 1 - 2 , 1 - 3 are similarly configured. In a circuit which uses numerous exclusive-OR circuits such as these 5-3 compressor circuits, by focusing on the fact that an EOR circuit can reduce to zero the delay time to generate a signal and its inverted signal, and making efficient use of these two signals, high-speed operation can be achieved.

By using dual lanes employing both complementary signals, although the number of transistors increases by the amount of the increased output (EOR 26 in FIG. 3), complementary signals are handled, so that the inverted signals of the signals need not be generated. That is, compared with the prior art in FIG. 9, the inverting gate 150 can be eliminated. Hence to this extent the number of transistors can be decreased, and consequently increases in the number of transistors can be suppressed.

Further, the carry-out output (CO) of each CSA cell 1 - 1 is connected to the carry-in input (CI) of the higher-digit CSA cell 1 - 2 ; the number of path stages, from the input to carry-out, through the higher-digit carry-in, the higher-digit SUM, up to the carry, is one greater than the path not passing through the carry-out and carry-in, and is the critical path. The carry-out complementary signal XCO is generated and passed to the higher digit in order to speed this path. As a result, operation on the critical path is made faster, and in terms of stages, the path from input to SUM and carry can be made the same, regardless of whether carry-out is passed.

FIG. 4 is a block diagram of a CSA using this circuit. A configuration performing the partial product addition operation of FIG. 8 is taken as an example. The first CSA 3 performs addition of each of the digits of the partial products X 1 , X 2 , X 3 , X 4 in FIG. 8, and the second CSA 4 performs addition of each of the digits of the partial products X 5 , X 6 , X 7 , X 8 in FIG. 8. The third CSA 5 performs addition of the addition results S 1 m to S 10 in FIG. 8, the carries for each digit C 1 m to C 10 , the addition results S 2 m to S 20 , and the carries for each digit C 2 m to C 20 , to obtain the addition results S 3 m to S 30 and carries C 3 m to C 30 .

The CSA 3 is configured by connecting 5-3 compressor circuits 1 - 0 , 1 - 1 , 1 - 2 , 1 - 3 , 1 - 4 , 1 - 5 shown in FIG. 3 by the lower-digit carry-outs CO, XCO. Each 5-3 compressor circuit 1 - 0 , 1 - 1 , 1 - 2 , 1 - 3 , 1 - 4 , 1 - 5 is connected to a-bus which transmits partial products X 1 , X 2 , X 3 , X 4 . Specifically, the first digit X 10 of the partial product X 1 is input to the lowermost-digit (first digit) 5-3 compressor circuit 1 - 0 . The second digit X 11 of the partial product X 1 and first digit X 20 of the partial product X 2 are input to the second-digit 5-3 compressor circuit 1 - 1 .

The third digit X 12 of the partial product X 1 , second digit X 21 of the partial product X 2 , and first digit X 30 of the partial product X 3 are input to the third-digit 5-3 compressor circuit 1 - 2 . The fourth digit X 13 of the partial product X 1 , third digit X 22 of the partial product X 2 , second digit X 31 of the partial product X 3 , and first digit X 40 of the partial product X 4 are input to the fourth-digit 5-3 compressor circuit 1 - 3 .

The fifth digit X 14 of the partial product X 1 , fourth digit X 23 of the partial product X 2 , third digit X 32 of the partial product X 3 , and second digit X 41 of the partial product X 4 are input to the fifth-digit 5-3 compressor circuit 1 - 4 . The sixth digit X 15 of the partial product X 1 , fifth digit X 24 of the partial product X 2 , fourth digit X 33 of the partial product X 3 , and third digit X 42 of the partial product X 4 are input to the sixth-digit 5-3 compressor circuit 1 - 5 . Inputs are applied similarly thereafter.

Similarly, the CSA 4 is configured by connecting 5-3 compressor circuits 1 - 10 , 1 - 11 , 1 - 12 , 1 - 13 , . . . , shown in FIG. 3 by the lower-digit carry-outs CO, XCO. Each 5-3 compressor circuit 1 - 10 , 1 - 11 , 1 - 12 , 1 - 13 , . . . , is connected to a bus which transmits partial products X 5 , X 6 , X 7 , X 8 . Specifically, the first digit X 50 of the partial product X 5 is input to the lowermost-digit (first digit) 5-3 compressor circuit 1 - 10 . The second digit X 51 of the partial product X 5 and first digit X 60 of the partial product X 6 are input to the second-digit 5-3 compressor circuit 1 - 11 .

The third digit X 52 of the partial product X 5 , second digit X 61 of the partial product X 6 , and first digit X 70 of the partial product X 7 are input to the third-digit 5-3 compressor circuit 1 - 12 . The fourth digit X 53 of the partial product X 5 , third digit X 62 of the partial product X 6 , second digit X 71 of the partial product X 7 , and first digit X 80 of the partial product X 8 are input to the fourth-digit 5-3 compressor circuit 1 - 13 . Inputs are applied similarly thereafter.

Further, the CSA 5 is configured by connecting 5-3 compressor circuits 1 - 20 , 1 - 21 , 1 - 22 , 1 - 23 , 1 - 24 , 1 - 25 , 1 - 26 , 1 - 27 , 1 - 28 , . . . , of FIG. 3 by the lower-digit carry-outs CO, XCO. The sum S 20 of the 5-3 compressor circuit 1 - 0 is input to the lowermost-digit (first digit) 5-3 compressor circuit 1 - 20 . The carry C 10 of the 5-3 compressor circuit 1 - 0 and the sum S 11 of the 5-3 compressor circuit 1 - 1 are input to the second-digit 5-3 compressor circuit 1 - 21 . Thereafter, inputs are similarly applied up to the fifth-digit 5-3 compressor circuit 1 - 24 .

The carry C 14 of 5-3 compressor circuit 1 - 4 , sum S 15 of 5-3 compressor circuit 1 - 5 , and sum S 20 of 5-3 compressor circuit 1 - 10 are input to the sixth-digit 5-3 compressor circuit 1 - 25 . Thereafter, inputs are similarly applied to the seventh-digit 5-3 compressor circuit 1 - 26 and subsequent 5-3 compressor circuits.

In this way, by inputting four inputs from the bus line to each of the 5-3 compressor circuits 1 - 0 to 1 - 13 , 5-3 compressor circuits configured as in FIG. 3 can be made to operate. Thus in addition of partial products to perform multiplication, a plurality of stages of these CSA cells is used, so that faster processing is possible while suppressing increases in area used.

FIG. 5 and FIG. 6 compare the numbers of inverters and transistors in the prior art of FIG. 9, the first embodiment of FIG. 1, and the second embodiment of FIG. 3. In the technology of the prior art in FIG. 9, three inverter stages and three pass transistor stages are required for carry output, and four inverter stages and three pass transistor stages are required for carry-out output, in one CSA cell (5-3 compressor circuit) stage. Hence when four stages, that is, four CSA cells (5-3 compressor circuits) are connected, 12-inverter stages and 12 pass transistor stages are required for carry output, and 16 inverter stages and 12 pass transistor stages are required for carry-out output.

On the other hand, in the first embodiment, three inverter stages and three pass transistor stages are required for the carry output, and three inverter stages and three pass transistor stages are required for the carry-out output, in one CSA cell (5-3 compressor circuit) stage. Hence when four stages, that is, four CSA cells (5-3 compressor circuits) are connected, only nine inverter stages and 12 pass transistor stages are required for carry output, and only nine inverter stages and 12 pass transistor stages are required for carry-out output.

Further, in the second embodiment, three inverter stages and three pass transistor stages are required for the carry output, and three inverter stages and three pass transistor stages are required for the carry-out output, in one CSA cell (5-3 compressor circuit) stage. Hence when four stages, that is, four CSA cells (5-3 compressor circuits) are connected, only 12 inverter stages and 12 pass transistor stages are required for carry output, and only 12 inverter stages and 12 pass transistor stages are required for carry-out output.

In this way, the number of pass transistor stages does not change, but the number of inverter stages can be decreased, and to this extent increases in delay time can be suppressed, and faster operation is possible. Further, in the first embodiment the number of inverter stages can be decreased the most, and faster operation is possible. Conversely in the second embodiment, the number of inverter stages can be decreased to a certain extent, and faster operation than in the prior art is possible, while increases in circuit scale can be suppressed, and increases in chip area can be suppressed.

Other Embodiments

In the above embodiments, examples of addition of the partial products in FIG. 7 and FIG. 8 were explained; but application to addition of partial products with other numbers of digits, and to addition of other numbers of partial products, is possible.

INDUSTRIAL APPLICABILITY

At least two carry-out EOR circuits, which output carry-out bits and the complementary signals thereof, are provided in a 5-3 compressor circuit comprising an EOR circuit group, and dual lanes are used at least for carry-out bits, so that the number of inverters required can be reduced, increases in delay time can be suppressed, and faster addition operation can be achieved, contributing to faster addition of partial products with numerous digits.