1 / 68

Lecture Note 5 Computer Arithmetic

Lecture Note 5 Computer Arithmetic. Pradondet Nilagupta (based on lecture note of Prof J. Kelly Flanagan). Computer Arithmetic. This lecture will introduce basic number representations and introduce basic integer and floating point arithmetic. Number Representation.

iman
Download Presentation

Lecture Note 5 Computer Arithmetic

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Lecture Note 5Computer Arithmetic Pradondet Nilagupta (based on lecture note of Prof J. Kelly Flanagan)

  2. Computer Arithmetic • This lecture will introduce basic number representations and introduce basic integer and floating point arithmetic.

  3. Number Representation • Any number can be represented in any base by the simple sum of each digit's value, positional notation: d * basei • The value 1011 (binary) equals 1 * 23 + 0 * 22 + 1 * 21 + 1 * 20 = 11 (decimal).

  4. Number Representation • as a 32 bit number this would be represented as follows: 0000 0000 0000 0000 0000 0000 0000 1011 • The most significant bit (MSB) is on the left and the least significant bit (LSB) is on the right. In the number above, the LSB is called bit 0 and the MSB is called bit 31.

  5. Range of Values • With 32 bits it is possible to have unsigned numbers that range from 0 to 232 - 1 = 4,294,967,295. • This is quite reasonable for address calculations of present machines, but future machines will surely have more than 232 memory locations. • In addition to more memory it is highly useful to be able to represent negative as well as positive integers and numbers smaller and larger then those possible with this format.

  6. Signed Integers • Since in the binary number system we have an even number of unique representations for a given number of bits we have two choices: 1. Have a balanced system with the same number of negative and positive values, but what about zero? If we represent zero we have an odd number of values to represent. This can be done by allowing zero to be represented by two different bit patterns.

  7. Signed Integers 2. Have an unbalanced system where zero has one representation, but there is either an extra positive or negative value. • We would like a balanced system, but this would require two different representations for the value zero. • This evil (having two zeros) is better to avoid and not worth the benefits of having a balanced system. Consequently, an unbalanced system has been nearly universally adopted.

  8. One's Complement • How can we represent negative and positive numbers in the binary system? • The 1's complement number system using 32 bits has a range from -(2^31 -1) to +(2^31 - 1). • Zero can be represented as either positive or negative. The two forms of zero are represented by 0000 0000 0000 0000 0000 0000 0000 0000 (all zeros) or 1111 1111 1111 1111 1111 1111 1111 1111 (all ones).

  9. One's Complement • A negative number is formed by representing its magnitude in normal, positive binary format and then inverting each bit. • 8-bit examples: 1111 1110 = -1 1111 1111 = -0 0000 0000 = +0 0000 0001 = +1 • Having two forms of zero is a problem, but arithmetic is simple.

  10. Two's Complement • The 2's complement number system using 32 bits has a range from -2^31 to 2^31 - 1. • Zero has only one representation: all zeros, but there is one extra negative value. • Negative numbers always have the MSB set to 1 -- thus making sign detection extremely simple. • Converting 2's complement numbers to signed decimal is extremely simple.

  11. Example 2’s complement Convert 1011 1010 to signed decimal. 1 * -27 + 0 * 26 + 1 * 25 + 1 * 24 + 1 * 23 + 0 * 22 + 1 * 21 + 0 * 20 = -128 + 0 + 32 + 16 + 8 + 0 + 2 + 0 = -70. • This approach is not feasible in hardware due to speed considerations.

  12. Two's Complement Shortcuts • Negation in two's complement is accomplished by inverting each bit of the number and then adding one to the result. • Example: Convert -70 (decimal) to two's complement binary. |-70| = 70 = 0100 0110 (binary) ~ 0100 0110 = 1011 1001 (where ~ denotes bit inversion) 1011 1001 + 1 = 1011 1010 = -70 (2's comp).

  13. Sign Extension • To add a 16-bit value to a 32-bit value, the 16-bit value must first be sign extended. The two numbers are right aligned and the sign bit of the shorter number is replicated to the left until the two numbers are equal in length.

  14. Example Sign Extension 0000 0000 0000 0000 0000 0000 0000 1011 + 1011 1010 we must sign extend the second number and then add: 0000 0000 0000 0000 0000 0000 0000 1011 + 1111 1111 1111 1111 1111 1111 1011 1011 ------------------------------------------------------------- 1 1111 1111 1111 1111 1111 1111 1100 0101 • The extra bit on the left is simply discarded, leaving us with 1111 1111 1111 1111 1111 1111 1100 0101 for our answer.

  15. Integer Addition • Straight forward approach consists of adding each bit together from right to left and propagating the carry from one bit to the next. Perform the operation 7 + 6 0111 + 0110 = 1101

  16. Integer Addition Add the numbers 2,147,483,647 + 2 0111 1111 1111 1111 1111 1111 1111 1111 0000 0000 0000 0000 0000 0000 0000 0010 This is known as overflow Overflow can be handled in several ways • An exception or interrupt can be asserted • A bit in a status register can be set • It can be completely ignored

  17. Integer Subtraction • Straight forward approach consists of negating one term and performing integer addition. Perform the operation 7 - 6 0111 - 0110 = 0001 Perform the operation 6 - 7 0110 - 0111 = 1111

  18. Integer Subtraction Add the numbers -2,147,483,647 - 2 1000 0000 0000 0000 0000 0000 0000 0001 1111 1111 1111 1111 1111 1111 1111 1110 This is also overflow Overflow can be handled in several ways • An exception or interrupt can be asserted • A bit in a status register can be set • It can be completely ignored

  19. Integer Overflow • Overflow can occur whenever the sum of two N bit numbers cannot be represented by another N bit number. • Unsigned numbers must be treated differently than signed numbers. • Unsigned numbers are usually used for address calculations and it is convenient to ignore overflow. • In many architectures this is accomplished by having arithmetic instructions that ignore the overflow condition.

  20. Longhand Multiplication • Multiply 1000 * 1001 (either decimal or binary will work!) Multiplicand 1000 Multiplier 1001 1000 0000 0000 1000 ------- Product 1001000

  21. Longhand Multiplication • The first number is the multiplicand and the second the multiplier. • The result is larger than either the multiplicand or the multiplier. If the size of the multiplicand is N bits and the size of the multiplier is M bits then the result is M+N bits long.

  22. Simple Multiplication Algorithm • The Multiplicand is stored in a 64 bit register and the Multiplier is stored in a 32 bit register while the product register is 64 bits long and initialized to zero. • Test bit 0 of the multiplier If this bit is 0 Continue If this bit is 1 Add the multiplicand to the product and store back in the product register Shift the multiplicand register left 1 bit Shift the multiplier register right 1 bit If this is the 32nd iteration END else continue

  23. MULTIPLY HARDWARE Version 1 • 64-bit Multiplicand reg, 64-bit ALU, 64-bit Product reg, 32-bit multiplier reg Shift Left 64-bit Multiplicand Shift Right 32-bit Multiplier 64-bit ALU Write 64-bit Product Control

  24. Start Multiply Algorithm Version 1 Multiplier0 = 1 TestMultiplier0 Multiplier0 = 0 1. Add multiplicand to product & place the result in Product register 2. Shift the Multiplicand register left 1 bit. 3. Shift the Multiplier register right 1 bit. 32nd repetition? No: < 32 repetitions Yes: 32 repetitions Done

  25. Try It Yourself, V1 • 4x4 bit Multiply, Version 1 • Show each step for 5 x 11 = 55 MultiplierMultiplicandProduct Operation 0101 0000 1011 0000 0000 initial load

  26. Solution, V1 • 5 x 11 = 55MultiplierMultiplicandProduct Operation 0101 0000 1011 0000 0000 initial load 0101 0000 0011 0000 1011 add 0010 0001 0110 0000 1011 shift 0010 0001 0110 0000 1011 no add 0001 0010 1100 0000 1011 shift 0001 0010 1100 0011 0111 add 0000 0101 1000 0011 0111 shift 0000 0101 1000 0011 0111 no add 0000 1011 0000 0011 0111 shift

  27. Observations on Multiply Version 1 • 1 clock per cycle --> 32 x 3 = 100 clocks per multiply • Ratio of multiply frequency to add is 5:1 to 100:1 • But remember Amdahl’s Law! • 1/2 bits in multiplicand always 0 • 64-bit adder is wasted • 0’s inserted in right side of multiplicand as shifted • least significant bits of product never changed once formed • Instead of shifting multiplicand to left, shift product to right?

  28. A Better Multiplier • In the previous algorithm half of the bits in the multiplicand register are zero and contain no needed information. This algorithm requires only a 32 bit ALU and multiplicand register If multiplier bit 0 = 0 Continue if multiplier bit 0 = 1 Add the multiplicand to the left half of the product register and store the result in the left half Shift the product and multiplier registers right 1 bit. If this is the 32nd iteration END otherwise continue

  29. MULTIPLY HARDWARE Version 2 • 32-bit Multiplicand reg, 32 -bit ALU, 64-bit Product reg, 32-bit Multiplier reg 32-bit Multiplicand 32-bit Multiplier Shift Right 32-bit ALU Shift Right 64-bit Product Control Write

  30. Start Multiply Algorithm Version 2 Multiplier0 = 1 TestMultiplier0 Multiplier0 = 0 1. Add multiplicand to the left half of product & place the result in the left half of Product register 2. Shift the Product register right 1 bit. 3. Shift the Multiplier register right 1 bit. 32nd repetition? No: < 32 repetitions Yes: 32 repetitions Done

  31. Try It Yourself, V2 • 4x4 bit Multiply, Version 2 • Show each step for 5 x 11 = 55 MultiplierMultiplicandProduct Operation 0101 1011 0000 0000 initial load

  32. Solution, V2 • 5 x 11 = 55MultiplierMultiplicandProduct Operation 0101 1011 0000 0000 initial load 0101 1011 1011 0000 add 0010 1011 0101 1000shift 0010 1011 0101 1000 no add 0001 1011 0010 1100shift 0001 1011 1101 1100 add 0000 1011 0110 1100shift 0000 1011 0110 1100 no add 0000 1011 0011 0111 shift

  33. Observations on Multiply Version 2 • Right half of product register wastes space that exactly matches size of multiplier • So put multiplier in right half of product register ?

  34. Another Multiplier • In the previous algorithm the lower bits of the product register are wasted and could be used to store the multiplier. If product bit 0 = 0 Continue if product bit 0 = 1 Add the multiplicand to the left half of the product register and store the result in the left half Shift the product register right 1 bit. If this is the 32nd iteration END otherwise continue

  35. MULTIPLY HARDWARE Version 3 • 32-bit Multiplicand reg, 32 -bit ALU, 64-bit Product / Multiplier reg 32-bit Multiplicand 32-bit ALU Shift Right 64-bit Product / Multiplier Control Write

  36. Start Multiply Algorithm Version 3 Product0 = 1 TestProduct0 Product0 = 0 1. Add multiplicand to the left half of product & place the result in the left half of Product register 2. Shift the Product register right 1 bit. 32nd repetition? No: < 32 repetitions Yes: 32 repetitions Done

  37. Try It Yourself, V3 • 4x4 bit Multiply, Version 3 • Show each step for 5 x 11 = 55 MultiplicandProduct / Multiplier Operation 1011 0000 0101 initial load

  38. Solution, V3 • 5 x 11 = 55Multiplicand Product / Multiplier Operation 1011 0000 0101 initial load 1011 10110101 add 1011 0101 1010shift 1011 0101 1010 no add 1011 0010 1101shift 1011 1101 1101 add 1011 0110 1100shift 1011 0110 1100 no add 1011 0011 0111 shift

  39. Observations on Multiply Version 3 • 2 steps per bit because Multiplier & Product combined • MIPS registers Hi and Lo are left and right half of Product • Gives us MIPS instruction MultU • What about signed multiplication? • easiest solution is to make both positive & remember whether to complement product when done (leave out the sign bit, run for 31 steps) • Booth’s Algorithm is more elegant way to multiply signed numbers using same hardware as before

  40. Booth's Multiplier • In the previous algorithms only unsigned numbers were dealt with. Booth's algorithm deals with signed numbers as well. If the current and previous bits are 00 -- no addition or subtraction 01 -- end of string of one's so add the multiplicand to the left half of the product 10 -- beginning of a string of one's so subtract the multiplicand from the left half of the product 11 -- middle of a set of one's no addition or subtraction Shift the product register right 1 bit. If this is the 32nd iteration END otherwise continue

  41. Example of Booth's Algorithm 5 = 00101 x -3 = x 11101 ---- ------- Product 00000 11101 0 subtract and shift 11011 11101 0 11101 11110 1 add and shift 00010 11110 1 00001 01111 0 subtract and shift 11100 01111 0 11110 00111 1 shift only 11111 00011 1 shift only 11111 10001 1 result = 10001 = -01111 = -15 (base10)

  42. Floating Point Numbers • Although 2's complement numbers have been used for nearly 25 years, many floating point number representations were used up until about 1985. • Different representations make it very difficult to transfer data from one machine to another. • A standard was and is needed.

  43. Reals and Floats Consider the following numbers: • 3.1415926 • 2.718 • 0.000000001 = 1 * 109 • 3,155,760,000 = 3.15576 * 109 • The first three numbers cannot be represented by binary integers for obvious reasons, and the fourth cannot because signed 32-bit values are not large enough.

  44. Reals and Floats • Binary numbers can also be represented in scientific notation. The general form of this is s 1.m * 10 ^ e (binary) • where s is either + or - representing the sign of the number, m is a string of 1s and 0s representing the fractional part of the mantissa, and e is a + or - string of 1s and 0s representing an exponent, or the number's order of magnitude. Note that this is almost identical in form to standard decimal scientific notation, except that the 10 represents 2 (decimal), not 10 (decimal).

  45. Single Precision Floating Point (FP) Numbers • Current computer systems dictate that FP numbers must fit in 32- or 64-bit registers. • 32-bit or single precision FP numbers are organized as follows: seee eeee emmm mmmm mmmm mmmm mmmm mmmm • where s is the sign of the number, e represents the biased exponent, and m represents the mantissa. • 32-bit values range in magnitude from 10-38 to 1038.

  46. Double Precision Floating Point Numbers • 64 bit double precision floating point numbers are organized as follows: • The MSB is the sign bit • The next 11 bits are the exponent • The remaining 52 bits are the significand • The range in magnitude is from 10-308 to 10308 • The growth of significand and exponent is a compromise between accuracy and range.

  47. Implied Normalization • In the IEEE 754 floating point standard, a leading 1 in the significand is assumed. This increases the effective length of the significand in both single and double precision arithmetic • Numbers are therefore represented by: (-1)S * (1.M) * 2E-127 where S is the sign, M is the significand and E is the exponent

  48. Why IEEE 754? Example if(x == 0.0) y = 17.0 else y = z/x Can this cause a divide by zero error?

  49. Why IEEE 754? • In the CDC6600 the adder and subtractor examine 13 bits while the multiplier/divider only examines 12 bits. Very small x's cause a problem. Solution: if(1.0 * x == 0.0) CRAY overflow, Why? y = 17.0 else y = z/x

  50. Floating Point Addition • It should be clear that addition and subtraction are performed in a similar manner. • The steps for adding two floating point numbers is as follows: 1.We must adjust the smaller number until its exponent matches that of the larger number. 2.We add the significands together and round 3.Then the result is normalized by shifting the significand left or right and adjusting the exponent. 4.The number must again be rounded

More Related