Multiplying Floating Point Numbers
Introduction
We'll do multiplication using the one byte floating point representation discussed in the
other class notes. IEEE 754 single precision has so many bits to work with that it's
simply easier to explain how floating point multiplication works using a small float
representation.
Multiplication is simple. Suppose you want to multiply two floating point
numbers, X and Y. Here's how:
1. First, convert the two representations to scientific notation. Thus, we explicitly
represent the hidden 1.
2. Let x be the exponent of X, and let y be the exponent of Y. The resulting exponent
(call it z) is the sum of the two exponents: z = x + y. z may need to be adjusted
after step 4.
3. Multiply the mantissa of X by the mantissa of Y. Call this result m.
4. If m does not have a single 1 to the left of the radix point, then adjust the radix
point so it does, and adjust the exponent z to compensate.
5. Add the sign bits, mod 2, to get the sign of the result.
6. Convert back to the one byte floating point representation, truncating bits if
needed.
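The steps above can be sketched in code. This is our own illustration (the name fp8_mul is made up), assuming the one byte layout of 1 sign bit, 4 exponent bits with a bias of 7, and 3 fraction bits, and ignoring denormals, overflow, and underflow:

```python
def fp8_mul(a, b):
    # Unpack: 1 sign bit, 4 exponent bits (bias 7), 3 fraction bits.
    sa, ea, fa = a >> 7, (a >> 3) & 0xF, a & 0x7
    sb, eb, fb = b >> 7, (b >> 3) & 0xF, b & 0x7

    # Step 1: make the hidden 1 explicit (mantissa is 1.fff, a 4-bit integer).
    ma, mb = 8 | fa, 8 | fb

    # Step 2: add the (unbiased) exponents.
    z = (ea - 7) + (eb - 7)

    # Step 3: multiply the mantissas. The 8-bit product has its radix
    # point after the top two bits (xx.yyyyyy).
    m = ma * mb

    # Step 4: renormalize so there is a single 1 left of the radix point.
    if m & 0x80:      # product is 1x.yyyyyy: shift right, bump the exponent
        m >>= 1
        z += 1

    # Step 5: the sign is the sum of the sign bits mod 2 (i.e., XOR).
    s = sa ^ sb

    # Step 6: repack, truncating the mantissa to 3 fraction bits.
    return (s << 7) | ((z + 7) << 3) | ((m >> 3) & 0x7)
```

Running this on the example below (X = 0 1001 010, Y = 0 0111 110) reproduces the result 0 1010 000.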
Example
Let's multiply the following two numbers (both signs are 0; the exponent uses a bias of 7):

Variable   sign   exponent   fraction
X          0      1001       010
Y          0      0111       110

In scientific notation, X = 1.010 x 2^2 and Y = 1.110 x 2^0. The sum of the
exponents is z = 2 + 0 = 2. Multiplying the mantissas gives 1.010 x 1.110 = 10.0011,
which has two digits left of the radix point, so we shift the radix point left by one
and adjust the exponent: the product is 1.00011 x 2^3. The sign is 0 + 0 = 0, mod 2.
Truncating the fraction to 3 bits and converting back:

Variable   sign   exponent   fraction
X*Y        0      1010       000
Negative Values
Unlike floating point addition, negative values are simple to take care of in floating
point multiplication. Treat each sign bit as a 1 bit unsigned binary (UB) number, and
add them modulo 2. This is the same as XORing the sign bits.
Floating Point Addition
Example 1
Let's add the following two numbers:

Variable   sign   exponent   fraction
X          0      1001       110
Y          0      0111       000

In scientific notation, X = 1.110 x 2^2 and Y = 1.000 x 2^0. Shifting Y's mantissa
right to match X's exponent gives 0.010 x 2^2. Adding the mantissas, 1.110 + 0.010 =
10.000, so we renormalize to 1.000 x 2^3 and convert back:

Variable   sign   exponent   fraction
X+Y        0      1010       000
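The align-add-renormalize procedure for addition can also be sketched in code. Again this is our own illustration (fp8_add is a made-up name), assuming the 1 sign / 4 exponent (bias 7) / 3 fraction layout, non-negative normalized inputs, 3 extra bits kept during alignment, and simple truncation at the end:

```python
def fp8_add(a, b):
    # Unpack the 1/4/3 layout; assumes both inputs are non-negative.
    ea, fa = (a >> 3) & 0xF, a & 0x7
    eb, fb = (b >> 3) & 0xF, b & 0x7
    ma, mb = 8 | fa, 8 | fb        # make the hidden 1 explicit: 1.fff

    # Make (ea, ma) the number with the larger exponent.
    if eb > ea:
        ea, ma, eb, mb = eb, mb, ea, ma

    # Align: give both mantissas 3 extra low bits, then shift the smaller
    # number's mantissa right by the exponent difference.
    shift = ea - eb
    ma <<= 3
    mb = (mb << 3) >> shift

    m = ma + mb                    # add the aligned mantissas
    z = ea

    # Renormalize: a carry means the sum is 1x.yyy..., so shift right
    # and bump the exponent.
    if m >> 7:
        m >>= 1
        z += 1

    # Truncate back to 3 fraction bits and repack (sign is 0).
    return (z << 3) | ((m >> 3) & 0x7)
```

On the two worked examples this gives 0 1010 000 for 1.110 x 2^2 + 1.000 x 2^0, and 0 1001 111 (that is, 1.111 x 2^2) for Example 2.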
Example 2
Let's add the following two numbers:

Variable   sign   exponent   fraction
X          0      1001       110
Y          0      0110       110

In scientific notation, X = 1.110 x 2^2 and Y = 1.110 x 2^-1. The exponents differ
by 3, so we shift Y's mantissa right three places: 0.00111 x 2^2 (keeping two extra
bits). Adding the mantissas gives 1.110 + 0.00111 = 1.11111 x 2^2, which has two more
fraction bits than we can store.
However, for simplicity, we're going to truncate the additional two bits. After
truncating, we get 1.111 x 2^2. We convert this back to floating point.
Sum

Variable   sign   exponent   fraction
X+Y        0      1001       111

(The biased exponent is 2 + 7 = 9, which is 1001.)
This example illustrates what happens if the exponents are separated by too much. In
fact, if the exponents differ by 4 or more, then effectively, you are adding 0 to the
larger of the two numbers.
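To see why a gap of 4 or more makes the smaller number vanish, here is a small self-contained sketch (the helper name add_mantissas and the choice of 3 guard bits are ours, not from the notes). Mantissas are integers scaled by 8, so 1.110 is 14:

```python
def add_mantissas(mx, my, exp_diff, frac_bits=3):
    # Align the smaller addend, keeping frac_bits extra guard bits.
    guard = frac_bits
    aligned = (my << guard) >> exp_diff
    total = (mx << guard) + aligned
    # Renormalize if the sum carried into the 2s place.
    # (In a full adder the exponent would also be bumped here.)
    if total >> (2 * frac_bits + 1):
        total >>= 1
    return total >> guard   # truncate back to frac_bits fraction bits

print(add_mantissas(14, 15, 3))  # 15: Y still changes the result
print(add_mantissas(14, 15, 4))  # 14: Y vanishes; X is unchanged
```

With 3 stored fraction bits, a gap of 4 pushes all of Y's bits below the positions that survive the final truncation, so X comes back unchanged.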
Negative Values
So far, we've only considered adding two non-negative numbers. What happens with
negative values?
If you're doing it on paper, then you proceed with the sum as usual. Just do normal
addition or subtraction.
If it's in hardware, you would probably convert the mantissas to two's complement
and perform the addition, while keeping track of the radix point (read about fixed
point representation).
Bias
Does the bias representation help us in floating point addition? The main difficulty
lies in computing the difference of the exponents. Still, that's not so bad, because we
can just do unsigned subtraction. For the most part, the bias doesn't pose too many
problems.
Overflow/Underflow
It's possible for a result to overflow (a result that's too large to be represented) or
underflow (smaller in magnitude than the smallest denormal, but not zero). Real
hardware has rules to handle this. We won't worry about it much, except to
acknowledge that it can happen.
Summary
Adding two floating point values isn't so difficult. It basically consists of adjusting the
exponent of the number with the smaller exponent (call it Y) to match that of the larger
(call it X), shifting the mantissa of Y right to compensate.
Once the addition is done, we may have to renormalize and truncate bits if there are
too many bits to be represented.
If the difference in the exponents is too great, then adding X + Y effectively
results in X.
Real floating point hardware uses more sophisticated means to round the summed
result. We take the simplification of truncating bits if there are more bits than can be
represented.