0% found this document useful (0 votes)

11 views21 pages

Lecture 4

Uploaded by

jam khan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views21 pages

Lecture 4

Uploaded by

jam khan

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

Representing and Manipulating Information

Floating-Point Number Representation

 A floating-point number (or real number) can represent a very large value
(e.g., 1.23×10^88)
 or a very small value (e.g., 1.23×10^-88).
 It could also represent very large negative number (e.g., -1.23×10^88)
 and very small negative number (e.g., -1.23×10^-88), as well as zero, as
illustrated:
Floating-Point Number Representation
 A floating-point number is typically expressed in the scientific notation,
 With a fraction (F), and an exponent (E) of a certain radix (r), in the form of
F×rÊ.
 Decimal numbers use radix of 10 (F×10Ê);
 While binary numbers use radix of 2 (F×2Ê).
 Representation of floating point number is not unique.
 For example, the number 55.66 can be represented as 5.566×101,
0.5566×102, 0.05566×103, and so on.
Floating-Point Number Representation
 The fractional part can be normalized.
 In the normalized form, there is only a single non-zero digit before the
radix point.
 For example, decimal number 123.4567 can be normalized as
1.234567×102;
 binary number 1010.1011B can be normalized as 1.0101011B×23.
Floating-Point Number Representation
 It is important to note that floating-point numbers suffer from loss of
precision.
 When represented with a fixed number of bits (e.g., 32-bit or 64-bit).
 This is because there are infinite number of real numbers (even within a
small range of says 0.0 to 0.1).
 On the other hand, a n-bit binary pattern can represent a finite 2n distinct
numbers.
 Hence, not all the real numbers can be represented.
 The nearest approximation will be used instead, resulted in loss of
accuracy.
Floating-Point Number Representation
 Floating number arithmetic is very much less efficient than integer
arithmetic.
 It could be speed up with a so-called dedicated floating-point co-
processor.
 Hence, use integers if your application does not require floating-point
numbers.
Floating-Point Number Representation
 In computers, floating-point numbers are represented in scientific
notation of fraction (F) and exponent (E) with a radix of 2, in the form of
F×2Ê.
 Both E and F can be positive as well as negative.
 Modern computers adopt IEEE 754 standard for representing floating-
point numbers.
 There are two representation schemes: 32-bit single-precision and 64-bit
double-precision.
IEEE-754 32-bit Single-Precision Floating-Point Numbers
 In 32-bit single-precision floating-point representation:
 The most significant bit is the sign bit (S),
 with 0 for positive numbers and 1 for negative numbers.
 The following 8 bits represent exponent (E).
 The remaining 23 bits represents fraction (F).
Normalized Form
 Let's illustrate with an example, suppose that the 32-bit pattern is,
 1 1000 0001 011 0000 0000 0000 0000 0000, with:
 S=1
 E = 1000 0001
 F = 011 0000 0000 0000 0000 0000
Normalized Form
 In the normalized form, the actual fraction is normalized with an implicit
leading 1 in the form of 1.F.
 In this example, the actual fraction is 1.011 0000 0000 0000 0000 0000 = 1
+ 1×2-2 + 1×2-3 = 1.375D.
 The sign bit represents the sign of the number,
 with S=0 for positive and S=1 for negative number.
 In this example with S=1, this is a negative number, i.e., -1.375D.
Normalized Form
 The exponent field is interpreted as representing a signed integer in
biased form.
 That is, the exponent value is E = e − Bias,
 where e is the unsigned number having bit representation ek−1 . . . e1e0
 and Bias is a bias value equal to 2k−1 − 1.
 This yields exponent ranges from −126 to +127.
Normalized Form
 Why set the bias this way for denormalized values?
 Having the exponent value be 1 − Bias rather than simply −Bias.
 it provides for smooth transition from denormalized to normalized values.
Normalized Form
 In this example, E=e-127=129-127=2D.
 Hence, the number represented is -1.375×22=-5.5D.
De-Normalized Form
 Normalized form has a serious problem, with an implicit leading 1 for the
fraction,
 it cannot represent the number zero.
 When the exponent field is all zeros, the represented number is in
denormalized form.
 In this case, the exponent value is E = 1 − Bias.
 The value of the fraction field without an implied leading 1.
De-Normalized Form
 Denormalized numbers serve two purposes.
 First, they provide a way to represent numeric value 0,
 Since with a normalized number we must always have F ≥ 1, and hence we
cannot represent 0.
 In fact, the floating-point representation of +0.0 has a bit pattern of all
zeros: the sign bit is 0,
 the exponent field is all zeros (indicating a denormalized value), and the
fraction field is all zeros, giving F = 0.
De-Normalized Form
 when the sign bit is 1, but the other fields are all zeros, we get the value
−0.0.
 A second function of denormalized numbers is to represent numbers that
are very close to 0.0.
De-Normalized Form
 We can also represent very small positive and negative numbers in de-
normalized form with E=0.
 For example, if S=1, E=0, and F=011 0000 0000 0000 0000 0000.
 The actual fraction is 0.011=1×2-2+1×2-3=0.375D.
 Since S=1, it is a negative number.
 With E=0, the actual exponent is -126.
 Hence the number is -0.375×2-126 = -4.4×10-39,
 which is an extremely small negative number (close to zero).
Special Values
 A final category of values occurs when the exponent field is all ones.
 When the fraction field is all zeros, the resulting values represent infinity,
 either +∞ when s = 0 or −∞ when s = 1.
 Infinity can represent results that overflow, as when we multiply two very
large numbers, or when we divide by zero.
 When the fraction field is nonzero, the resulting value is called a NaN,
short for “not a number.”
IEEE-754 64-bit Double-Precision Floating-Point Numbers
 The representation scheme for 64-bit double-precision is similar to the 32-
bit single-precision:
 The most significant bit is the sign bit (S), with 0 for positive numbers and
1 for negative numbers.
 The following 11 bits represent exponent (E).
 The remaining 52 bits represents fraction (F).
IEEE-754 64-bit Double-Precision Floating-Point Numbers
 The value (N) is calculated as follows:
 Normalized form: For 1 ≤ E ≤ 2046, N = (-1)^S × 1.F × 2^(E-1023).
 Denormalized form: For E = 0, N = (-1)^S × 0.F × 2^(-1022). These are in the
denormalized form.
 For E = 2047, N represents special values, such as ±INF (infinity), NaN (not
a number).

Ruaumoko 2 D
No ratings yet
Ruaumoko 2 D
101 pages
Floating Point Arithmetic
100% (1)
Floating Point Arithmetic
30 pages
L2-Variables and Floating Point Number System
No ratings yet
L2-Variables and Floating Point Number System
38 pages
08-FloatingPoint
No ratings yet
08-FloatingPoint
52 pages
VLSI Implementation of Floating Point Adder
100% (1)
VLSI Implementation of Floating Point Adder
46 pages
ML System Optimization Lecture 11 Quantization
No ratings yet
ML System Optimization Lecture 11 Quantization
150 pages
Lecture 5
No ratings yet
Lecture 5
68 pages
Chap 1 Java 6 TH
No ratings yet
Chap 1 Java 6 TH
74 pages
Week 3
No ratings yet
Week 3
66 pages
Com Prog
No ratings yet
Com Prog
42 pages
COA UNIT-III PPTs Dr.G.Bhaskar ECE
No ratings yet
COA UNIT-III PPTs Dr.G.Bhaskar ECE
64 pages
Lecture 1
No ratings yet
Lecture 1
32 pages
CS501-Advance Computer Architecture: Solved MCQS From Final Term Papers
No ratings yet
CS501-Advance Computer Architecture: Solved MCQS From Final Term Papers
27 pages
Lecture 3
No ratings yet
Lecture 3
50 pages
LEC03 Data II
No ratings yet
LEC03 Data II
45 pages
3. Floating_Point_Number
No ratings yet
3. Floating_Point_Number
36 pages
u-4
No ratings yet
u-4
21 pages
4-Floating-Point-inclass
No ratings yet
4-Floating-Point-inclass
33 pages
Advanced PLC Presentation
No ratings yet
Advanced PLC Presentation
60 pages
Lecture 2
No ratings yet
Lecture 2
23 pages
CH03-Data-II(2) (2)
No ratings yet
CH03-Data-II(2) (2)
31 pages
Floating Point Representation: Reading: B&O 2.4
No ratings yet
Floating Point Representation: Reading: B&O 2.4
44 pages
Electromagnetic Flowmeter MODBUS Communication Protocol
No ratings yet
Electromagnetic Flowmeter MODBUS Communication Protocol
17 pages
L1 FloatingPointNumbers Intro
No ratings yet
L1 FloatingPointNumbers Intro
17 pages
Open PL - I Language Reference Manual
100% (1)
Open PL - I Language Reference Manual
166 pages
Lecture Notes 01-Introduction and Error Analysis (Print Version)
No ratings yet
Lecture Notes 01-Introduction and Error Analysis (Print Version)
37 pages
Lect4 Floats
No ratings yet
Lect4 Floats
64 pages
Weintek Macro Manual
No ratings yet
Weintek Macro Manual
123 pages
"The Course That Gives CMU Its Zip!": Topics
No ratings yet
"The Course That Gives CMU Its Zip!": Topics
30 pages
Floating Point & fixed point Representation_BCA II
No ratings yet
Floating Point & fixed point Representation_BCA II
24 pages
Python Practical Synopsis
No ratings yet
Python Practical Synopsis
18 pages
Lec07 - Computer Arithmetic - Floating-Point Representation and Arithmetic
No ratings yet
Lec07 - Computer Arithmetic - Floating-Point Representation and Arithmetic
42 pages
Data Representation
No ratings yet
Data Representation
19 pages
Introduction To Polynomials Chaos With NISP: Michael Baudin (EDF) Jean-Marc Martinez (CEA) January 2013
No ratings yet
Introduction To Polynomials Chaos With NISP: Michael Baudin (EDF) Jean-Marc Martinez (CEA) January 2013
52 pages
3-EED220 Lecture 3
No ratings yet
3-EED220 Lecture 3
22 pages
5 Data - Floating - Point v1
No ratings yet
5 Data - Floating - Point v1
25 pages
Lecture 2
No ratings yet
Lecture 2
27 pages
Lecture 4
No ratings yet
Lecture 4
21 pages
Week2 CM MDL CP1212
No ratings yet
Week2 CM MDL CP1212
13 pages
Floating-Point Numbers
No ratings yet
Floating-Point Numbers
23 pages
Week8 Slides
No ratings yet
Week8 Slides
43 pages
Unit 2
No ratings yet
Unit 2
16 pages
Soft Computing
No ratings yet
Soft Computing
38 pages
Fixed & Floating Point
No ratings yet
Fixed & Floating Point
31 pages
Mod 2
No ratings yet
Mod 2
26 pages
COMP0068 Lecture10 High Level Data Types
No ratings yet
COMP0068 Lecture10 High Level Data Types
25 pages
Calculo de Tendencia
No ratings yet
Calculo de Tendencia
35 pages
Module 1 Data Rep
No ratings yet
Module 1 Data Rep
14 pages
8.3 Floating Point Numbers
No ratings yet
8.3 Floating Point Numbers
19 pages
Cosc 2150: Computer Organization: Chapter 9, Part 3 Floating Point Numbers
No ratings yet
Cosc 2150: Computer Organization: Chapter 9, Part 3 Floating Point Numbers
39 pages
Module 2 - PART D Floating
No ratings yet
Module 2 - PART D Floating
30 pages
Computer Architecture and Organization: Lecture 6: Floating Points
No ratings yet
Computer Architecture and Organization: Lecture 6: Floating Points
20 pages
"The Course That Gives CMU Its Zip!": Topics
No ratings yet
"The Course That Gives CMU Its Zip!": Topics
30 pages
Floating Point
No ratings yet
Floating Point
13 pages
Floating Points
No ratings yet
Floating Points
31 pages
3D STAAD-Pro Help Editor Manual - 2017: January 2017
No ratings yet
3D STAAD-Pro Help Editor Manual - 2017: January 2017
49 pages
Floating Point Sept 6, 2006 15-213: "The Course That Gives CMU Its Zip!"
No ratings yet
Floating Point Sept 6, 2006 15-213: "The Course That Gives CMU Its Zip!"
34 pages
Chapter2 2.5
No ratings yet
Chapter2 2.5
34 pages
Asm 6809
No ratings yet
Asm 6809
8 pages
Lecture 02 - Floating Point Arithmetic
No ratings yet
Lecture 02 - Floating Point Arithmetic
14 pages
Unit-Iv Computer Arithmetic
No ratings yet
Unit-Iv Computer Arithmetic
16 pages
EE 109 Unit 20: IEEE 754 Floating Point Representation Floating Point Arithmetic
No ratings yet
EE 109 Unit 20: IEEE 754 Floating Point Representation Floating Point Arithmetic
31 pages
#3 - Floating Point
No ratings yet
#3 - Floating Point
38 pages
Compiler Design Lab Manual
0% (1)
Compiler Design Lab Manual
59 pages
2.4 Floating Point Representation
No ratings yet
2.4 Floating Point Representation
7 pages
Floating Point
No ratings yet
Floating Point
2 pages
Computer Arithmetic Representations
No ratings yet
Computer Arithmetic Representations
24 pages
C Basics
0% (1)
C Basics
23 pages
Fixed Point and Floating Point Number Representations
No ratings yet
Fixed Point and Floating Point Number Representations
5 pages
The World Is Not Just Integers: Programming Languages Support Numbers With Fraction
No ratings yet
The World Is Not Just Integers: Programming Languages Support Numbers With Fraction
51 pages
ECE 252 - Quiz - 1 - Solutions
No ratings yet
ECE 252 - Quiz - 1 - Solutions
5 pages
ARCh Presentation1
No ratings yet
ARCh Presentation1
12 pages
Fixed and Floating Point Error Analysis of
No ratings yet
Fixed and Floating Point Error Analysis of
4 pages
Arduino Programming (The Beginning)
No ratings yet
Arduino Programming (The Beginning)
33 pages
Floating Point Arithmetic Class
No ratings yet
Floating Point Arithmetic Class
24 pages
Floating Point
No ratings yet
Floating Point
6 pages
Fixed and Floating Point Representation
No ratings yet
Fixed and Floating Point Representation
5 pages
C Data Types
No ratings yet
C Data Types
3 pages
"The Course That Gives CMU Its Zip!": Topics
No ratings yet
"The Course That Gives CMU Its Zip!": Topics
31 pages
Computer Arithmetic Representations
No ratings yet
Computer Arithmetic Representations
24 pages
Floating Point Representation of Data: By-Astha Jain Class-It1 0827IT171019
No ratings yet
Floating Point Representation of Data: By-Astha Jain Class-It1 0827IT171019
16 pages
Floating Point Representation - M.eng Term Paper
No ratings yet
Floating Point Representation - M.eng Term Paper
6 pages
The IEEE Standard For Floating Point Arithmetic
No ratings yet
The IEEE Standard For Floating Point Arithmetic
9 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
8 pages
This Unit: Arithmetic and ALU Design Floating Point Arithmetic
No ratings yet
This Unit: Arithmetic and ALU Design Floating Point Arithmetic
8 pages
PHP UNIT 01-Notes PDF
100% (1)
PHP UNIT 01-Notes PDF
62 pages
Floating Point Numbers
No ratings yet
Floating Point Numbers
5 pages
Number Representation
No ratings yet
Number Representation
7 pages
SDM230 Protocol
No ratings yet
SDM230 Protocol
12 pages
Fixed Point and Floating Point Number Representations
No ratings yet
Fixed Point and Floating Point Number Representations
7 pages
IEEE 754 Floating Point Notes
No ratings yet
IEEE 754 Floating Point Notes
4 pages
Answer:: Mean Static - Cast (Float) (Total) /value? What Do You Think Will Happen If It Is Removed?
No ratings yet
Answer:: Mean Static - Cast (Float) (Total) /value? What Do You Think Will Happen If It Is Removed?
2 pages
Previous Year TCS Coding Question
50% (2)
Previous Year TCS Coding Question
19 pages
Master Fracions Addition, Subtraction And Multiplication: Math Childern Book
From Everand
Master Fracions Addition, Subtraction And Multiplication: Math Childern Book
Mourad Boufadene
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Lecture 4

Uploaded by

Lecture 4

Uploaded by

Representing and Manipulating Information

Floating-Point Number Representation

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.