
Chapter 12

Pipelining
• Strategies

• Performance

• Hazards
Example Register Organizations
Pentium 4 Organization
PowerPC Register Organization
Simple Instruction Cycle Model

Where is the time spent here?


Faster Processing

Can be achieved through:
— Faster cycle time
— Dividing the cycle into more states
— Implementing parallelism
Prefetch

Consider the instruction sequence as:
— Fetch instruction
— Execute instruction (often does not access main memory)

Can the computer fetch the next instruction during execution of the current instruction?
— This is called instruction pre-fetch

What are the implications of pre-fetch?

A Two Stage Instruction Pipeline

What additional hardware is required for pre-fetch?
Improved Performance with Prefetch
• Improved speed, but not doubled. Why?
— Fetch is usually shorter than execution
— Any jump or branch means that the prefetched instructions are not the required instructions

• Could we prefetch more than one instruction?

• Could we add “more stages” to further improve performance?
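
To see why the improvement falls short of 2x, compare the total time with and without overlap. A minimal sketch in Python (the stage times of 1 and 2 units and the instruction count are illustrative assumptions, not figures from the slides):

    # Two-stage overlap: fetch of instruction i+1 proceeds during execution of i.
    FETCH, EXECUTE = 1, 2                     # assumed relative stage times
    n = 100                                   # assumed number of instructions

    serial_time = n * (FETCH + EXECUTE)
    # With overlap, only the first fetch is exposed; after that each instruction
    # costs max(FETCH, EXECUTE), since the shorter stage hides behind the longer one.
    overlapped_time = FETCH + n * max(FETCH, EXECUTE)

    print(serial_time, overlapped_time, round(serial_time / overlapped_time, 2))
    # -> 300 201 1.49: faster, but not doubled, because fetch is shorter than execution.
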
Instruction Cycle with Indirect Addressing

What is the benefit of this organization?


Five State Instruction Cycle

• Fetch instruction
• Decode instruction
• Fetch operands (calculate address & get data)
• Execute (process data)
• Write results (calculate address & store data)
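
The five states map directly onto a fetch–decode–execute loop. A toy sketch in Python (the single-accumulator machine, instruction format, and three-instruction program are invented purely for illustration):

    # Toy memory: each "instruction" is (opcode, address); data lives at higher addresses.
    memory = {0: ("LOAD", 10), 1: ("ADD", 11), 2: ("STORE", 12), 10: 5, 11: 7, 12: 0}
    acc, pc = 0, 0

    for _ in range(3):                      # run the 3-instruction program
        instr = memory[pc]                  # 1. Fetch instruction
        opcode, addr = instr                # 2. Decode instruction
        operand = memory[addr]              # 3. Fetch operand (direct addressing assumed)
        if opcode == "LOAD":                # 4. Execute (process data)
            acc = operand
        elif opcode == "ADD":
            acc = acc + operand
        if opcode == "STORE":               # 5. Write result back to memory
            memory[addr] = acc
        pc += 1

    print(acc, memory[12])                  # -> 12 12
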
Instruction Cycle State Diagram
Pipelining
Consider the instruction sequence as:
— Fetch Instruction (FI)
— Decode Instruction (DI)
— Calculate Operands (CO)
— Fetch Operands (FO)
— Execute Instruction (EI)
— Write Operand (WO)
— Check for Interrupt (CI)

Consider it as an “assembly line” of operations. Then we can begin the next instruction’s assembly-line sequence before the last one has finished: in fact, we can fetch the next instruction while the present one is being decoded.

This is pipelining.
Pipeline “stations”

Let’s define a possible set of pipeline stations:
• Fetch Instruction (FI)
• Decode Instruction (DI)
• Calculate Operand Addresses (CO)
• Fetch Operands (FO)
• Execute Instruction (EI)
• Write Operand (WO)
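
To make the timing diagram on the next slide concrete, a short Python sketch can print which station each instruction occupies in each clock cycle (the nine-instruction count and the ideal no-stall assumption are mine):

    STAGES = ["FI", "DI", "CO", "FO", "EI", "WO"]
    N_INSTRUCTIONS = 9                      # assumed, to mirror the textbook diagram

    total_cycles = len(STAGES) + N_INSTRUCTIONS - 1
    for cycle in range(1, total_cycles + 1):
        row = []
        for i in range(1, N_INSTRUCTIONS + 1):
            stage_index = cycle - i         # instruction i enters FI in cycle i
            if 0 <= stage_index < len(STAGES):
                row.append(f"I{i}:{STAGES[stage_index]}")
        print(f"cycle {cycle:2d}: " + "  ".join(row))
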
Possible Timing Diagram for Instruction Pipeline Operation

Limitations:
— the clock is set by the maximum time needed by any stage,
— some stages are unnecessary for some instructions, and
— the overhead of transfers between stages
The Impact of a Conditional Branch on
Instruction Pipeline Operation
Instruction 3 is a conditional branch to instruction 15:
Alternative Pipeline View
Instruction 3 is a conditional branch to instruction 15:
Pipeline Flowchart for Branches
Speedup Factors with Instruction Pipelining
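
The speedup curves follow from the usual k-stage model: n instructions on a k-stage pipeline take (k + n − 1) cycles instead of n·k, so the speedup is nk / (k + n − 1). A quick check in Python (the particular n and k values are arbitrary examples):

    def pipeline_speedup(n, k):
        """Ideal speedup of a k-stage pipeline over a non-pipelined machine
        executing n instructions (no stalls, equal stage times)."""
        return (n * k) / (k + n - 1)

    for n in (1, 10, 100, 1000):
        print(n, round(pipeline_speedup(n, k=6), 2))
    # Speedup approaches k (here 6) as n grows, matching the speedup-factor curves.
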
Pipeline Hazards

Types of Pipeline Hazards:

— Structural (or Resource)

— Data

— Control
Structural Hazards

Structural hazards occur when instructions in the pipeline need the same resource:
— Memory
— CPU
— Etc.
Example: Resource Hazard

Fetch of I3 has to stall for the memory access of I1’s operand.


Data Hazard

Data hazards occur when there is a conflict in the access of:
— a memory location, or
— a register
Types of Data Hazards

• Read after Write (RAW) – true dependency
— A hazard occurs if the Read happens before the Write is complete

• Write after Read (WAR) – anti-dependency
— A hazard occurs if the Write happens before the Read

• Write after Write (WAW) – output dependency
— A hazard occurs if the two Writes happen in the reverse of the intended order
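
The three cases can be illustrated with two-statement fragments (the variable names are arbitrary):

    b, c, e = 1, 2, 3

    # RAW (true dependency): the second statement reads what the first writes.
    a = b + c        # writes a
    d = a + e        # reads a -- must see the value written above

    # WAR (anti-dependency): the second statement writes what the first reads.
    f = a + 1        # reads a
    a = 0            # writes a -- must not happen before the read above

    # WAW (output dependency): both statements write the same location.
    g = b * c        # first write to g
    g = e * 2        # second write to g -- the final value must be this one
    print(a, d, f, g)
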
Example: RAW Data Hazard

The second instruction has to stall until EAX has been written by the first instruction before it can fetch that operand.

Is there a way of stalling one cycle instead of two?
The Other Data Hazards

• Write after Read (WAR) – anti-dependency
— A hazard occurs if the Write happens before the Read
– Example?

• Write after Write (WAW) – output dependency
— A hazard occurs if the two Writes happen in the reverse of the intended order
– Example?
Control Hazard

Control hazards occur when a wrong fetch decision results in a new instruction fetch and the pipeline being flushed.

Solutions include:
— Multiple Pipeline streams
— Prefetching the branch target
— Using a Loop Buffer
— Branch Prediction
— Delayed Branch
— Reordering of Instructions
— Multiple Copies of Registers
— Get branch target early
Multiple Streams
• Have two pipelines
• Prefetch each branch path into a separate pipeline
• Use the appropriate pipeline once the branch is resolved

Challenges:
• Leads to bus & register contention
• Multiple branches lead to further pipelines being needed
Prefetch Branch Target

• The target of the branch is prefetched in addition to the instructions following the branch

• Keep the target until the branch is executed
Using a Loop Buffer

Have a small, fast memory that holds the last n instructions fetched – perhaps already decoded.

This buffer will often contain loops that are executed repeatedly.
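
A minimal sketch of the idea, assuming the buffer simply keeps the most recently fetched instructions keyed by address (the capacity of 8 and the fetch_from_memory callback are assumptions for illustration, not a description of any real design):

    from collections import OrderedDict

    class LoopBuffer:
        def __init__(self, capacity=8, fetch_from_memory=None):
            self.capacity = capacity
            self.fetch_from_memory = fetch_from_memory   # fallback when we miss
            self.buffer = OrderedDict()                  # address -> instruction

        def fetch(self, address):
            if address in self.buffer:                   # hit: e.g. a loop branch target
                return self.buffer[address]
            instruction = self.fetch_from_memory(address)
            self.buffer[address] = instruction
            if len(self.buffer) > self.capacity:         # evict the oldest entry
                self.buffer.popitem(last=False)
            return instruction

    # Usage sketch: repeated fetches of a short loop hit the buffer after the first pass.
    memory = {addr: f"instr@{addr}" for addr in range(0, 64, 4)}
    lb = LoopBuffer(capacity=8, fetch_from_memory=memory.get)
    for _ in range(3):
        for addr in (0, 4, 8, 12):       # a 4-instruction loop body
            lb.fetch(addr)
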
Loop Buffer
Branch Prediction

• Predict branch never taken
• Predict branch always taken
• Predict by opcode
• Use a predict taken / not taken switch
• Maintain a branch history table
• Get help from the compiler
Predict Branch Taken / Not Taken
• Predict never taken
— Assume that the jump will not happen
— Always fetch the next instruction

• Predict always taken
— Assume that the jump will happen
— Always fetch the target instruction

Which is better – considering possible page faults?
Branch Prediction by Opcode / Switch
• Predict by opcode
— Some instructions are more likely to result in a jump than others
— Can achieve up to 75% success with this strategy

• Taken / not-taken switch
— Based on previous history
— Good for loops
— Perhaps good to match programmer style
Branch Prediction Flowchart
Branch Prediction State Diagram
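
The state diagram is commonly realized as a 2-bit saturating counter per branch, so two mispredictions in a row are needed to flip the prediction. A sketch of that behavior (the 0–3 encoding and the training sequence are assumed for illustration):

    class TwoBitPredictor:
        # 0,1 = predict not taken; 2,3 = predict taken
        def __init__(self, state=2):
            self.state = state

        def predict(self):
            return self.state >= 2          # True means "predict taken"

        def update(self, taken):
            if taken:
                self.state = min(3, self.state + 1)
            else:
                self.state = max(0, self.state - 1)

    p = TwoBitPredictor()
    outcomes = [True] * 9 + [False]         # a loop branch taken 9 times, then it exits
    hits = 0
    for taken in outcomes:
        hits += (p.predict() == taken)
        p.update(taken)
    print(f"{hits}/{len(outcomes)} predicted correctly")   # only the final exit is missed
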
Maintain Branch Table

• Perhaps maintain a cache table with three fields per entry:
— Address of branch
— History of branching
— Targets of branch
Branch History Table
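
Such a table can be sketched as a dictionary keyed by branch address, holding a 2-bit history and the last known target (the field layout and the 4-byte instruction size are assumptions; a real table is a set-associative hardware structure):

    branch_table = {}   # branch address -> {"history": 2-bit counter, "target": address}

    def lookup(pc):
        """Return the predicted next PC for the branch at pc (fall through if unknown)."""
        entry = branch_table.get(pc)
        if entry and entry["history"] >= 2:        # counter says "taken"
            return entry["target"]
        return pc + 4                              # assumed 4-byte instructions

    def record(pc, taken, target):
        entry = branch_table.setdefault(pc, {"history": 2, "target": target})
        entry["target"] = target
        entry["history"] = min(3, entry["history"] + 1) if taken else max(0, entry["history"] - 1)
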
Delayed Branch

In delayed branch, the branch is moved ahead of the “independent instructions” that preceded it. Those instructions, which now follow the branch, can be executed while the branch target is being determined.

What would it take to actually do this?
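
As an illustration (the mnemonics and registers below are invented), an instruction that is independent of the branch condition ends up in the slot after the branch, where it executes while the target is being fetched:

    # Hypothetical sequence: the LOAD does not affect the condition tested by BEQZ.
    original = [
        "SUB  R3, R3, #1",    # produces the value BEQZ tests
        "LOAD R1, X",         # independent of the branch
        "BEQZ R3, done",      # branch (followed by a wasted delay slot / NOP)
    ]
    with_delayed_branch = [
        "SUB  R3, R3, #1",
        "BEQZ R3, done",      # branch issued one instruction earlier
        "LOAD R1, X",         # now fills the delay slot: runs while the target is fetched
    ]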


Instruction Reordering

Instruction reordering requires a judicious rearrangement of instructions so that data hazards can be eliminated.

How can this be implemented?
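
One common illustration (again with invented mnemonics) is moving an independent instruction between a load and the instruction that uses the loaded value:

    original = [
        "LOAD R1, A",        # R1 not ready until the memory access completes
        "ADD  R2, R1, R3",   # RAW on R1: would stall right behind the load
        "SUB  R5, R6, R7",   # independent of R1
    ]
    reordered = [
        "LOAD R1, A",
        "SUB  R5, R6, R7",   # moved up: useful work while R1 is being loaded
        "ADD  R2, R1, R3",   # now finds R1 ready, or stalls for fewer cycles
    ]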


Multiple Copies of Registers

Having multiple copies of registers – perhaps as many as one set for each stage – can eliminate many data hazards.

How would you implement this?


Get Branch Target Early

The branch target is often available before the end of the pipeline; for example, a JMP has it available as soon as the source-operand stage is completed. There is no need to wait until the completion of the write-back stage to begin fetching the next instruction.

What would it take to implement this?


Example: Intel 80486 Pipelining
• Fetch (Fetch)
— From cache or external memory
— Put in one of two 16-byte prefetch buffers
— Fill buffer with new data as soon as old data is consumed
— On average, 5 instructions are fetched per load
— Operates independently of the other stages to keep the buffers full

• Decode stage 1 (D1)
— Opcode & address-mode info
— At most the first 3 bytes of the instruction
— Can direct the D2 stage to get the rest of the instruction

• Decode stage 2 (D2)
— Expand opcode into control signals
— Computation of complex address modes

• Execute (EX)
— ALU operations, cache access, register update

• Writeback (WB)
— Update registers & flags
— Results sent to cache & bus-interface write buffers
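
A rough model of the double-buffered fetch stage described above, assuming a simplified byte stream and refill policy rather than the 80486’s actual bus behavior:

    from collections import deque

    LINE_BYTES = 16

    def prefetcher(code_bytes):
        """Yield code bytes to the decoder, keeping two 16-byte buffers topped up."""
        lines = deque()                                      # at most two buffered lines
        pos = 0
        while pos < len(code_bytes) or lines:
            while len(lines) < 2 and pos < len(code_bytes):  # refill as soon as a buffer frees up
                lines.append(deque(code_bytes[pos:pos + LINE_BYTES]))
                pos += LINE_BYTES
            line = lines[0]
            yield line.popleft()                             # the decoder consumes one byte
            if not line:
                lines.popleft()                              # buffer drained: will be refilled above

    # Usage sketch: stream 40 dummy instruction bytes through the prefetcher, in order.
    consumed = list(prefetcher(bytes(range(40))))
    assert consumed == list(range(40))
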
80486 Instruction Pipeline Examples
