0% found this document useful (0 votes)

9 views

Pipelining

ntg

Uploaded by

saib12830

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

9 views

Pipelining

ntg

Uploaded by

saib12830

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 32

Pipelining

Performance Issues
 Longest delay determines clock period
 Critical path: load instruction
 Instruction memory → register file → ALU →
data memory → register file
 Not feasible to vary period for different
instructions
 Violates design principle
 Making the common case fast
 We will improve performance by pipelining

Chapter 4 — The Processor — 2

§4.5 An Overview of Pipelining
Pipelining Analogy
 Pipelined laundry: overlapping execution
 Parallelism improves performance

 Four loads:
 Speedup
= 16/7 = 2.3
 Non-stop:
 Speedup
= 4n/n + 3 ≈ 4
= number of stages

Chapter 4 — The Processor — 3

MIPS Pipeline
 Five stages, one step per stage
1. IF: Instruction fetch from memory
2. ID: Instruction decode & register read
3. EX: Execute operation or calculate address
4. MEM: Access memory operand
5. WB: Write result back to register

Chapter 4 — The Processor — 4

Pipeline Performance
Single-cycle (Tc= 800ps)

Pipelined (Tc= 200ps)

Chapter 4 — The Processor — 5

Pipeline Speedup
 If all stages are balanced
 i.e., all take the same time
 Time between instructionspipelined
= Time between instructionsnonpipelined
Number of stages
 If not balanced, speedup is less
 Speedup due to increased throughput
 Latency (time for each instruction) does not
decrease

Chapter 4 — The Processor — 6

Hazards
 Situations that prevent starting the next
instruction in the next cycle
 Structure hazards
 A required resource is busy
 Data hazard
 Need to wait for previous instruction to
complete its data read/write
 Control hazard
 Deciding on control action depends on
previous instruction

Chapter 4 — The Processor — 7

Structure Hazards
 Conflict for use of a resource
 In MIPS pipeline with a single memory
 Load/store requires data access
 Instruction fetch would have to stall for that
cycle

Would cause a pipeline “bubble”
 Hence, pipelined datapaths require
separate instruction/data memories
 Or separate instruction/data caches

Chapter 4 — The Processor — 8

Data Hazards
 An instruction depends on completion of
data access by a previous instruction
 add $s0, $t 0, $t 1
sub $t 2, $s0, $t 3

Chapter 4 — The Processor — 9

Forwarding (aka Bypassing)
 Use result when it is computed
 Don’t wait for it to be stored in a register
 Requires extra connections in the datapath

Chapter 4 — The Processor — 10

Load-Use Data Hazard
 Can’t always avoid stalls by forwarding
 If value not computed when needed
 Can’t forward backward in time!

Chapter 4 — The Processor — 11

Code Scheduling to Avoid Stalls
 Reorder code to avoid use of load result in
the next instruction
 C code for A = B + E; C = B + F;

lw $t 1, 0( $t 0) lw $t 1, 0( $t 0)
lw $t 2, 4( $t 0) lw $t 2, 4( $t 0)
stall add $t 3, $t 1, $t 2 lw $t 4, 8( $t 0)
sw $t 3, 12( $t 0) add $t 3, $t 1, $t 2
lw $t 4, 8( $t 0) sw $t 3, 12( $t 0)
stall add $t 5, $t 1, $t 4 add $t 5, $t 1, $t 4
sw $t 5, 16( $t 0) sw $t 5, 16( $t 0)
13 cycles 11 cycles

Chapter 4 — The Processor — 12

Control Hazards
 Branch determines flow of control
 Fetching next instruction depends on branch
outcome
 Pipeline can’t always fetch correct instruction
 Still working on ID stage of branch
 In MIPS pipeline
 Need to compare registers and compute
target early in the pipeline
 Add hardware to do it in ID stage

Chapter 4 — The Processor — 13

Stall on Branch
 Wait until branch outcome determined
before fetching next instruction

Chapter 4 — The Processor — 14

Branch Prediction
 Longer pipelines can’t readily determine
branch outcome early
 Stall penalty becomes unacceptable
 Predict outcome of branch
 Only stall if prediction is wrong
 In MIPS pipeline
 Can predict branches not taken
 Fetch instruction after branch, with no delay

Chapter 4 — The Processor — 15

MIPS with Predict Not Taken

Prediction
correct

Prediction
incorrect

Chapter 4 — The Processor — 16

More-Realistic Branch Prediction
 Static branch prediction
 Based on typical branch behavior
 Example: loop and if-statement branches

Predict backward branches taken
 Predict forward branches not taken
 Dynamic branch prediction
 Hardware measures actual branch behavior

e.g., record recent history of each branch
 Assume future behavior will continue the trend

When wrong, stall while re-fetching, and update history

Chapter 4 — The Processor — 17

Pipelining and ISA Design
 MIPS ISA designed for pipelining
 All instructions are 32-bits
 Easier to fetch and decode in one cycle
 c.f. x86: 1- to 17-byte instructions
 Few and regular instruction formats

Can decode and read registers in one step
 Load/store addressing

Can calculate address in 3rd stage, access memory
in 4th stage
 Alignment of memory operands

Memory access takes only one cycle

Chapter 4 — The Processor — 18

Pipeline Summary
The BIG Picture
 Pipelining improves performance by
increasing instruction throughput
 Executes multiple instructions in parallel
 Each instruction has the same latency
 Subject to hazards
 Structure, data, control
 Instruction set design affects complexity of
pipeline implementation
Chapter 4 — The Processor — 19
§4.6 Pipelined Datapath and Control
MIPS Pipelined Datapath

MEM

Right-to-left WB
flow leads to
hazards

Chapter 4 — The Processor — 20

Pipeline registers
 Need registers between stages
 To hold information produced in previous cycle

Chapter 4 — The Processor — 21

IF for Load, Store, …

Chapter 4 — The Processor — 22

ID for Load, Store, …

Chapter 4 — The Processor — 23

EX for Load

Chapter 4 — The Processor — 24

MEM for Load

Chapter 4 — The Processor — 25

WB for Load

Wrong
register
number

Chapter 4 — The Processor — 26

Corrected Datapath for Load

Chapter 4 — The Processor — 27

EX for Store

Chapter 4 — The Processor — 28

MEM for Store

Chapter 4 — The Processor — 29

WB for Store

Chapter 4 — The Processor — 30

Pipelined Control (Simplified)

Chapter 4 — The Processor — 31

Pipelined Control

Chapter 4 — The Processor — 32

8051 Microcontroller Instruction
No ratings yet
8051 Microcontroller Instruction
32 pages
Lecture-14 CH-04 2
No ratings yet
Lecture-14 CH-04 2
20 pages
Chapter4 Pipelining END FA11
No ratings yet
Chapter4 Pipelining END FA11
84 pages
Comp206 Lecture8
No ratings yet
Comp206 Lecture8
32 pages
Ca Lecture 9
No ratings yet
Ca Lecture 9
26 pages
lec2
No ratings yet
lec2
28 pages
lec3
No ratings yet
lec3
30 pages
06- CS F342 Pipelining(ForMIDSEM_upto35slides)
No ratings yet
06- CS F342 Pipelining(ForMIDSEM_upto35slides)
69 pages
Chapter4 Part1
No ratings yet
Chapter4 Part1
51 pages
Computer Architecture: Chapter 4: The Processor Part 1
No ratings yet
Computer Architecture: Chapter 4: The Processor Part 1
51 pages
Pipeline Processor Design
No ratings yet
Pipeline Processor Design
89 pages
5 Pipelining
No ratings yet
5 Pipelining
38 pages
L15 MipsPipeline
No ratings yet
L15 MipsPipeline
26 pages
Chapter 04 Computer Architecture and D
No ratings yet
Chapter 04 Computer Architecture and D
95 pages
Patterson6e MIPS Ch04 PPT
No ratings yet
Patterson6e MIPS Ch04 PPT
137 pages
Chapter 4 The Processor
No ratings yet
Chapter 4 The Processor
131 pages
Chapter 4 The Processor
No ratings yet
Chapter 4 The Processor
131 pages
Chapter - 04 Mips Assembly Data Path
No ratings yet
Chapter - 04 Mips Assembly Data Path
137 pages
Patterson6e MIPS Ch04 PPT
No ratings yet
Patterson6e MIPS Ch04 PPT
137 pages
The Processor: The Hardware/Software Interface 5
No ratings yet
The Processor: The Hardware/Software Interface 5
149 pages
Chapter 04 Computer Organization and Design, Fifth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design) 5th Edition
67% (6)
Chapter 04 Computer Organization and Design, Fifth Edition: The Hardware/Software Interface (The Morgan Kaufmann Series in Computer Architecture and Design) 5th Edition
137 pages
Processor PDF
No ratings yet
Processor PDF
98 pages
Cse410 10 Pipelining A
No ratings yet
Cse410 10 Pipelining A
7 pages
L14 MipsPipeline Ovw
No ratings yet
L14 MipsPipeline Ovw
17 pages
Comp206 Inclass8
No ratings yet
Comp206 Inclass8
20 pages
Ca06 2014 PDF
No ratings yet
Ca06 2014 PDF
53 pages
CODch 6 Slides
No ratings yet
CODch 6 Slides
77 pages
16.482 / 16.561 Computer Architecture and Design: Instructor: Dr. Michael Geiger Fall 2013
No ratings yet
16.482 / 16.561 Computer Architecture and Design: Instructor: Dr. Michael Geiger Fall 2013
42 pages
Chapter 4 The Processor
No ratings yet
Chapter 4 The Processor
72 pages
Ca07 2014 PDF
No ratings yet
Ca07 2014 PDF
56 pages
Comp206 Lecture9
No ratings yet
Comp206 Lecture9
53 pages
Pipe Lining
No ratings yet
Pipe Lining
66 pages
module 4-Pipelining
No ratings yet
module 4-Pipelining
39 pages
L4 - The Processor-Pipelined2
No ratings yet
L4 - The Processor-Pipelined2
47 pages
Chapter 04MHE Kabir
No ratings yet
Chapter 04MHE Kabir
171 pages
Pipelining Lecture
No ratings yet
Pipelining Lecture
39 pages
Computer Architecture: Nguyễn Trí Thành
No ratings yet
Computer Architecture: Nguyễn Trí Thành
77 pages
Topic 10: Pipelining: Cos / Ele 375 Computer Architecture and Organization
No ratings yet
Topic 10: Pipelining: Cos / Ele 375 Computer Architecture and Organization
64 pages
Pipelined MIPS Processor: Dmitri Strukov ECE 154A
No ratings yet
Pipelined MIPS Processor: Dmitri Strukov ECE 154A
81 pages
1. Lecture 13 Pipelining
No ratings yet
1. Lecture 13 Pipelining
12 pages
5.1-5.3 Pipelining and Parallel Processing
No ratings yet
5.1-5.3 Pipelining and Parallel Processing
56 pages
Chapter 4 Part 2
No ratings yet
Chapter 4 Part 2
50 pages
Module 8 The Processor Pipelining I
No ratings yet
Module 8 The Processor Pipelining I
36 pages
MIPS
No ratings yet
MIPS
70 pages
Lect8 Pipelined DP Control
No ratings yet
Lect8 Pipelined DP Control
59 pages
Pipeline and Vector
No ratings yet
Pipeline and Vector
29 pages
Pipelinehazard 160823134502
No ratings yet
Pipelinehazard 160823134502
61 pages
Pipelinehazard For Class
No ratings yet
Pipelinehazard For Class
61 pages
Hazards: Situations That Prevent Starting The Next Instruction in The Next Cycle Structure Hazards Data Hazard
No ratings yet
Hazards: Situations That Prevent Starting The Next Instruction in The Next Cycle Structure Hazards Data Hazard
6 pages
Advanced Linux Programming
No ratings yet
Advanced Linux Programming
31 pages
Module 9 The Processor Pipelining II
No ratings yet
Module 9 The Processor Pipelining II
68 pages
07 MIPS Pipelining CH4
No ratings yet
07 MIPS Pipelining CH4
73 pages
The Big Picture: Requirements Algorithms Prog. Lang./Os Isa Uarch Circuit Device
No ratings yet
The Big Picture: Requirements Algorithms Prog. Lang./Os Isa Uarch Circuit Device
60 pages
L 0 ILP Optional Extra Topic
No ratings yet
L 0 ILP Optional Extra Topic
44 pages
Lecture 8 Chapter_04 RISC-V Pipelining - Student Version (1)
No ratings yet
Lecture 8 Chapter_04 RISC-V Pipelining - Student Version (1)
59 pages
Chapter 2 Lecture 4 and 5
No ratings yet
Chapter 2 Lecture 4 and 5
56 pages
Computer Architecture Chapter 4: The Processor Part 3: Dr. Phạm Quốc Cường
No ratings yet
Computer Architecture Chapter 4: The Processor Part 3: Dr. Phạm Quốc Cường
23 pages
Computer Architecture: Appendix A Pipelining Prof. Jerry Breecher CSCI 240 Fall 2003
No ratings yet
Computer Architecture: Appendix A Pipelining Prof. Jerry Breecher CSCI 240 Fall 2003
58 pages
Pipelining
No ratings yet
Pipelining
44 pages
Module 4 - Parallel & Pipeline Processing - Final
No ratings yet
Module 4 - Parallel & Pipeline Processing - Final
31 pages
Accelerated Computing With HIP: Second Edition
From Everand
Accelerated Computing With HIP: Second Edition
Yifan Sun
No ratings yet
CSO Model Question
No ratings yet
CSO Model Question
5 pages
cs146 Fall2017 Midterm1xx
No ratings yet
cs146 Fall2017 Midterm1xx
12 pages
MPCA Assignment 11 B - 66
No ratings yet
MPCA Assignment 11 B - 66
5 pages
Risc Cisc in Microcontroller and Microprocessor
No ratings yet
Risc Cisc in Microcontroller and Microprocessor
31 pages
Chapter 4
No ratings yet
Chapter 4
73 pages
Chapter 2 Programming and Instruction Set PDF
No ratings yet
Chapter 2 Programming and Instruction Set PDF
122 pages
470 HW2 W14 Ans
No ratings yet
470 HW2 W14 Ans
3 pages
Pipelines - #1 RISC ISA Without Pipe
No ratings yet
Pipelines - #1 RISC ISA Without Pipe
9 pages
Module 5_Processor Structure and Function
No ratings yet
Module 5_Processor Structure and Function
74 pages
Tuning The Pentium Pro Microarchitecture
No ratings yet
Tuning The Pentium Pro Microarchitecture
8 pages
ILP ScoreBoard
No ratings yet
ILP ScoreBoard
45 pages
AMD64 Architecture Programmer's Manual Volume 3 General-Purpose and System Instructions
No ratings yet
AMD64 Architecture Programmer's Manual Volume 3 General-Purpose and System Instructions
474 pages
Instruction Set 1 Compressed 2 1
No ratings yet
Instruction Set 1 Compressed 2 1
25 pages
Microcontrollers and Embedded Systems Unit 2:8051 Programming
No ratings yet
Microcontrollers and Embedded Systems Unit 2:8051 Programming
6 pages
Unit-6: Pipeline & Vector Processing
No ratings yet
Unit-6: Pipeline & Vector Processing
41 pages
Addressing Modes
No ratings yet
Addressing Modes
26 pages
Difference Between Vector Processor and Scalar Processor
No ratings yet
Difference Between Vector Processor and Scalar Processor
1 page
Questions Ch5 1
No ratings yet
Questions Ch5 1
2 pages
COA Unit-2 Notes (P3)
No ratings yet
COA Unit-2 Notes (P3)
13 pages
William Stallings Computer Organization and Architecture 9 Edition
No ratings yet
William Stallings Computer Organization and Architecture 9 Edition
55 pages
Slot15 CH14 ProcessorStructureAndFunction 42 Slots
No ratings yet
Slot15 CH14 ProcessorStructureAndFunction 42 Slots
42 pages
Design of A Pipelined Powerpc Processor Using Verilog
No ratings yet
Design of A Pipelined Powerpc Processor Using Verilog
72 pages
DANIELCT
No ratings yet
DANIELCT
43 pages
8 - RISCV - Pipelined - Arch2
No ratings yet
8 - RISCV - Pipelined - Arch2
57 pages
Intel x86 Processors: Presented by Kiyeon Lee
No ratings yet
Intel x86 Processors: Presented by Kiyeon Lee
22 pages
Pipelining
No ratings yet
Pipelining
5 pages
Delayed Branching
No ratings yet
Delayed Branching
4 pages
Coa Unit - 5 Notes
No ratings yet
Coa Unit - 5 Notes
6 pages

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Pipelining

Uploaded by

Pipelining

Uploaded by

Pipelining

Chapter 4 — The Processor — 2

Chapter 4 — The Processor — 3

Chapter 4 — The Processor — 4

Pipelined (Tc= 200ps)

Chapter 4 — The Processor — 5

Chapter 4 — The Processor — 6

Chapter 4 — The Processor — 7

Chapter 4 — The Processor — 8

Chapter 4 — The Processor — 9

Chapter 4 — The Processor — 10

Chapter 4 — The Processor — 11

Chapter 4 — The Processor — 12

Chapter 4 — The Processor — 13

Chapter 4 — The Processor — 14

Chapter 4 — The Processor — 15

Chapter 4 — The Processor — 16

Chapter 4 — The Processor — 17

Chapter 4 — The Processor — 18

Chapter 4 — The Processor — 20

Chapter 4 — The Processor — 21

Chapter 4 — The Processor — 22

Chapter 4 — The Processor — 23

Chapter 4 — The Processor — 24

Chapter 4 — The Processor — 25

Chapter 4 — The Processor — 26

Chapter 4 — The Processor — 27

Chapter 4 — The Processor — 28

Chapter 4 — The Processor — 29

Chapter 4 — The Processor — 30

Chapter 4 — The Processor — 31

Chapter 4 — The Processor — 32

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.