0% found this document useful (0 votes)

28 views31 pages

Slide 5

The lecture covers the design of a single-cycle processor, focusing on the RISC-V architecture, which includes components like the datapath and control logic. It outlines the instruction execution process, detailing the five stages: Fetch, Decode, Execute, Memory, and Write Back, while emphasizing the requirements and block diagrams for each stage. The lecture also discusses the implementation of specific instruction formats and the role of the ALU in processing arithmetic and logical operations.

Uploaded by

Dang Nguyen Uyen My

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views31 pages

Slide 5

Uploaded by

Dang Nguyen Uyen My

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 31

ELT3047 Computer Architecture

Lecture 5: Single cycle processor design

Hoang Gia Hung

Faculty of Electronics and Telecommunications
University of Engineering and Technology, VNU Hanoi
Today’s lecture overview

❑ A single-core processor consists of

▪ Datapath: HW elelements that process data,
e.g. perform the arithmetic, logical & memory
operations.
▪ Control: HW elements that tell the datapath,
memory & I/O devices what to do according
to program instructions.

❑ Building two RISC-V implementations

➢ Single cycle processor (starting this week)
➢ Pipelined processor (later)

❑ Only a simplified RISC-V ISA subset

➢ Memory reference: lw, sw
➢ Arithmetic/logical: add, sub, ori
➢ Control transfer: beq
Instruction execution in a single-cycle
processor
❑ Any instruction must be executed in exactly one single clock
cycle, which comprises 5 sequential phases.
➢ Example: add t3, t1, t2 vs lw t3, 20(t1)

add t3, t1, t2 lw t3, 20(t1) Clk

Fetch Read inst. at [PC] Read inst. at [PC] Fetch

Decode
o Addition o Load word
&

Next Instruction
o Read [t1] as opr1 o Read [t1] as opr1 Decode
Operand
o Read [t2] as opr2 o Use 20 as opr2
Fetch

ALU Result = opr1 + opr2 MemAddr = opr1 + opr2 ALU

Memory Use MemAddr to read Memory

Access from memory

Result Result WB
Result stored in t3 Memory data stored in t3
Write
Steps to design a datapath

? ?
? ? ?

Clk

1. Instruction Fetch 2. Decode 3. Execute 4. Memory 5. Write back

❑ We will build a lite RISC-V datapath incrementally:

1. Look at each stage closely, figure out the requirements and processes.
2. Sketch a high level block diagram, then zoom in for each elements.
3. With the simple starting design, check whether different type of instructions
can be handled, add modifications when needed.
A prelude to control
1. Instruction Fetch 2. Decode 3. Execute 4. Memory 5. Write back

? ?
? ? ?

Control Logic

❑ Not all instructions need all 5 stages → the control logic selects
“needed” datapath lines based on the instruction.
➢ MUX selector, ALU op selector, write enable, etc.
Fetch Stage: Requirements
❑ Instruction Fetch Stage:
1. Use the Program Counter (PC) to fetch the instruction from memory
▪ PC is implemented as a special register in the processor
2. Increment the PC by 4 to get the address of the next instruction:
▪ How do we know the next instruction is at PC+4?
▪ Note the exception when branch/jump instruction is executed

❑ Output to the next stage (Decode):

➢ The instruction to be executed

1. Fetch
2. Decode
3. ALU
4. Memory
5. RegWrite
Fetch Stage: Block diagram

Increment by
4 for next
instruction

Decode Stage
32-bit
register
Zoomed-in element: PC register
❑ Seems that we’re reading and updating PC at the same time!
➢ How can it works properly during a single cycle?

❑ Magic of clock
➢ PC is read during the first half of the clock period and it is updated with PC+4
at the next rising clock edge.
Time
Clk
𝑡𝑠𝑒𝑡𝑢𝑝 𝑡𝑠𝑒𝑡𝑢𝑝
𝑡𝑐𝑙𝑘−2−𝑄
Add
4 PC 100 104
𝑡𝑎𝑑𝑑
PC Read
In address
In 104 108
Instruction
Clk Instruction Flip-flop timing
memory
D Q 𝑡𝑠𝑒𝑡𝑢𝑝 time that D must not change before ↑
𝑡𝑐𝑙𝑘−2−𝑄 Delay after ↑ until D appears at Q

clk 𝑡𝑎𝑑𝑑 Delay at the adder

Zoomed-in element: Instruction Memory

❑ Idealized memory
➢ One input bus: Address Address DataOut
Instruction
➢ One output bus: Data Out 32 32
Memory
❑ Memory word is found by
➢ Address selects the word to put on Data Out
➢ The word must had been written to the memory prior to instruction fetch.
➢ During instruction fetch operation, the memory behaves as a combinational
logic block: Address valid → Data Out valid after “access time”.

❑ Note: in practice, there must be more inputs but they are not
used during instruction fetch.
➢ E.g. Data In, Clock, Write Enable had been used to write the instructions to
the memory (prior to instruction fetch).
Decode Stage: Requirements
❑ Instruction Decode Stage:
➢ Gather data from the instruction fields:
1. Read the opcode to determine instruction type and field lengths
2. Read data from general purpose registers (in the register file)
▪ Can be two (e.g. add), one (e.g. addi) or zero (e.g. auipc)

❑ Input from previous stage (Fetch):

➢ The instruction to be executed

❑ Output to the next stage (ALU):

➢ Operation and the necessary operands
1. Fetch
2. Decode
3. ALU
4. Memory
5. RegWrite
Decode Stage: Block Diagram
Register Register
numbers File
Data
Fetch Stage

5 Read 32

ALU Stage
Read
register A data A
5 Read
register B Operands
Inst. 5 Write
register Read 32
data B
Operation

Collection of
registers, known
as register file
Zoomed-in element: Register File
5 RA RA 32
❑ A collection of 32 data BusA
Register
registers: numbers
5 RB
Register
➢ Two 32-bit output busses: 5 RW File Data
busA and busB RB 32
32 data
➢ One 32-bit input bus: Write BusB
Data
data
busW BusW

❑ Register is selected by: Clk Write Enable

➢ RA (number) selects the register to put on busA (data)

➢ RB (number) selects the register to put on busB (data)
➢ RW (number) selects the register to be written via busW (data) when Write
Enable is 1.

❑ Clock input (CLK)

➢ CLK input is a factor ONLY during write op.
➢ During read op., behaves as a combinational logic block: RA/RB valid →
busA/busB valid after “access time”
Decode Stage: R-Format Instruction
add x18, x19, x20
Notation:
0000 000

Inst [Y:X]
= bits X to Y in Instruction
10100

5 AddrB
DataA
32 content of
5
BusA register x19
AddrA
10011 000

DataB
32
content of
32
DataD BusB register x20
BusW
10010

Clk Write Enable

Result to be stored
011 0011

into register x18

(produced by later
stage)
Decode Stage: I-Format Instruction
addi x15, x1, -50
111111001110

Inst [24:20]
5
AddrB DataA
32 content of
5
BusA register x1
00001

AddrA
Register
5 File
AddrD
32
000

DataB
32
DataD BusB
01111

BusW
Problems:
Clk Write Enable RB data is an
Result to be stored
0010011

into register x15

immediate value,
(produced by later not from register!
stage)
Adding addi to datapath

+4 Reg[]
DataD
ALU
pc IMEM
inst[11:7]
AddrD DataA Reg[rs1] alu
pc+4 inst[19:15] AddrA 0
inst[24:20] AddrB DataB
Reg[rs2] 1

Imm. imm[31:0]
Gen
inst[31:20]

ALUSel=Add
ImmSel=I BSel=1

❑ Decoding problem for addi is completely solved at ALU stage

➢ Decoding stage: copy inst[31:20] to low 12 bits of immediate & then sign-
extended by filling up the upper 20 bits of the immediate with inst[31].
➢ ALU stage: use a MUX prior to the ALU to select busB/immediate operand.
➢ Note: this set-up also works for all other I-format arithmetic instructions
(sltiu,andi,ori, …) just by changing the control signal ALUSel.
I- & S-type Immediate Generator

31 25 24 20 19 15 14 12 11 7 6 0
imm[11:0] rs1 funct3 rd I-opcode
imm[11:5] rs2 rs1 funct3 imm[4:0] S-opcode

5
5
1 6
I S

inst[31](sign-extension) inst[30:25] inst[24:20] I

inst[31](sign-extension) inst[30:25] inst[11:7] S
31 11 10 5 4 0

❑ Immediates are decoded differently for I-type and S-type instr’s.

➢ Just need a 5-bit mux to select between two positions where low five bits of
immediate can reside in instruction.
➢ Other bits in immediate are wired to fixed positions in instruction.
ALU Stage: Requirements
❑ Instruction ALU Stage:
➢ ALU = Arithmetic-Logic Unit
➢ Also called the Execution stage
➢ Perform the real work for most instructions here
▪ Arithmetic (e.g. add, sub), Shifting (e.g. sll), Logical (e.g. and, or)
▪ Memory operation (e.g. lw, sw): Address calculation
▪ Branch operation (e.g. bne, beq): Perform register comparison and
target address calculation

❑ Input from previous stage (Decode):

➢ Operation and Operand(s)
1. Fetch
❑ Output to the next stage (Memory): 2. Decode
➢ Calculation result 3. ALU
4. Memory
5. RegWrite
ALU Stage: Block Diagram

Memory Stage
Decode Stage

ALU result
Operands ALU

Operation

Logic to perform
arithmetic and
logical operations
Element: Arithmetic Logic Unit
ALUSel
4
❑ ALU (Arithmetic Logic Unit) A
32
➢ Combinational logic to implement
arithmetic and logical operations ALU 32
result

❑ Inputs: B A op B
➢ Two 32-bit numbers 32

❑ Control:
➢ 4-bit to decide the particular operation ALUSel Function
0000 AND
❑ Output: OR
0001
➢ Result of arithmetic/logical operation
0010 add
0110 subtract
0111 slt
1100 NOR
ALU Stage: Branch Instructions
❑ Branch instruction is harder as we need to perform two
calculations
❑ Example: "beq x9, x0, 3"
1. Branch Outcome:
▪ Need a comparator to compare the registers
2. Branch Target Address:
▪ Use ALU to calculate the address
▪ Need PC (from Fetch Stage)
▪ Need Offset (from Decode Stage)

❑ Also need to feed the branch target address back to the fetch
stage!
Branch Comparator
❑ BrEq = 1, if A=B
A Branch
❑ BrLT = 1, if A < B
Comp.
❑ BrUn =1 selects unsigned B
comparison for BrLT, 0=signed
❑ BGE branch: A >= B, if !(A<B)

BrUnBrEq BrLT
B-type Immediate Generator

❑ Only bit inst[7] changes role in immediate between S and B

➢ Only need a single-bit 2-way mux,

❑ 12-bit immediate encodes PC-relative offset of -4096 to +4094

bytes in multiples of 2 bytes:
➢ Treat immediate as in range -2048 to +2047, then shift left by 1 bit to
multiply by 2 for branches
Adding branch to the datapath

alu
+4 Reg[] pc
wb 1
DataD Reg[rs1]
1 ALU
0
pc IMEM inst[11:7] AddrD 0
Reg[rs2]
pc+4 inst[19:15] AddrA DataA Branch 0
Comp.
inst[24:20] AddrB DataB 1

inst[31:7]
Imm. imm[31:0]

Gen
BrUn
PCSel=taken/not-taken ImmSel=B RegWEn=0 Bsel=1 ALUSel=Add
Choose to BrEq BrLT ASel=1
Control Signal generate Choose PC as
to select B-type ALU opr 1,
between immediate imm[31:0] as
(PC+4) or ALU opr2, to
Branch Target calculate the
branch target
Memory Stage: Requirements
❑ Instruction Memory Access Stage:
➢ Only the load and store instructions need to perform operation in this stage
▪ Use memory address calculated by ALU Stage
▪ Read from or write to data memory
➢ All other instructions remain idle
▪ Result from ALU Stage will pass through to be used in Register Write
stage (later in this lecture) if applicable

❑ Input from previous stage (ALU):

➢ Computation result to be used as memory address (if applicable)

❑ Output to the next stage (Register Write):

➢ Result to be stored (if applicable) 1. Fetch
2. Decode
3. ALU
4. Memory
5. RegWrite
Memory Stage: Block Diagram

32 Address

Stage
Read 32
Result Data
32 Write
Data Data
Memory

MemRW

Memory which
stores data values
Adding lw to datapath

lw x14, 8(x2)

+4 Reg[] pc alu
wb 1
1 DataD Reg[rs1]
alu pc inst[11:7] 0 ALU DMEM 1
pc+4
0 IMEM AddrD Reg[rs2] Addr wb
inst[19:15] DataA Branch 0 DataR 0
AddrA Comp. DataW mem
inst[24:20] DataB 1
AddrB

inst[31:7]
Imm. imm[31:0]

Gen
RegWEn=1 Bsel=1
Asel=0 ALUSel=Add WBSel=0
PCSel ImmSel=I BrUnBrEq BrLT MemRW=Read

❑ Supporting narrower loads (lh/lb) requires additional circuits.

Adding sw to datapath

sw x14, 8(x2)
Do we need any modification?

+4 Reg[] pc alu
wb 1
DataD Reg[rs1]
1 ALU
alu pc inst[11:7] AddrD 0 DMEM 1
pc+4
0 IMEM Reg[rs2] Addr wb
inst[19:15] AddrA DataA Branch 0 DataR 0
Comp. mem
inst[24:20] AddrB DataB 1 DataW

inst[31:7]
Imm. imm[31:0]

Gen
RegWEn=0 Bsel=1
Asel=0 ALUSel=Add WBSel=*
PCSel ImmSel=S BrUnBrEq BrLT MemRW=Write

*= “Don’t Care”
Register Write Stage: Requirements
❑ Instruction Register Write Stage:
➢ Most instructions write the result of some computation into a register
▪ Examples: arithmetic, logical, shifts, loads, set-less-than
▪ Need destination register number and computation result
➢ Exceptions are stores, branches, jumps
▪ There are no results to be written
▪ These instructions remain idle in this stage

❑ Input from previous stage (Memory):

➢ Computation result either from memory or ALU
1. Fetch
2. Decode
3. ALU
4. Memory
5. RegWrite
Register Write Stage: Block Diagram
Memory Stage
5 32
AddrB DataA
BusA
5
AddrA
Register
Result 5 File
AddrD
32
DataB
32
DataD BusB
BusW

Clk Write Enable

❑ Result Write stage has no additional element:

➢ Basically just route the correct result into register file
➢ The Write Register number (AddrD) had been generated way back in the
Decode Stage
Adding jalr to datapath
jalr rd, rs1, imm

pc+4
alu +4 Reg[] pc alu
wb 1
DataD Reg[rs1] 2
1 ALU
pc inst[11:7] AddrD 0 DMEM 1
pc+4
0 IMEM Reg[rs2] Addr DataR
wb
inst[19:15] AddrA DataA Branch 0 0
Comp. mem
inst[24:20] AddrB DataB 1 DataW

inst[31:7]
Imm. imm[31:0]

Gen

PCSel=1 inst[31:0] Bsel=1 Asel=1 WBSel=2

ImmSel=I RegWEn=1 MemRW=Read
BrUn=* BrLT=* ALUSel=Add
BrEq = *

❑ Enlarging WB MUX to enable PC+4 to be written to Reg[rd]

➢ Uses same immediates as arithmetic and loads: PC = Reg[rs1] + immediate
The complete RV32I datapath

pc+4
+4 Reg[] pc alu
alu wb 1
DataD Reg[rs1] 2
1 ALU
pc inst[11:7] AddrD 0 DMEM 1
pc+4
0 IMEM Reg[rs2] Addr DataR
wb
inst[19:15] AddrA DataA Branch 0 0
Comp. DataW mem
inst[24:20] AddrB DataB 1

inst[31:7]
Imm. imm[31:0]

Gen

PCSel inst[31:0] ImmSel RegWEn BrUn BrEq BrLT BSel ASel ALUSel MemRW WBSel

Control Unit

❑ Can execute any RV32I instruction in one clock cycle.

➢ The way the datapath is operated is governed by the Control Unit
➢ How do we design the Control Unit? Look forward to the next lecture ☺

Computer Architecture Notes
89% (18)
Computer Architecture Notes
85 pages
Lecture 4.1 - The Processor
No ratings yet
Lecture 4.1 - The Processor
29 pages
CA I - Chapter 3 RISC V Processor
No ratings yet
CA I - Chapter 3 RISC V Processor
107 pages
CA04 2024S2 Printout
No ratings yet
CA04 2024S2 Printout
31 pages
CST Antenna
100% (1)
CST Antenna
101 pages
Unit 4
No ratings yet
Unit 4
53 pages
Introduction To ARM Cortex-M Processor
100% (2)
Introduction To ARM Cortex-M Processor
19 pages
Est3 Installation&Service Manual
100% (1)
Est3 Installation&Service Manual
388 pages
Lec5b-Singlecycle - Datapath
No ratings yet
Lec5b-Singlecycle - Datapath
35 pages
CH 5
No ratings yet
CH 5
68 pages
Lec09 Datapath
No ratings yet
Lec09 Datapath
36 pages
Chapter 08 Processor Design
No ratings yet
Chapter 08 Processor Design
70 pages
Chapter V Processor Architecture
No ratings yet
Chapter V Processor Architecture
140 pages
COA Module4
No ratings yet
COA Module4
50 pages
Chapter 7 Basic Processing Unit
No ratings yet
Chapter 7 Basic Processing Unit
58 pages
Chapter 7 Basic Processing Unit
No ratings yet
Chapter 7 Basic Processing Unit
58 pages
Unit 3
No ratings yet
Unit 3
55 pages
15-Micro Programmed Control Unit-13!02!2023
No ratings yet
15-Micro Programmed Control Unit-13!02!2023
45 pages
Computer Organization & Assembly Language: CS/COE0447
No ratings yet
Computer Organization & Assembly Language: CS/COE0447
82 pages
3 - Processor (Single Cycle)
No ratings yet
3 - Processor (Single Cycle)
53 pages
Lec07 Annotated
No ratings yet
Lec07 Annotated
26 pages
COA Module5
No ratings yet
COA Module5
35 pages
Week7 - Processor Part 2
No ratings yet
Week7 - Processor Part 2
23 pages
Processor DP Control
No ratings yet
Processor DP Control
44 pages
Unit Ii
No ratings yet
Unit Ii
84 pages
Lecture 12
No ratings yet
Lecture 12
34 pages
w9 One PDF
No ratings yet
w9 One PDF
37 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
15 pages
L06 - RISCV Datapath Design
100% (1)
L06 - RISCV Datapath Design
78 pages
cs2100 14 Datapath
No ratings yet
cs2100 14 Datapath
43 pages
ELEN 350 Single Cycle Datapath: Adapted From The Lecture Notes of John Kubiatowicz (UCB) and Hank Walker (TAMU)
No ratings yet
ELEN 350 Single Cycle Datapath: Adapted From The Lecture Notes of John Kubiatowicz (UCB) and Hank Walker (TAMU)
61 pages
W9 Config
No ratings yet
W9 Config
37 pages
Single Cycle
No ratings yet
Single Cycle
28 pages
COA Chapter 5
No ratings yet
COA Chapter 5
12 pages
SAP-1 (Simple As Possible-1) Computer Architecture
No ratings yet
SAP-1 (Simple As Possible-1) Computer Architecture
8 pages
L7 Single Cycle DP
No ratings yet
L7 Single Cycle DP
24 pages
Wa0031.
No ratings yet
Wa0031.
10 pages
Module-2: Memory Systems Basic Processing Unit
No ratings yet
Module-2: Memory Systems Basic Processing Unit
183 pages
QGH46982 06
No ratings yet
QGH46982 06
406 pages
An Example Hardwired CPU
No ratings yet
An Example Hardwired CPU
29 pages
Unit 2 The CPU and Register Org.
No ratings yet
Unit 2 The CPU and Register Org.
11 pages
LEC 11 Instruction Set 8085 Part 2 Arithmetic Group
No ratings yet
LEC 11 Instruction Set 8085 Part 2 Arithmetic Group
34 pages
The Processor: (Datapath and Pipelining)
No ratings yet
The Processor: (Datapath and Pipelining)
144 pages
Chapter 09 Processor Design
No ratings yet
Chapter 09 Processor Design
71 pages
The Processor
No ratings yet
The Processor
27 pages
Chapter4 SingleCycleCPU
No ratings yet
Chapter4 SingleCycleCPU
31 pages
Multicycle Datapath PDF
No ratings yet
Multicycle Datapath PDF
22 pages
Multi Cycle PDF
No ratings yet
Multi Cycle PDF
16 pages
MA C6000 2DAY Student Guide Rev2.3
No ratings yet
MA C6000 2DAY Student Guide Rev2.3
164 pages
The Final Datapath: Add M U X
No ratings yet
The Final Datapath: Add M U X
32 pages
Lecture 2 Transistors BJT and FET - Updated 5
No ratings yet
Lecture 2 Transistors BJT and FET - Updated 5
132 pages
Nstruction Datapath
No ratings yet
Nstruction Datapath
10 pages
RISC Processor Design: Multi-Cycle Cycle Implementation: Mips
No ratings yet
RISC Processor Design: Multi-Cycle Cycle Implementation: Mips
49 pages
CA I - Chapter 3 RISC V Processor
No ratings yet
CA I - Chapter 3 RISC V Processor
103 pages
Lec12 DataPath
No ratings yet
Lec12 DataPath
43 pages
Cpu Data Path: Professor Michael Mcgarry
No ratings yet
Cpu Data Path: Professor Michael Mcgarry
8 pages
GE VersaMax Workshop Student Guide
No ratings yet
GE VersaMax Workshop Student Guide
83 pages
Brkewn 3013
No ratings yet
Brkewn 3013
118 pages
Computer Systems and Java Programming
No ratings yet
Computer Systems and Java Programming
48 pages
Dissertation 13944
No ratings yet
Dissertation 13944
67 pages
CS104: Computer Organization: 11 March, 2020
No ratings yet
CS104: Computer Organization: 11 March, 2020
37 pages
Cbca2103 SG
No ratings yet
Cbca2103 SG
63 pages
Neverwinter Nights v1.69 Patch Details: New Content Added From The
No ratings yet
Neverwinter Nights v1.69 Patch Details: New Content Added From The
65 pages
Cryptocurrency Mining
No ratings yet
Cryptocurrency Mining
6 pages
Ch2 - Lec4 - Major Components of The Cpu
No ratings yet
Ch2 - Lec4 - Major Components of The Cpu
31 pages
Basic Processing Unit
No ratings yet
Basic Processing Unit
49 pages
Lect 07 Processordesign PDF
No ratings yet
Lect 07 Processordesign PDF
55 pages
4 The Processors
No ratings yet
4 The Processors
112 pages
Lecture08 RISCV Impl 2
No ratings yet
Lecture08 RISCV Impl 2
55 pages
Slide 4
No ratings yet
Slide 4
35 pages
It3030e CA Chap5 Cpu p1
No ratings yet
It3030e CA Chap5 Cpu p1
62 pages
Slide 2
No ratings yet
Slide 2
35 pages
Unit 4 Notes
No ratings yet
Unit 4 Notes
15 pages
4th Sem Whole Syllabus RB
No ratings yet
4th Sem Whole Syllabus RB
27 pages
KAIST cs311 05 Proc I
No ratings yet
KAIST cs311 05 Proc I
28 pages
13.2 Technical Specifications of The CPU 417-4H (6ES7 417-4HL01-0AB0)
No ratings yet
13.2 Technical Specifications of The CPU 417-4H (6ES7 417-4HL01-0AB0)
6 pages
12-Interaction Design Models - Model Human Processor - Principles-31-Jul-2020Material - I - 31-Jul-2020 - Interaction - Design - Models
No ratings yet
12-Interaction Design Models - Model Human Processor - Principles-31-Jul-2020Material - I - 31-Jul-2020 - Interaction - Design - Models
22 pages
100 Hardware Questions
No ratings yet
100 Hardware Questions
17 pages
Resource Governor in SQL Server 2012
No ratings yet
Resource Governor in SQL Server 2012
19 pages
CSCE 5610 Computer System Architecture: Instruction Level Parallelism
No ratings yet
CSCE 5610 Computer System Architecture: Instruction Level Parallelism
11 pages
Engine Control Unit (ECU) System Operation
No ratings yet
Engine Control Unit (ECU) System Operation
3 pages
Introduction of Microprocessor
No ratings yet
Introduction of Microprocessor
13 pages
Manual 7
No ratings yet
Manual 7
13 pages
Lab 2-1 Develop Software Nios PIO
No ratings yet
Lab 2-1 Develop Software Nios PIO
11 pages
CS101 Preperation 2024 by ZB FILE F .. (Mids)
No ratings yet
CS101 Preperation 2024 by ZB FILE F .. (Mids)
8 pages
Comparch 04
No ratings yet
Comparch 04
73 pages
Cs8491 Computer Architecture Unit - 2: Ans: A
No ratings yet
Cs8491 Computer Architecture Unit - 2: Ans: A
8 pages
Ddco With Answers
No ratings yet
Ddco With Answers
46 pages
Amity School of Engineering & Technology: B. Tech. (CSE), V Semester Computer Architecture Jitendra Rajpurohit
No ratings yet
Amity School of Engineering & Technology: B. Tech. (CSE), V Semester Computer Architecture Jitendra Rajpurohit
13 pages
EWS
No ratings yet
EWS
4 pages
Simulasi Rakitan Enterkomputer: NO Item Jumlah Harga Satuan Sub Total # Processor
No ratings yet
Simulasi Rakitan Enterkomputer: NO Item Jumlah Harga Satuan Sub Total # Processor
1 page
LPIC-1 Primer
From Everand
LPIC-1 Primer
John Greene
4.5/5 (3)

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Slide 5

Uploaded by

Slide 5

Uploaded by

ELT3047 Computer Architecture

Lecture 5: Single cycle processor design

Hoang Gia Hung

❑ A single-core processor consists of

❑ Building two RISC-V implementations

❑ Only a simplified RISC-V ISA subset

add t3, t1, t2 lw t3, 20(t1) Clk

Fetch Read inst. at [PC] Read inst. at [PC] Fetch

ALU Result = opr1 + opr2 MemAddr = opr1 + opr2 ALU

Memory Use MemAddr to read Memory

1. Instruction Fetch 2. Decode 3. Execute 4. Memory 5. Write back

❑ We will build a lite RISC-V datapath incrementally:

❑ Output to the next stage (Decode):

clk 𝑡𝑎𝑑𝑑 Delay at the adder

❑ Input from previous stage (Fetch):

❑ Output to the next stage (ALU):

❑ Register is selected by: Clk Write Enable

➢ RA (number) selects the register to put on busA (data)

❑ Clock input (CLK)

Clk Write Enable

into register x18

into register x15

❑ Decoding problem for addi is completely solved at ALU stage

inst[31](sign-extension) inst[30:25] inst[24:20] I

❑ Immediates are decoded differently for I-type and S-type instr’s.

❑ Input from previous stage (Decode):

❑ Only bit inst[7] changes role in immediate between S and B

❑ 12-bit immediate encodes PC-relative offset of -4096 to +4094

❑ Input from previous stage (ALU):

❑ Output to the next stage (Register Write):

❑ Supporting narrower loads (lh/lb) requires additional circuits.

❑ Input from previous stage (Memory):

Clk Write Enable

❑ Result Write stage has no additional element:

PCSel=1 inst[31:0] Bsel=1 Asel=1 WBSel=2

❑ Enlarging WB MUX to enable PC+4 to be written to Reg[rd]

❑ Can execute any RV32I instruction in one clock cycle.

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.