
Nmam Institute of Technology: Department of Computer Science and Engineering

This document contains a list of problems to be solved in tutorial classes for units 1 and 2 of the subject Advanced Computer Architecture. The problems cover topics like speedup from using a floating-point processor, calculating CPI for different instruction mixes and pipelines, determining faster systems based on clock speed and instruction breakdown, optimization options to improve speedup, and analyzing dependencies and hazards in pipelines. The document is from the Department of Computer Science and Engineering at NMAM Institute of Technology in Karnataka, India.

Uploaded by

smitha shetty


NMAM INSTITUTE OF TECHNOLOGY

(An Autonomous Institution affiliated to VTU, Belgaum)

(AICTE, approved, NBA Accredited, ISO 9001:2008 Certified)
Nitte – 574110, Karkala, Udupi District, Karnataka, India.
Department of Computer Science and Engineering

List of Problems to be solved in Tutorial Classes for Unit I and II

Subject Title : ADVANCED COMPUTER ARCHITECTURE

Subject Code : CS702

1) You have a system that contains a special processor for doing floating-point operations. You have
determined that 50% of your computations can use the floating-point processor. The speedup of the
floating-point processor is 15.
a) What is the overall speedup achieved by using the floating-point processor?
b) What is the overall speedup achieved if you modify the compiler so that 75% of the
computations can use the floating-point processor?
c) What fraction of the computations should be able to use the floating-point processor in order to
achieve an overall speedup of 2.25?

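Solution:
By Amdahl's law, with F the fraction of the computations that can use the floating-point processor
and S the speedup of that processor:
Overall speedup = 1 / [(1-F) + (F/S)]

a) Overall speedup achieved by using the floating-point processor.
F = 0.5 S = 15
Overall speedup = 1 / [(1-0.5) + (0.5/15)] = 1.875

b) Overall speedup achieved if you modify the compiler so that 75% of the computations can use
the floating-point processor.
F = 0.75 S = 15
Overall speedup = 1 / [(1-0.75) + (0.75/15)] = 3.33

c) Fraction of the computations that should be able to use the floating-point processor in order to
achieve an overall speedup of 2.25:
F = ? S = 15
2.25 = 1 / [(1-F) + (F/15)], so (14/15)*F = 1 - (1/2.25) and F = 0.595 (about 59.5%)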
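
The three parts of Problem 1 can be checked numerically with Amdahl's law, the same formula used in the solution to Problem 4. The following is a minimal Python sketch; the function names are illustrative and do not come from the tutorial sheet.

```python
def overall_speedup(fraction, unit_speedup):
    """Amdahl's law: `fraction` of the work is sped up by `unit_speedup`."""
    return 1.0 / ((1.0 - fraction) + fraction / unit_speedup)


def fraction_needed(target, unit_speedup):
    """Invert Amdahl's law: fraction that yields a target overall speedup."""
    return (1.0 - 1.0 / target) / (1.0 - 1.0 / unit_speedup)


print(f"a) {overall_speedup(0.50, 15):.3f}")  # 50% of computations use the FP processor
print(f"b) {overall_speedup(0.75, 15):.3f}")  # 75% after the compiler change
print(f"c) {fraction_needed(2.25, 15):.3f}")  # fraction required for an overall speedup of 2.25
```

Under these inputs the sketch gives roughly 1.875, 3.333 and 0.595.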

2) Suppose you have a load/store computer with the following instruction mix:
Operation    Frequency    No. of clock cycles
ALU ops      35%          1
Loads        25%          2
Stores       15%          2
Branches     25%          3
a) Compute the CPI.
b) We observe that 35% of the ALU ops are paired with a load, and we propose to replace these ALU
ops and their loads with a new instruction. The new instruction takes 1 clock cycle. With the new
instruction added, branches take 5 clock cycles. Compute the CPI for the new version.
c) If the clock of the old version is 20% faster than the new version, which version has the faster CPU
execution time, and by what percentage?

Solution:
a) CPI old = (0.35*1) + (0.25*2) + (0.15*2) + (0.25*3) = 1.9

b) Fraction of all instructions that are ALU ops paired with a load: 0.35*0.35 = 0.1225
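
Parts b) and c) can be worked through with a short script. This is a sketch under stated assumptions, not the sheet's official solution: each replaced ALU op/load pair becomes one new instruction (so the instruction count drops to 0.8775 of the original), and "20% faster clock" is read as a clock frequency 1.2 times that of the new version. The names `OLD_MIX`, `NEW_MIX` and `cpi` are illustrative.

```python
# (frequency, clock cycles) per instruction class, relative to the ORIGINAL instruction count
OLD_MIX = {"alu": (0.35, 1), "load": (0.25, 2), "store": (0.15, 2), "branch": (0.25, 3)}


def cpi(mix):
    """Weighted cycles divided by the (possibly reduced) instruction count."""
    count = sum(freq for freq, _ in mix.values())
    return sum(freq * cycles for freq, cycles in mix.values()) / count


paired = 0.35 * OLD_MIX["alu"][0]  # 0.1225 of all instructions are ALU ops paired with a load

# Assumption: each ALU+load pair is replaced by ONE new 1-cycle instruction.
NEW_MIX = {
    "alu": (0.35 - paired, 1),
    "load": (0.25 - paired, 2),
    "store": (0.15, 2),
    "branch": (0.25, 5),      # branches slow down to 5 cycles
    "alu_load": (paired, 1),  # the new combined instruction
}

new_count = sum(freq for freq, _ in NEW_MIX.values())  # relative instruction count
cpi_old, cpi_new = cpi(OLD_MIX), cpi(NEW_MIX)

# Execution time = instruction count * CPI / clock frequency (new clock normalised to 1.0).
time_old = 1.0 * cpi_old / 1.2
time_new = new_count * cpi_new / 1.0

print(f"CPI old = {cpi_old:.3f}, CPI new = {cpi_new:.3f}")
print(f"old version is faster by {100 * (time_new / time_old - 1):.1f}%")
```

With these assumptions the new CPI comes out near 2.46 and the old version is faster by about 36%.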


3) For the purpose of solving a given application problem, you benchmark a program on two
computer systems. On system A, the object code executed 80 million Arithmetic Logic Unit
operations (ALU ops), 40 million load instructions, and 25 million branch instructions. On system
B, the object code executed 50 million ALU ops, 50 million loads, and 40 million branch
instructions. In both systems, each ALU op takes 1 clock cycle, each load takes 3 clock cycles, and
each branch takes 5 clock cycles.
a) Compute the relative frequency of occurrence of each type of instruction executed in both
systems.
b) Find the CPI for each system.
c) Assuming that the clock on system B is 10% faster than the clock on system A, which system is
faster for the given application problem and by how much percent?
Solution:
a) Relative frequency of occurrence of each type of instruction executed in both systems.

            A                B
ALU ops     80/145 = 0.55    50/140 = 0.36
Loads       40/145 = 0.28    50/140 = 0.36
Branches    25/145 = 0.17    40/140 = 0.29

b) Find the CPI for each system.

c)
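
Parts b) and c) follow from the instruction counts and cycle costs given in the problem. The sketch below is one way to compute them in Python; it assumes that "10% faster clock" means system B's clock frequency is 1.1 times system A's. The dictionary and function names are illustrative.

```python
# Instruction counts in millions, taken from the problem statement
COUNTS = {
    "A": {"alu": 80, "load": 40, "branch": 25},
    "B": {"alu": 50, "load": 50, "branch": 40},
}
CYCLES = {"alu": 1, "load": 3, "branch": 5}  # clock cycles per instruction type


def total_cycles(counts):
    return sum(n * CYCLES[kind] for kind, n in counts.items())


def cpi(counts):
    return total_cycles(counts) / sum(counts.values())


cpi_a, cpi_b = cpi(COUNTS["A"]), cpi(COUNTS["B"])

# Execution time = total cycles / clock frequency (system A's frequency normalised to 1.0).
time_a = total_cycles(COUNTS["A"]) / 1.0
time_b = total_cycles(COUNTS["B"]) / 1.1

print(f"CPI A = {cpi_a:.2f}, CPI B = {cpi_b:.2f}")
print(f"system A is faster by {100 * (time_b / time_a - 1):.1f}%")
```

This gives CPI values of about 2.24 for A and 2.86 for B. Even with its faster clock, B needs more time, so A is faster by roughly 11.9%.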

4) Suppose that a system contains a special floating point processor for doing floating-point
operations. When a program uses the floating-point processor, the speedup that the floating-point
processor offers is 1.4.
In order to improve the speedup two options are considered:
Option 1: Modifying the compiler so that 70% of the computations can use the floating-point
processor. Cost of this option is Rs. 2500.
Option 2: Modifying the floating-point processor. The speedup offered by the modified floating-
point processor is 2. Assume in this case that 50% of the computations can use the floating-point
processor. Cost of this option is Rs. 3000.
Which option would you recommend? Justify your answer quantitatively.
Solution:
Option 1:
Speedup= 1/ [(1-0.7)+(0.7/1.4)] = 1.25
Cost/speedup = 2500/1.25 = 2000

Option 2:
Speedup = 1/[(1-0.5) + (0.5/2)] = 1.33
Cost/speedup = 3000/1.333 = 2250

Therefore, Option 1 is better because it has a smaller Cost/Speedup ratio.
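
The comparison can be reproduced with a few lines of Python. This is a minimal sketch using the cost-per-unit-speedup criterion from the solution above; the `OPTIONS` table and function name are illustrative.

```python
def overall_speedup(fraction, unit_speedup):
    """Amdahl's law, as used in the solution above."""
    return 1.0 / ((1.0 - fraction) + fraction / unit_speedup)


# name: (fraction using the FP processor, FP processor speedup, cost in Rs.)
OPTIONS = {
    "Option 1": (0.7, 1.4, 2500),
    "Option 2": (0.5, 2.0, 3000),
}

cost_per_speedup = {}
for name, (fraction, unit_speedup, cost) in OPTIONS.items():
    speedup = overall_speedup(fraction, unit_speedup)
    cost_per_speedup[name] = cost / speedup
    print(f"{name}: speedup = {speedup:.3f}, cost/speedup = {cost_per_speedup[name]:.0f}")

best = min(cost_per_speedup, key=cost_per_speedup.get)
print("Recommended:", best)
```

Without intermediate rounding, Option 2 costs 2250 per unit of speedup against 2000 for Option 1, so the recommendation is unchanged.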

5) An unpipelined processor takes 6 ns to work on one instruction. The pipelined version of the
processor has 6 stages with the following lengths: 1.0ns; 0.8ns; 0.4ns; 1.2ns; 1.3ns; 1.3ns. It then
takes 0.3 ns to latch its results into latches. Answer the following, assuming that there are no stalls
in the pipeline.
a) What are the cycle times in both processors?
b) How long does it take (in nano-seconds ) to finish one instruction in both processors?
(Note : Ignore the initial fill time in the pipelined processor)
c) What is the speedup achieved by the 6 stage pipeline with respect to unpipelined processor?


d) How long does it take (in nano-seconds) to finish 1000 instructions in both processors?
(Note : Do not ignore the initial fill time in the pipelined processor)

6) Compute the overall CPI of a computer which executes a program with following instruction
mix:
Operation    Frequency    No. of clock cycles
ALU ops      35%          1
Loads        25%          2
Stores       15%          2
Branches     25%          3
Solution:

CPI = (0.35*1) + (0.25*2) + (0.15*2) + (0.25*3) = 1.9

7) An unpipelined processor takes 6 ns to work on one instruction. The pipelined version of the
processor has 6 stages with the following lengths: 1.0ns; 0.8ns; 0.4ns; 1.2ns; 1.3ns; 1.3ns. It then
takes 0.3 ns to latch its results into latches. Answer the following, assuming that there are no stalls
in the pipeline.
1) What are the cycle times in both processors?
2) How long does it take (in nanoseconds) to finish one instruction in both processors? (Note:
Ignore the initial fill time in the pipelined processor)
3) What is the speedup achieved by the 6-stage pipeline with respect to the unpipelined processor?
4) How long does it take (in nanoseconds) to finish 1000 instructions in both processors? (Note:
Do not ignore the initial fill time in the pipelined processor)
Ans.
1) T_unpipelined = 6 ns
T_pipelined = 1.3 + 0.3 = 1.6 ns (slowest stage plus latch overhead)
2) exec. time per instr_unpipelined = 6 ns
exec. time per instr_pipelined = 1.6 ns
3) speedup = 6/1.6 = 3.75
4) exec. time for 1000 instr_unpipelined = 1000*6 = 6000 ns
exec. time for 1000 instr_pipelined = (6+999)*1.6 = 1608 ns (6 cycles for the first instruction, then one
instruction completes per cycle)
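
The same arithmetic can be written as a short Python sketch, which makes it easy to try other stage lengths or instruction counts. The variable and function names are illustrative, and the model assumes no stalls, as the problem states.

```python
STAGES_NS = [1.0, 0.8, 0.4, 1.2, 1.3, 1.3]  # stage lengths from the problem
LATCH_NS = 0.3                              # latch overhead added to every cycle
UNPIPELINED_NS = 6.0                        # time per instruction without pipelining

# The pipeline clock must accommodate the slowest stage plus the latch delay.
cycle_ns = max(STAGES_NS) + LATCH_NS
speedup = UNPIPELINED_NS / cycle_ns


def pipelined_time_ns(n_instructions):
    """Fill the pipeline once (one cycle per stage), then finish one instruction per cycle."""
    return (len(STAGES_NS) + n_instructions - 1) * cycle_ns


print(f"cycle time = {cycle_ns:.1f} ns, speedup = {speedup:.2f}")
print(f"1000 instructions: unpipelined = {1000 * UNPIPELINED_NS:.0f} ns, "
      f"pipelined = {pipelined_time_ns(1000):.0f} ns")
```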

8) A 400 MHz processor was used to execute a benchmark program with the following
instruction mix and clock cycle counts:
Instruction type      Instruction count    Clock cycle count
Integer Arithmetic    45000                1
Data transfer         32000                2
Floating point        15000                2
Control transfer      8000                 2
Determine the effective CPI, MIPS rate and execution time for this program.
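
One way to work Problem 8 is shown in the Python sketch below. It uses the standard definitions CPI = total cycles / instruction count, MIPS = clock rate / (CPI * 10^6), and execution time = total cycles / clock rate. The names are illustrative.

```python
CLOCK_HZ = 400e6  # 400 MHz

# instruction type: (instruction count, clock cycles per instruction)
MIX = {
    "integer arithmetic": (45000, 1),
    "data transfer": (32000, 2),
    "floating point": (15000, 2),
    "control transfer": (8000, 2),
}

instructions = sum(count for count, _ in MIX.values())
cycles = sum(count * cpi for count, cpi in MIX.values())

effective_cpi = cycles / instructions
mips = CLOCK_HZ / (effective_cpi * 1e6)
exec_time_s = cycles / CLOCK_HZ

print(f"CPI = {effective_cpi:.2f}")
print(f"MIPS = {mips:.2f}")
print(f"execution time = {exec_time_s * 1e6:.1f} microseconds")
```

For the given mix this yields a CPI of 1.55, about 258 MIPS, and an execution time of 387.5 microseconds.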


9) Consider a branch that is taken 80% of the time. On average, how many stalls are introduced for
this branch for each approach below:
i) Stall fetch until branch outcome is known
ii) Assume not-taken and squash if the branch is taken
iii) Assume a branch delay slot:
a. No instruction is found to put in the delay slot
b. An instruction before the branch is put in the delay slot
c. An instruction from the taken side is put in the delay slot
d. An instruction from the not-taken side is put in the slot
Solution:
i) Stall fetch until branch outcome is known – 1
ii) Assume not-taken and squash if the branch is taken – 0.8
iii) Assume a branch delay slot
a. You can’t find anything to put in the delay slot – 1
b. An instr before the branch is put in the delay slot – 0
c. An instr from the taken side is put in the slot – 0.2
d. An instr from the not-taken side is put in the slot – 0.8
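
The answers above are consistent with a one-cycle branch penalty, which the problem does not state explicitly. Under that assumption, the expected stalls per branch can be tabulated as in the following Python sketch; the function name and dictionary keys are illustrative.

```python
def expected_stalls(p_taken, penalty=1):
    """Average stall cycles per branch; `penalty` = 1 cycle is an assumption."""
    p_not_taken = 1.0 - p_taken
    return {
        "stall until outcome known": penalty,                     # always pay the penalty
        "assume not-taken": p_taken * penalty,                    # squash only when taken
        "delay slot, nothing found": penalty,                     # slot holds a no-op
        "delay slot, instr from before branch": 0,                # slot always does useful work
        "delay slot, instr from taken side": p_not_taken * penalty,   # wasted when not taken
        "delay slot, instr from not-taken side": p_taken * penalty,   # wasted when taken
    }


for approach, stalls in expected_stalls(0.8).items():
    print(f"{approach}: {stalls:.1f}")
```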

10) Analyse the data dependences among the following statements in a given program.

Where (Ri) means the content of register Ri and memory (10) contains 64 initially

(a) List out the dependences among these instructions.

(b) Are there any resource dependences if only one copy of each functional unit is available in the
CPU?

11) For the following reservation table of a nonlinear pipeline find:

(a) What are the forbidden latencies?
(b) Draw the state transition diagram.
(c) List all the simple cycles and greedy cycles.
(d) Determine the optimal constant latency and minimal average latency.
(e) Let the pipeline clock period be T = 20 ns. Determine the throughput of this pipeline.
1 2 3 4
S1 X X
S2 X
S3 X
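
The procedure asked for in parts (a) to (e) can be sketched in Python. Note that the column positions of the marks in the table above are not recoverable from this copy, so the table in the code is a hypothetical placement (S1 used in cycles 1 and 4, S2 in cycle 2, S3 in cycle 3) chosen only for illustration. The sketch finds forbidden latencies as distances between marks in the same row, builds the collision vector, and searches the state diagram for the cycle with the smallest average latency. All function names are illustrative.

```python
from fractions import Fraction

# HYPOTHETICAL reservation table (rows = stages S1..S3, columns = clock cycles 1..4)
TABLE = [
    [1, 0, 0, 1],  # S1
    [0, 1, 0, 0],  # S2
    [0, 0, 1, 0],  # S3
]


def forbidden_latencies(table):
    """A latency is forbidden if two marks in the same row are that far apart."""
    forbidden = set()
    for row in table:
        cols = [c for c, used in enumerate(row) if used]
        forbidden.update(b - a for a in cols for b in cols if b > a)
    return forbidden


def minimal_average_latency(table):
    forbidden = forbidden_latencies(table)
    m = max(forbidden, default=0)
    # Collision vector as a bitmask: bit k-1 is set when latency k is forbidden.
    initial = sum(1 << (k - 1) for k in forbidden)

    def successors(state):
        # Latency m+1 stands for any latency greater than m (returns to the initial state).
        for k in range(1, m + 2):
            if not (state >> (k - 1)) & 1:
                yield k, (state >> k) | initial

    best = None

    def dfs(state, states, latencies):
        nonlocal best
        for k, nxt in successors(state):
            if nxt in states:  # closed a simple cycle in the state diagram
                cycle = latencies[states.index(nxt):] + [k]
                mean = Fraction(sum(cycle), len(cycle))
                if best is None or mean < best:
                    best = mean
            else:
                dfs(nxt, states + [nxt], latencies + [k])

    dfs(initial, [initial], [])
    return best


mal = minimal_average_latency(TABLE)
print("forbidden latencies:", sorted(forbidden_latencies(TABLE)))
print("MAL:", mal)
print("throughput at T = 20 ns:", float(1 / (mal * 20e-9)), "initiations per second")
```

For the hypothetical table the only forbidden latency is 3 and the minimal average latency is 2, which also matches the lower bound given by the maximum number of marks in any row. Exhaustive enumeration of simple cycles is practical here only because reservation tables of this size produce very small state diagrams.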

12) For the following reservation table of a nonlinear pipeline find the minimal average latency
(MAL) for a collision free scheduling and calculate the efficiency of the pipeline.

1 2 3 4 5 6 7 8
S1 X x
S2 x X x
S3 x X x

13) For the following reservation table of a nonlinear pipeline find the minimal average latency
(MAL) for a collision free scheduling.

1 2 3 4 5
S1 x
S2 x x
S3 x x

14) Consider the following pipeline reservation table. Find the minimal average latency (MAL) for
a collision free scheduling.

1 2 3 4 5
S1 x x
S2 x
S3 x x

15) Explain the characteristics of CISC and RISC architectures.
