CS553 Homework #3: Benchmarking Storage
Instructions:
● Assigned date: Friday March 13th, 2020
● Due date: 11:59PM on Sunday March 29th, 2020
● Maximum Points: 100%
● This homework can be done in groups of up to 3 students
● Please post your questions to the Piazza forum
● Only a softcopy submission is required; it will automatically be collected through GIT after the
deadline; email confirmation will be sent to your HAWK email address
● Late submission will be penalized at 10% per day; an email to the TA with the subject “CS553:
late homework submission” must be sent
1 Your Assignment
This project aims to teach you how to benchmark storage systems. You can be creative with this project.
You must use either the C or C++ programming language. Libraries such as PThreads will be necessary
to complete the assignment, and the STL may be used if needed. Other programming languages are not
allowed, both because difficulty varies between languages and because they would complicate grading.
Do not write code that relies on complex libraries (e.g. Boost), as they would simplify parts of the
assignment and make grading harder. If you are not sure whether a particular library is allowed, ask
the TAs.
You can use any Linux system for your development, but you must use the Chameleon testbed
[https://www.chameleoncloud.org]; more information about the hardware in this testbed can be found
at https://www.chameleoncloud.org/about/hardware-description/, under Standard Cloud Units. Even
more details can be found at https://www.chameleoncloud.org/user/discovery/, choose “Compute”,
then click the “View” button. You are to use “Compute Haswell” node types; if there are no Haswell
nodes available, please use “Compute Skylake” node types. You are to use the advanced reservation
system to reserve 1 bare-metal instance to conduct your experiments. You will need to assign your
instance a floating IP address so that you can connect to your instance remotely.
In this project, you need to design a benchmarking program that evaluates the storage system. You will
perform strong scaling studies, unless otherwise noted; this means you will fix the total amount of work
(e.g. the number of objects or the amount of data your benchmark evaluates) and reduce the work per
thread as you increase the number of threads. The TAs will compile (with the help of make)
and test your code on Chameleon bare-metal instances (Haswell or Skylake). If your code does not
compile and the TAs cannot run your project, you will get 0 for the assignment.
1. Disk:
a. Implement: the MyDiskBench benchmark; Hint: there are multiple ways to read and write to
disk; explore the different APIs and pick the fastest of all of them. Also make sure
you are measuring the speed of your disk and not your memory (you may need to flush
the disk cache managed by the OS)
b. Dataset: 10GB data split up in 7 different configurations (note these are similar to the
way IOZone deals with multi-threading and multiple concurrent file access):
Other requirements:
● You must write all benchmarks from scratch. Do not use code you find online, as you will get 0
credit for this assignment. If you have taken other courses where you wrote similar benchmarks,
you are welcome to start with your codebase as long as you wrote the code in your prior class.
● All of the benchmarks will have to evaluate concurrency performance; concurrency can be
achieved using threads. Use strong scaling in all experiments, unless it is not possible, in which
case you need to explain why a strong scaling experiment was not done. Be aware of thread
synchronization issues to avoid inconsistency or deadlock in your system.
● All benchmarks can be run on a single machine.
● Not all timing functions have the same accuracy; you must find one with 1 ms accuracy or
better, assuming you are running each benchmark for at least several seconds at a time.
● Since there are many experiments to run, find ways (e.g. scripts) to automate the performance
evaluation. Besides BASH scripts, it is possible to automate your experiments using the “parallel”
tool in Linux.
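As a sketch, a driver script can enumerate every (thread count, repetition) combination; here ./mydiskbench and its -t/-r flags are hypothetical placeholders for whatever interface your benchmark exposes, and the echo makes this a dry run you can pipe into parallel or replace with the real command:

```shell
#!/bin/bash
# Emit one command line per (thread count, repetition) pair.
# "./mydiskbench", "-t", and "-r" are hypothetical placeholders.
gen_runs() {
  for threads in 1 2 4 8; do     # thread counts to benchmark
    for rep in 1 2 3; do         # 3 repetitions each
      echo "./mydiskbench -t $threads -r $rep"
    done
  done
}
gen_runs                         # dry run: prints the 12 command lines
# To actually execute them:  gen_runs | bash   (or: gen_runs | parallel -j1)
```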
● For the best reliability in your results, repeat each experiment 3 times and report the average and
standard deviation. This will help you get more stable results that are easier to understand and
justify.
● Don’t forget to benchmark your disk, and not your memory. You may need to flush caches that
might be stored in memory.
● You may find it more efficient to deal with binary data when reading or writing in this evaluation.
● No GUIs are required; a simple command-line interface is required. Make your benchmark as
similar to IOZone as possible, both in its command-line arguments and in how the program behaves.
Submit code/report through GIT. If you cannot access your repository contact the TAs. You can find a
git cheat sheet here: https://www.git-tower.com/blog/git-cheat-sheet/
Grades for late programs will be lowered 10% per day late.