Parallel Processing Report
Coursework 2017
Parallel Processing
SUPERVISOR: Dr. SAGVAN ALI SALIH
Better algorithms.
Greater reliability.
Parallelism Methods
1) TEMPORAL PARALLELISM
2) DATA PARALLELISM
This structure of a computer was proposed by John Von Neumann in the mid 1940s
and is known as the Von Neumann Architecture. In this architecture, a program is first
stored in the memory. The PE retrieves one instruction of this program at a time,
interprets it, and executes it. The operation of this computer is thus sequential: the PE
can execute only one instruction at a time. The speed of this sequential computer is thus
limited by the speed at which a PE can retrieve instructions and data from the memory
and the speed at which it can process the retrieved data. To increase the speed of
processing of data one may increase the speed of the PE by increasing the clock
speed. The clock speed increased from a few hundred kHz in the 1970s to 3 GHz in
2005. Processor designers found it difficult to increase the clock speed further as the
chip was getting overheated. The number of transistors which could be integrated in a
chip could, however, be doubled every two years. Thus, processor designers placed
many processing “cores” inside the processor chip to increase its effective throughput.
The processor retrieves a sequence of instructions from the main memory and stores
them in an on-chip memory. The “cores” can then cooperate to execute these
instructions in parallel. A computer which consists of a number of inter-connected
computers which cooperatively execute a single program to solve a problem is called a
parallel computer. Rapid developments in electronics have led to the emergence of
processors which can process over 5 billion instructions per second. Such processors
cost only around $100. It is thus possible to economically construct parallel computers
which use around 4,000 such multicore processors to carry out ten trillion (10^13)
instructions per second, assuming 50% efficiency (4,000 × 5 × 10^9 × 0.5 = 10^13). The more difficult problem is to
perceive parallelism in algorithms and develop a software environment which will enable
application programs to utilize this potential parallel processing power.
Single Instruction Single Data (SISD):
A sequential computer which exploits no parallelism in either the instruction or the data
stream. A single control unit fetches a single instruction stream from memory; the control
unit then generates the appropriate control signals to direct a single processing unit (PU)
to operate on a single data stream, i.e., one operation at a time.
Single Instruction Multiple Data (SIMD):
Single instruction, multiple data, or SIMD, systems are parallel systems. As the name
suggests, SIMD systems operate on multiple data streams by applying the same
instruction to multiple data items, so an abstract SIMD system can be thought of as
having a single control unit and multiple ALUs.
Note that in a SIMD system the ALUs must operate synchronously; that is, each ALU
must wait for the next instruction to be broadcast before proceeding.
Finally, SIMD systems are ideal for parallelizing simple loops that operate on large
arrays of data. Parallelism that’s obtained by dividing data among the processors and
having the processors all apply the same instructions to their subsets of the data is
called data parallelism.
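As a rough illustration of this kind of data parallelism, the sketch below applies the same multiplication to every element of a large array; the function name, the array, and the OpenMP simd hint are illustrative rather than part of the report's material. A vectorizing compiler can map consecutive iterations of such a loop onto SIMD lanes.

    #include <stddef.h>

    /* Scale every element of a large array by the same factor.
       The same instruction (a multiply) is applied to many data items,
       so the loop maps naturally onto SIMD hardware; the pragma is only
       a hint to the compiler's vectorizer. */
    void scale_array(float *a, size_t n, float factor)
    {
    #pragma omp simd
        for (size_t i = 0; i < n; i++)
            a[i] = a[i] * factor;
    }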
Multiple Instruction Multiple Data (MIMD):
MIMD systems are currently the most common and can be broadly divided, according to
the organization of their memory, into three sub-classes:
The MIMD architecture is primarily used in a number of application areas, including the
following:
• Computer-aided design.
• Computer-aided manufacturing.
• Simulation.
• Modeling.
The first of these sub-classes is shared memory, in which all processors access a single
common memory space. In practice, however, this class is best suited to machines with a
limited number of processors, because increasing the number of processors may create
a bottleneck in access to the shared memory.
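As a minimal sketch of the shared-memory style, assuming an OpenMP-capable compiler, the fragment below lets several threads of one process work on the same array in a single shared address space; the function and variable names are illustrative.

    #include <omp.h>

    /* Sum a shared array using several threads of one process.
       All threads read the same memory; the reduction clause combines
       each thread's partial sum without any explicit communication. */
    double shared_sum(const double *a, long n)
    {
        double total = 0.0;
    #pragma omp parallel for reduction(+:total)
        for (long i = 0; i < n; i++)
            total += a[i];
        return total;
    }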
A distributed-memory system refers to a computer system in which each processor has
its own private memory space. Different processors communicate over an interconnection
network. In distributed-memory systems, therefore, the processors usually communicate
explicitly, by sending messages through the network or by using special functions that
provide access to the memory of another processor.
This communication model can allow a considerable increase in speed, but it is harder
to program, since the programmer has to handle all communication operations explicitly.
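A minimal sketch of this explicit message passing, assuming the MPI library is available (the ranks and the message contents are purely illustrative):

    #include <mpi.h>
    #include <stdio.h>

    /* Each process has its own private memory; data moves only through
       explicit messages. Process 0 sends a value to process 1. */
    int main(int argc, char **argv)
    {
        int rank, value;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);

        if (rank == 0) {
            value = 42;   /* exists only in rank 0's memory */
            MPI_Send(&value, 1, MPI_INT, 1, 0, MPI_COMM_WORLD);
        } else if (rank == 1) {
            MPI_Recv(&value, 1, MPI_INT, 0, 0, MPI_COMM_WORLD,
                     MPI_STATUS_IGNORE);
            printf("rank 1 received %d over the network\n", value);
        }

        MPI_Finalize();
        return 0;
    }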
The combination of both shared and distributed memory mechanisms (known as mixed
or hybrid memory architectures) provides a flexible means of adapting to various
computing platforms.
This combination may increase scalability, improve performance, speed up computation,
and permit efficient utilization of the existing hardware capacities. However, while this
type of architecture combines the advantages of both approaches, it may also combine
their disadvantages.
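As a hedged sketch of the hybrid style, the fragment below combines both mechanisms: OpenMP threads share memory within each process, while MPI combines results across the separate memories of the processes. The work-splitting and names are illustrative, and MPI_Init is assumed to have been called elsewhere.

    #include <mpi.h>

    /* Hybrid pattern: each MPI process sums its own part of the data with
       shared-memory threads, then the partial sums are combined across
       the distributed memories with an MPI reduction. */
    double hybrid_sum(const double *local_part, long local_n)
    {
        double local_total = 0.0, global_total = 0.0;

    #pragma omp parallel for reduction(+:local_total)   /* shared memory */
        for (long i = 0; i < local_n; i++)
            local_total += local_part[i];

        /* distributed memory: combine the per-process results */
        MPI_Allreduce(&local_total, &global_total, 1, MPI_DOUBLE,
                      MPI_SUM, MPI_COMM_WORLD);
        return global_total;
    }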