Tutorial Presentation 8

This document provides an introduction to OpenMP, an application programming interface used to explicitly direct multi-threaded, shared-memory parallelism. It discusses how chip manufacturers are moving to multi-core CPUs, OpenMP's shared memory model, the fork-join execution model, key components of the OpenMP API including compiler directives and runtime routines, how variables can be classified as private or shared, examples of work-sharing constructs such as parallel loops, and different scheduling strategies for loop iterations.

OpenMP

Arash Bakhtiari
bakhtiar@in.tum.de

2012-12-18 Tue

Introduction

- Chip manufacturers are rapidly moving to multi-core CPUs.

Figure: Quad-core processor, Intel Sandy Bridge

Shared Memory Model

- All processors can access all memory in a global address space.
- Threads model: a single process can have multiple, concurrent execution paths.
- On a multi-core system, the threads run at the same time, with each core running a particular thread or task.

Figure: Shared Memory Model [1]

What is OpenMP?

- An Application Program Interface (API)
- Used to explicitly direct multi-threaded, shared memory parallelism
- Provides a portable, scalable model
- Supports C/C++ and Fortran on a wide variety of architectures

Fork-Join Model

- An OpenMP program starts as a single thread (the master).
- Additional threads (the team) are created when the master hits a parallel region.
- When all threads have finished the parallel region, the extra threads are given back to the runtime or operating system.
- The master continues after the parallel region.

Fork-Join Model (cont.)

Figure: Fork-Join Model [1]

OpenMP API
Primary API components:

- Compiler directives:

  #pragma omp parallel

- Run-time library routines:

  int omp_get_num_threads(void);

- Environment variables:

  export OMP_NUM_THREADS=2

Example
Listing 1: OpenMP Hello World!

#include <iostream>
#include <omp.h>

int main(int argc, char *argv[])
{
#pragma omp parallel
    {
        std::cout << "THREAD: " << omp_get_thread_num() << "\tHello, World!\n";
    }
    return 0;
}

Listing 2: Compiling

g++ -o hello hello.c -fopenmp

Classification of Variables

- private(var-list): Variables in var-list are private to each thread.
- shared(var-list): Variables in var-list are shared among all threads.
- default(private | shared | none): Sets the default for all variables in this region.

Example
Listing 3: OpenMP Private Variable

#include <iostream>
#include <omp.h>

int main(int argc, char *argv[])
{
    int i, j;
    i = 1;
    j = 2;
    std::cout << "BEFORE: i,j= " << i << "," << j << std::endl;
#pragma omp parallel private(i)
    {
        i = 3;
        j = 5;
        std::cout << "INLOOP: i,j= " << i << "," << j << std::endl;
    }
    std::cout << "AFTER:  i,j= " << i << "," << j << std::endl;
    return 0;
}

Work-Sharing Constructs

- Work-sharing constructs distribute the specified work to all threads within the current team.
- Types:
  - Parallel loop
  - Parallel section
  - Master region
  - Single region

Parallel Loop

- Syntax:

  #pragma omp for [clause ...]

- The iterations of the loop are distributed to the threads.
- The scheduling of loop iterations: static, dynamic, guided, and runtime.

Scheduling Strategies

- Schedule clause:

  schedule(type [, size])

- static: Chunks of the specified size are assigned in a round-robin fashion to the threads.
- dynamic: The iterations are broken into chunks of the specified size. When a thread finishes the execution of a chunk, the next chunk is assigned to that thread.
- guided: Similar to dynamic, but the size of the chunks is exponentially decreasing. The size parameter specifies the smallest chunk. The size of the initial chunk is implementation dependent.
- runtime: The scheduling type and the chunk size are determined via environment variables.

Example
Listing 4: OpenMP Parallel Loop with Dynamic Scheduling

#include <iostream>
#include <omp.h>
#include <cstdlib>
#include <ctime>

#define CHUNKSIZE 100
#define N         1000

int main()
{
    int i, chunk;
    double a[N], b[N], c[N];
    srand(time(NULL));
    for (i = 0; i < N; i++) {
        a[i] = generate_random_double(0.0, 10.0);   // user-defined helper
        b[i] = generate_random_double(0.0, 10.0);
    }
    chunk = CHUNKSIZE;
#pragma omp parallel shared(a, b, c, chunk) private(i)
    {
#pragma omp for schedule(dynamic, chunk) nowait
        for (i = 0; i < N; i++)
            c[i] = a[i] + b[i];
    }
    return 0;
}

References

[1] Blaise Barney, Lawrence Livermore National Laboratory, "OpenMP Tutorial", https://computing.llnl.gov/tutorials/openMP/
