
Shuffle and Sort

MapReduce makes the guarantee that the input to every reducer is sorted by key. The process by
which the system performs the sort—and transfers the map outputs to the reducers as inputs—is
known as the shuffle. In this section we look at how the shuffle works, since a basic understanding
of it is helpful should you need to optimize a MapReduce program. The shuffle is an area of
the codebase where refinements and improvements are continually being made, so the following
description necessarily conceals many details (and may change over time; it describes the shuffle as of Hadoop 0.20).
In many ways, the shuffle is the heart of MapReduce and is where the “magic” happens.

The Map Side

When the map function starts producing output, it is not simply written to disk. The process is
more involved, and takes advantage of buffering writes in memory and doing some presorting for
efficiency reasons. Figure 6-6 shows what happens.

Each map task has a circular memory buffer that it writes the output to. The buffer is 100 MB by
default, a size that can be tuned by changing the io.sort.mb property. When the contents of the
buffer reach a certain threshold size (io.sort.spill.percent, default 0.80, or 80%), a background
thread will start to spill the contents to disk.

Map outputs will continue to be written to the buffer while the spill takes place, but if the buffer
fills up during this time, the map will block until the spill is complete.
Spills are written in round-robin fashion to the directories specified by the mapred.local.dir
property, in a job-specific subdirectory.
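
As a rough illustration of how these map-side settings can be adjusted, the following sketch sets the 0.20-era property names on a job's Configuration. The 200 MB and 0.90 values are arbitrary examples, not recommendations, and mapred.local.dir is normally fixed cluster-wide rather than per job.

    import org.apache.hadoop.conf.Configuration;

    public class MapSpillTuning {
        public static Configuration tunedConf() {
            Configuration conf = new Configuration();
            // Size of the circular in-memory buffer each map task writes its output into (default 100 MB).
            conf.setInt("io.sort.mb", 200);
            // Fraction of that buffer which, once filled, triggers a background spill to disk (default 0.80).
            conf.setFloat("io.sort.spill.percent", 0.90f);
            return conf;
        }
    }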

Figure 6-6. Shuffle and sort in MapReduce

Before it writes to disk, the thread first divides the data into partitions corresponding to the
reducers that it will ultimately be sent to. Within each partition, the background thread performs
an in-memory sort by key, and if there is a combiner function, it is run on the output of the sort.
Running the combiner function makes for a more compact map output, so there is less data to write
to local disk and to transfer to the reducer.
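
To make the partitioning step concrete, here is a minimal partitioner sketch in the style of Hadoop's default HashPartitioner; it is an illustrative reimplementation, not the shuffle code itself. Each map output key is hashed and assigned to one of the reducer partitions.

    import org.apache.hadoop.mapreduce.Partitioner;

    // Assigns each map output record to one of numPartitions reducers,
    // mirroring what the default HashPartitioner does.
    public class SimpleHashPartitioner<K, V> extends Partitioner<K, V> {
        @Override
        public int getPartition(K key, V value, int numPartitions) {
            // Mask off the sign bit so the result is non-negative, then take the remainder.
            return (key.hashCode() & Integer.MAX_VALUE) % numPartitions;
        }
    }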

Each time the memory buffer reaches the spill threshold, a new spill file is created, so after the
map task has written its last output record there could be several spill files. Before the task is
finished, the spill files are merged into a single partitioned and sorted output file. The
configuration property io.sort.factor controls the maximum number of streams to merge at once;
the default is 10.
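
If a map task produces many spill files, the merge factor can be raised so that more streams are combined per round. A minimal sketch, with 25 chosen purely as an example value:

    import org.apache.hadoop.conf.Configuration;

    public class MergeFactorTuning {
        public static Configuration tunedConf() {
            Configuration conf = new Configuration();
            // Maximum number of spill files (streams) merged in a single round (default 10).
            conf.setInt("io.sort.factor", 25);
            return conf;
        }
    }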

If there are at least three spill files (set by the min.num.spills.for.combine property), then the
combiner is run again before the output file is written. Recall that combiners may be run repeatedly
over the input without affecting the final result. If there are only one or two spills, the potential
reduction in map output size is not worth the overhead of invoking the combiner, so it is not run
again for this map output.
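
Because a combiner may be applied any number of times without changing the result, a reducer-style aggregation such as a sum is the typical choice. The following sketch wires one up using the stock IntSumReducer from the new API and shows the spill-threshold property; the job name and values are examples only.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.mapreduce.Job;
    import org.apache.hadoop.mapreduce.lib.reduce.IntSumReducer;

    public class CombinerSetup {
        public static Job configuredJob() throws Exception {
            Configuration conf = new Configuration();
            // Re-run the combiner while merging spill files only if at least this many spills exist (default 3).
            conf.setInt("min.num.spills.for.combine", 3);

            Job job = new Job(conf, "combiner-demo"); // Job.getInstance(conf, ...) in later releases
            // Any reducer that tolerates repeated application can serve as the combiner.
            job.setCombinerClass(IntSumReducer.class);
            return job;
        }
    }
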
It is often a good idea to compress the map output as it is written to disk, since doing so makes
it faster to write to disk, saves disk space, and reduces the amount of data to transfer to the reducer.
By default, the output is not compressed, but compression is easy to enable by setting
mapred.compress.map.output to true. The compression library to use is specified by
mapred.map.output.compression.codec.
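
A hedged sketch of enabling map output compression with these 0.20-era property names; GzipCodec is used here simply because it ships with Hadoop, and other codecs can be substituted.

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.io.compress.CompressionCodec;
    import org.apache.hadoop.io.compress.GzipCodec;

    public class MapOutputCompression {
        public static Configuration tunedConf() {
            Configuration conf = new Configuration();
            // Compress map output as it is spilled to disk and transferred to the reducers.
            conf.setBoolean("mapred.compress.map.output", true);
            // Codec class used for the compression.
            conf.setClass("mapred.map.output.compression.codec", GzipCodec.class, CompressionCodec.class);
            return conf;
        }
    }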

The output file’s partitions are made available to the reducers over HTTP. The maximum number
of worker threads used to serve the file partitions is controlled by the tasktracker.http.threads
property—this setting is per tasktracker, not per map task slot. The default of 40 may need
increasing for large clusters running large jobs. In MapReduce 2, this property is not applicable
since the maximum number of threads used is set automatically based on the number of processors
on the machine. (MapReduce 2 uses Netty, which by default allows up to twice as many threads
as there are processors.)

The Reduce Side

Let’s turn now to the reduce part of the process. The map output file is sitting on the local disk of
the machine that ran the map task (note that although map outputs always get written to local disk,
reduce outputs may not be), but now it is needed by the machine that is about to run the reduce
task for the partition. Furthermore, the reduce task needs the map output for its particular
partition from several map tasks across the cluster. The map tasks may finish at different
times, so the reduce task starts copying their outputs as soon as each completes. This is known
as the copy phase of the reduce task. The reduce task has a small number of copier threads so
that it can fetch map outputs in parallel. The default is five threads, but this number can be
changed by setting the mapred.reduce.parallel.copies property. The map outputs are copied to
the reduce task JVM’s memory if they are small enough (the buffer’s size is controlled by
mapred.job.shuffle.input.buffer.percent, which specifies the proportion of the heap to use for this
purpose); otherwise, they are copied to disk. When the in-memory buffer reaches a threshold
size (controlled by mapred.job.shuffle.merge.percent), or reaches a threshold number of map
outputs (mapred.inmem.merge.threshold), it is merged and spilled to disk. If a combiner is
specified it will be run during the merge to reduce the amount of data written to disk.
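
As with the map side, these copy and merge settings can be adjusted per job. A minimal sketch with the 0.20-era property names (the values are illustrative, and several simply restate the defaults):

    import org.apache.hadoop.conf.Configuration;

    public class ReduceShuffleTuning {
        public static Configuration tunedConf() {
            Configuration conf = new Configuration();
            // Number of copier threads fetching map outputs in parallel (default 5).
            conf.setInt("mapred.reduce.parallel.copies", 10);
            // Proportion of the reduce task's heap used to buffer copied map outputs.
            conf.setFloat("mapred.job.shuffle.input.buffer.percent", 0.70f);
            // Fraction of that buffer which, once filled, triggers a merge and spill to disk.
            conf.setFloat("mapred.job.shuffle.merge.percent", 0.66f);
            // Alternatively, merge and spill after this many map outputs accumulate in memory.
            conf.setInt("mapred.inmem.merge.threshold", 1000);
            return conf;
        }
    }
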
As the copies accumulate on disk, a background thread merges them into larger, sorted files. This
saves some time merging later on. Note that any map outputs that were compressed (by the
map task) have to be decompressed in memory in order to perform a merge on them.

When all the map outputs have been copied, the reduce task moves into the sort phase (which
should properly be called the merge phase, as the sorting was carried out on the map side), which
merges the map outputs, maintaining their sort ordering. This is done in rounds. For example, if
there were 50 map outputs, and the merge factor was 10 (the default, controlled by the io.sort.factor
property, just like in the map’s merge), then there would be five rounds. Each round would merge 10
files into one, so at the end there would be five intermediate files.
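
The arithmetic behind the example can be sketched as follows; this is the simplified model used in the text (one pass merging up to the merge factor at a time), and the real merge logic is somewhat more elaborate.

    public class MergeRounds {
        // Number of merge rounds (and resulting intermediate files) when each round
        // combines up to mergeFactor sorted segments into one.
        static int rounds(int segments, int mergeFactor) {
            return (int) Math.ceil((double) segments / mergeFactor);
        }

        public static void main(String[] args) {
            // 50 map outputs with the default merge factor of 10: five rounds, five intermediate
            // files, which the final merge then feeds straight into the reduce function.
            System.out.println(rounds(50, 10)); // prints 5
        }
    }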

Rather than have a final round that merges these five files into a single sorted file, the merge
saves a trip to disk by directly feeding the reduce function in what is the last phase: the reduce
phase. This final merge can come from a mixture of in-memory and on-disk segments.

During the reduce phase, the reduce function is invoked for each key in the sorted output. The
output of this phase is written directly to the output filesystem, typically HDFS. In the case of
HDFS, since the tasktracker node (or node manager) is also running a datanode, the first block
replica will be written to the local disk.
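
To connect the reduce phase back to user code, here is a minimal reducer in the style of the classic word-count example (the class name is illustrative). The framework calls reduce() once per key with that key's values, already sorted and merged by the shuffle, and whatever is written to the context goes to the job's output format, typically files on HDFS.

    import java.io.IOException;
    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Reducer;

    // Invoked once per key during the reduce phase; the values arrive grouped and sorted by the shuffle.
    public class SumReducer extends Reducer<Text, IntWritable, Text, IntWritable> {
        @Override
        protected void reduce(Text key, Iterable<IntWritable> values, Context context)
                throws IOException, InterruptedException {
            int sum = 0;
            for (IntWritable value : values) {
                sum += value.get();
            }
            // Written through the job's output format, typically to HDFS.
            context.write(key, new IntWritable(sum));
        }
    }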
