CAS CS 460/660 Introduction To Database Systems Query Optimization

The document discusses query optimization in database systems. It provides examples of alternative query execution plans for a sample query joining the Reserves and Sailors tables. Selections can be "pushed down" below the join in the query tree, and the join inputs can be reordered or materialized, to reduce the number of I/Os compared to a naive nested loops plan. The goal is to find an efficient plan that computes the same result while minimizing the estimated execution cost, measured in I/Os.

Uploaded by

Arnaldo Canelas

CAS CS 460/660

Introduction to Database Systems

Query Optimization

1.1
Review
• Implementation of relational operations as iterators
• Focus largely on external algorithms (sorting/hashing)
• Choices depend on indexes, memory, stats, …
• Joins
  - Blocked nested loops: simple, exploits extra memory
  - Indexed nested loops: best if one relation is small and the other is indexed
  - Sort/merge join: good with a small amount of memory, bad with duplicates
  - Hash join: fast (given enough memory), bad with skewed data; relatively easy to parallelize
• Sort-based and hash-based aggregation and duplicate elimination

1.2
Query Optimization Overview
• A query can be converted to relational algebra
• The relational algebra expression is converted to a tree, with joins as branches
• Each operator has implementation choices
• Operators can also be applied in a different order!

SELECT S.sname
FROM Reserves R, Sailors S
WHERE R.sid=S.sid AND
R.bid=100 AND S.rating>5

$\pi_{sname}(\sigma_{bid=100 \wedge rating>5}(Reserves \bowtie Sailors))$

Plan tree (top to bottom): $\pi_{sname}$; $\sigma_{bid=100 \wedge rating>5}$; $\bowtie_{sid=sid}$; leaves Reserves (left) and Sailors (right).
1.3
Iterator Interface (pull from the top)
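• Recall: relational operators at the nodes of the plan tree support a uniform iterator interface: Open( ), get_next( ), close( )
• Unary ops: on Open(), call Open() on the child.
• Binary ops: call Open() on the left child, then on the right.
• By convention, the outer relation is on the left.

Plan tree (top to bottom): $\pi_{sname}$; $\sigma_{bid=100 \wedge rating>5}$; $\bowtie_{sid=sid}$; leaves Reserves (left) and Sailors (right).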

Alternative is pipelining (i.e. a “push”-based approach).

Can combine push & pull using special operators.
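
The pull-based interface described on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the class names (Scan, Select, Project, NestedLoopsJoin), the lowercase method spellings, and the three sample rows per table are hypothetical and do not come from the slides. The root of the tree is asked for tuples, and each operator in turn pulls from its children.

```python
# Sketch of a pull-based iterator tree. Class names, method spellings and
# the sample rows are illustrative assumptions, not taken from the slides.

class Scan:
    """Leaf operator: iterates over an in-memory list of dict rows."""
    def __init__(self, rows):
        self.rows = rows
        self.pos = 0

    def open(self):
        self.pos = 0

    def get_next(self):
        if self.pos >= len(self.rows):
            return None  # end of input
        row = self.rows[self.pos]
        self.pos += 1
        return row

    def close(self):
        self.pos = len(self.rows)


class Select:
    """Unary operator: open() simply opens the child."""
    def __init__(self, child, pred):
        self.child, self.pred = child, pred

    def open(self):
        self.child.open()

    def get_next(self):
        while (row := self.child.get_next()) is not None:
            if self.pred(row):
                return row
        return None

    def close(self):
        self.child.close()


class Project:
    """Unary operator: keeps only the listed columns."""
    def __init__(self, child, cols):
        self.child, self.cols = child, cols

    def open(self):
        self.child.open()

    def get_next(self):
        row = self.child.get_next()
        return None if row is None else {c: row[c] for c in self.cols}

    def close(self):
        self.child.close()


class NestedLoopsJoin:
    """Binary operator: opens the left (outer) child, then the right (inner)."""
    def __init__(self, left, right, pred):
        self.left, self.right, self.pred = left, right, pred
        self.outer_row = None

    def open(self):
        self.left.open()
        self.right.open()
        self.outer_row = self.left.get_next()

    def get_next(self):
        while self.outer_row is not None:
            while (inner := self.right.get_next()) is not None:
                if self.pred(self.outer_row, inner):
                    return {**self.outer_row, **inner}
            self.right.open()  # rewind the inner for the next outer row
            self.outer_row = self.left.get_next()
        return None

    def close(self):
        self.left.close()
        self.right.close()


reserves = [{"sid": 1, "bid": 100}, {"sid": 2, "bid": 101}, {"sid": 3, "bid": 100}]
sailors = [{"sid": 1, "sname": "Dustin", "rating": 7},
           {"sid": 2, "sname": "Lubber", "rating": 8},
           {"sid": 3, "sname": "Rusty", "rating": 3}]

plan = Project(
    Select(
        NestedLoopsJoin(Scan(reserves), Scan(sailors),
                        lambda r, s: r["sid"] == s["sid"]),
        lambda t: t["bid"] == 100 and t["rating"] > 5),
    ["sname"])

plan.open()
result = []
while (row := plan.get_next()) is not None:
    result.append(row)
plan.close()
print(result)
```

Only the consumer loop at the bottom drives execution; no operator produces a tuple until its parent asks for one.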

1.4
Query Optimization Overview (cont)

• Logical plan: tree of R.A. ops
• Physical plan: tree of R.A. ops, with a choice of algorithm for each operator
• Two main issues:
  - For a given query, what plans are considered? (An algorithm searches the plan space for the cheapest estimated plan.)
  - How is the cost of a plan estimated?
• Ideally: want to find the best plan.
• Reality: avoid the worst plans!

1.5
Cost-based Query Sub-System
Components (top to bottom of the diagram):
• Queries, e.g. Select * From Blah B Where B.blah = blah
• Query Parser
• Query Optimizer
  - Plan Generator
  - Plan Cost Estimator
  - Catalog Manager (Schema, Statistics)
• Query Plan Evaluator

Note: usually there is a heuristics-based rewriting step before the cost-based steps.

1.6
Schema for Examples
Sailors (sid: integer, sname: string, rating: integer, age: real)

Reserves (sid: integer, bid: integer, day: dates, rname: string)

• As seen in previous lectures…
• Reserves:
  - Each tuple is 40 bytes long, 100 tuples per page, 1000 pages.
  - Let's say there are 100 boats.
• Sailors:
  - Each tuple is 50 bytes long, 80 tuples per page, 500 pages.
  - Let's say there are 10 different ratings.
• Assume we have 5 pages in our buffer pool.
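
As a rough sketch of how these catalog figures feed size estimation, the snippet below derives the table cardinalities and the number of pages that survive each selection. The TableStats class and the helper name are hypothetical, and the reduction factors assume uniformly distributed values (1/100 for bid=100 with 100 boats, 5/10 for rating>5 with ratings 1 through 10).

```python
from dataclasses import dataclass
from fractions import Fraction
from math import ceil


@dataclass(frozen=True)
class TableStats:
    """Hypothetical catalog entry holding the figures quoted on the slide."""
    name: str
    pages: int
    tuples_per_page: int

    @property
    def tuples(self) -> int:
        return self.pages * self.tuples_per_page


RESERVES = TableStats("Reserves", pages=1000, tuples_per_page=100)
SAILORS = TableStats("Sailors", pages=500, tuples_per_page=80)
BUFFER_PAGES = 5


def pages_after_selection(table: TableStats, reduction_factor: Fraction) -> int:
    """Estimated pages needed to hold the tuples that pass a selection."""
    return ceil(table.pages * reduction_factor)


# Assumption: values are uniformly distributed.
rf_bid_eq_100 = Fraction(1, 100)   # 100 boats
rf_rating_gt_5 = Fraction(5, 10)   # ratings 6..10 out of 1..10

print(RESERVES.tuples, SAILORS.tuples)
print(pages_after_selection(RESERVES, rf_bid_eq_100))
print(pages_after_selection(SAILORS, rf_rating_gt_5))
```

The two page estimates are the temp-relation sizes that the later plan costs rely on.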

1.7
Motivating Example

SELECT S.sname
FROM Reserves R, Sailors S
WHERE R.sid=S.sid AND
R.bid=100 AND S.rating>5

Plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\sigma_{bid=100 \wedge rating>5}$ (on-the-fly); $\bowtie_{sid=sid}$ (page-oriented nested loops); Sailors (outer) and Reserves (inner).

• Cost: 500+500*1000 I/Os
• By no means the worst plan!
• Misses several opportunities: selections could have been "pushed" earlier, no use is made of any available indexes, etc.
• Goal of optimization: to find more efficient plans that compute the same answer.
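
To see where the 500+500*1000 figure comes from, here is a small simulation of a page-oriented nested loops join that counts page reads. It is a sketch on scaled-down, made-up data (5 outer pages and 10 inner pages); the function names are hypothetical. Each outer page is read once, and the whole inner relation is read once per outer page.

```python
def paginate(rows, per_page):
    """Split a list of rows into fixed-size pages (lists of rows)."""
    return [rows[i:i + per_page] for i in range(0, len(rows), per_page)]


def page_nested_loops(outer_pages, inner_pages, match):
    """Join page by page; return the matching pairs and the page reads."""
    reads = 0
    results = []
    for outer_page in outer_pages:
        reads += 1                      # read one outer page
        for inner_page in inner_pages:
            reads += 1                  # rescan the inner for this outer page
            results.extend((o, i) for o in outer_page for i in inner_page
                           if match(o, i))
    return results, reads


def page_nl_cost(outer_pages, inner_pages):
    """Closed form of the read count above: outer + outer * inner."""
    return outer_pages + outer_pages * inner_pages


# Scaled-down, made-up data: 5 pages of Sailors, 10 pages of Reserves.
sailors = paginate([{"sid": s} for s in range(10)], per_page=2)
reserves = paginate([{"sid": k % 10, "bid": k} for k in range(30)], per_page=3)

results, reads = page_nested_loops(
    sailors, reserves, lambda s, r: s["sid"] == r["sid"])

print(reads, page_nl_cost(len(sailors), len(reserves)))
print(page_nl_cost(500, 1000))  # the slide's figures: Sailors outer, Reserves inner
```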

1.8
Alternative Plans – Push Selects
(No Indexes)
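Left plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\sigma_{bid=100 \wedge rating>5}$ (on-the-fly); $\bowtie_{sid=sid}$ (page-oriented nested loops); Sailors (outer), Reserves (inner). Cost: 500,500 I/Os.

Right plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\sigma_{bid=100}$ (on-the-fly); $\bowtie_{sid=sid}$ (page-oriented nested loops); $\sigma_{rating>5}$(Sailors) (on-the-fly, outer), Reserves (inner). Cost: 250,500 I/Os (500 + 250*1000).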

1.9
Alternative Plans – Push Selects
(No Indexes)

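Left plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\sigma_{bid=100}$ (on-the-fly); $\bowtie_{sid=sid}$ (page-oriented nested loops); $\sigma_{rating>5}$(Sailors) (on-the-fly, outer), Reserves (inner). Cost: 250,500 I/Os.

Right plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\bowtie_{sid=sid}$ (page-oriented nested loops); $\sigma_{rating>5}$(Sailors) (on-the-fly, outer), $\sigma_{bid=100}$(Reserves) (on-the-fly, inner). Cost: 250,500 I/Os. Applying bid=100 on the fly to the inner saves no I/O, because Reserves is still scanned in full for each outer page.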
1.10
Alternative Plans – Push Selects
(No Indexes)

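Left plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\bowtie_{sid=sid}$ (page-oriented nested loops); $\sigma_{rating>5}$(Sailors) (on-the-fly, outer), $\sigma_{bid=100}$(Reserves) (on-the-fly, inner). Cost: 250,500 I/Os.

Right plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\sigma_{rating>5}$ (on-the-fly); $\bowtie_{sid=sid}$ (page-oriented nested loops); $\sigma_{bid=100}$(Reserves) (on-the-fly, outer), Sailors (inner). Cost: 6000 I/Os (1000 + 10*500).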
1.11
Alternative Plans – Push Selects
(No Indexes)

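Left plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\sigma_{rating>5}$ (on-the-fly); $\bowtie_{sid=sid}$ (page-oriented nested loops); $\sigma_{bid=100}$(Reserves) (on-the-fly, outer), Sailors (inner). Cost: 6000 I/Os.

Right plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\bowtie_{sid=sid}$ (page-oriented nested loops); $\sigma_{bid=100}$(Reserves) (on-the-fly, outer), $\sigma_{rating>5}$(Sailors) (scan and write to temp T2, inner). Cost: 4250 I/Os = 1000 + 500 + 250 + (10 * 250).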

1.12
Alternative Plans – Push Selects
(No Indexes)

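Left plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\bowtie_{sid=sid}$ (page-oriented nested loops); $\sigma_{bid=100}$(Reserves) (on-the-fly, outer), $\sigma_{rating>5}$(Sailors) (scan and write to temp T2, inner). Cost: 4250 I/Os.

Right plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\bowtie_{sid=sid}$ (page-oriented nested loops); $\sigma_{rating>5}$(Sailors) (on-the-fly, outer), $\sigma_{bid=100}$(Reserves) (scan and write to temp T2, inner). Cost: 4010 I/Os = 500 + 1000 + 10 + (250 * 10).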
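
The costs quoted across the "Push Selects" slides can be reproduced with two small formulas. The sketch below is not an optimizer; the function names and plan labels are hypothetical, and the page counts (10 pages for bid=100, 250 pages for rating>5) are the estimates used on the slides.

```python
RESERVES_PAGES = 1000
SAILORS_PAGES = 500
BID_100_PAGES = 10     # estimated pages of Reserves tuples with bid=100
RATING_GT5_PAGES = 250 # estimated pages of Sailors tuples with rating>5


def nl_inner_rescanned(outer_scan, outer_output_pages, inner_scan):
    """Scan the outer once; rescan the full inner per page of outer output."""
    return outer_scan + outer_output_pages * inner_scan


def nl_inner_materialized(outer_scan, outer_output_pages, inner_scan, temp_pages):
    """Scan the outer, scan the inner, write the temp, rescan the temp per outer page."""
    return outer_scan + inner_scan + temp_pages + outer_output_pages * temp_pages


plans = {
    "no pushdown, Sailors outer":
        nl_inner_rescanned(SAILORS_PAGES, SAILORS_PAGES, RESERVES_PAGES),
    "push rating>5 onto Sailors (outer)":
        nl_inner_rescanned(SAILORS_PAGES, RATING_GT5_PAGES, RESERVES_PAGES),
    "push bid=100 onto Reserves (outer)":
        nl_inner_rescanned(RESERVES_PAGES, BID_100_PAGES, SAILORS_PAGES),
    "Reserves outer, rating>5(Sailors) written to temp":
        nl_inner_materialized(RESERVES_PAGES, BID_100_PAGES,
                              SAILORS_PAGES, RATING_GT5_PAGES),
    "Sailors outer, bid=100(Reserves) written to temp":
        nl_inner_materialized(SAILORS_PAGES, RATING_GT5_PAGES,
                              RESERVES_PAGES, BID_100_PAGES),
}

for label, cost in sorted(plans.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{cost:>7} I/Os  {label}")
```

Pushing a selection onto the outer shrinks the number of inner rescans, and materializing the filtered inner shrinks the size of each rescan.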

1.13
Alternative Plans 1
(No Indexes)

Plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\bowtie_{sid=sid}$ (sort-merge join); $\sigma_{bid=100}$(Reserves) (scan; write to temp T1) and $\sigma_{rating>5}$(Sailors) (scan; write to temp T2).

• Main difference: sort-merge join
• With 5 buffers, cost of plan:
  - Scan Reserves (1000) + write temp T1 (10 pages, if we have 100 boats, uniform distribution).
  - Scan Sailors (500) + write temp T2 (250 pages, if we have 10 ratings).
  - Sort T1 (2*2*10), sort T2 (2*4*250), merge (10+250)
  - Total: 4060 page I/Os. (Note: the T2 sort takes 4 passes with B=5.)
• If we use a BNL join instead, join = 10+4*250, total cost = 2770.
• Can also "push" projections, but must be careful!
  - T1 has only sid, T2 only sid, sname:
  - T1 fits in 3 pages, cost of BNL is under 250 pages, total < 2000.
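
The arithmetic on this slide can be checked with a short sketch. It assumes the usual external merge sort model (pass 0 builds runs of B pages, later passes do (B-1)-way merges, and each pass reads and writes every page) and a block nested loops join that holds B-2 pages of the outer at a time. The function names are hypothetical.

```python
from math import ceil


def sort_passes(pages, buffers):
    """Passes of external merge sort: run formation, then (buffers-1)-way merges."""
    runs = ceil(pages / buffers)
    passes = 1
    while runs > 1:
        runs = ceil(runs / (buffers - 1))
        passes += 1
    return passes


def sort_cost(pages, buffers):
    """Each pass reads and writes every page once."""
    return 2 * pages * sort_passes(pages, buffers)


def bnl_join_cost(outer_pages, inner_pages, buffers):
    """Block nested loops: scan the inner once per (buffers-2)-page outer block."""
    return outer_pages + ceil(outer_pages / (buffers - 2)) * inner_pages


B = 5
T1, T2 = 10, 250                          # temp sizes in pages, from the slide
materialize = (1000 + T1) + (500 + T2)    # scan both tables, write both temps

smj_total = materialize + sort_cost(T1, B) + sort_cost(T2, B) + (T1 + T2)
bnl_total = materialize + bnl_join_cost(T1, T2, B)

print(sort_passes(T1, B), sort_passes(T2, B))
print(smj_total, bnl_total)
```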

1.14
Alt Plan 2: Indexes

Plan (top to bottom): $\pi_{sname}$ (on-the-fly); $\sigma_{rating>5}$ (on-the-fly); $\bowtie_{sid=sid}$ (index nested loops, with pipelining); $\sigma_{bid=100}$(Reserves) (use hash index; do not write to temp) as outer, Sailors as inner.

• With a clustered hash index on bid of Reserves, we get 100,000/100 = 1000 tuples on 1000/100 = 10 pages.
• INL with outer not materialized.
  - Projecting out unnecessary fields from the outer doesn't help.
• Join column sid is a key for Sailors.
  - At most one matching tuple, so an unclustered index on sid is OK.
• The decision not to push rating>5 before the join is based on the availability of the sid index on Sailors.
• Cost: selection of Reserves tuples (10 I/Os); then, for each, must get the matching Sailors tuple (1000*1.2); total 1210 I/Os.
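
A minimal sketch of the index nested loops estimate above, using the slide's figures. The function name is hypothetical, and the 1.2 I/Os per probe is taken from the slide as given; Fraction is used only to keep the arithmetic exact.

```python
from fractions import Fraction


def index_nested_loops_cost(outer_pages, outer_tuples, probe_cost):
    """Read the selected outer pages, then probe the inner index once per outer tuple."""
    return outer_pages + outer_tuples * probe_cost


reserves_tuples, reserves_pages, boats = 100_000, 1000, 100

matching_tuples = reserves_tuples // boats   # tuples with bid=100
matching_pages = reserves_pages // boats     # clustered index: contiguous pages
probe_cost = Fraction(12, 10)                # 1.2 I/Os per Sailors lookup (slide's figure)

total = index_nested_loops_cost(matching_pages, matching_tuples, probe_cost)
print(matching_tuples, matching_pages, int(total))
```

Because the cost is driven by the number of outer tuples rather than outer pages, shrinking the outer's tuple width by projection does not lower it.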

1.15
What is needed for optimization?

• Iterator interface
• Cost estimation
• Statistics and catalogs
• Size estimation and reduction factors

1.16
Query Blocks: Units of Optimization

SELECT S.sname
FROM Sailors S
WHERE S.age IN
(SELECT MAX (S2.age)
FROM Sailors S2
GROUP BY S2.rating)

Outer block: SELECT S.sname FROM Sailors S WHERE S.age IN (...). Nested block: the parenthesized SELECT MAX (S2.age) ... GROUP BY S2.rating subquery.

• An SQL query is parsed into a collection of query blocks, and these are optimized one block at a time.
• Inner blocks are usually treated as subroutines.
• Computed:
  - once per query (for uncorrelated sub-queries)
  - or once per outer tuple (for correlated sub-queries)
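
The difference between the two evaluation strategies can be illustrated by treating the nested block as a subroutine and counting its invocations. This is a sketch over made-up Sailors rows; the function names and the call counter are hypothetical. The nested block here is uncorrelated, so one invocation suffices; the second function shows the per-tuple invocation pattern that a correlated subquery would force.

```python
# Made-up rows: (sname, rating, age)
sailors = [
    ("Dustin", 7, 45.0),
    ("Lubber", 8, 55.5),
    ("Rusty", 10, 35.0),
    ("Horatio", 7, 35.0),
    ("Zorba", 10, 16.0),
]

calls = {"nested": 0}


def nested_block(rows):
    """SELECT MAX(S2.age) FROM Sailors S2 GROUP BY S2.rating"""
    calls["nested"] += 1
    max_age = {}
    for _, rating, age in rows:
        max_age[rating] = max(age, max_age.get(rating, age))
    return set(max_age.values())


def outer_block_once_per_query(rows):
    """Uncorrelated case: evaluate the nested block once and reuse the result."""
    ages = nested_block(rows)
    return [sname for sname, _, age in rows if age in ages]


def outer_block_once_per_tuple(rows):
    """Invocation pattern of the correlated case: re-run the nested block per outer tuple."""
    return [sname for sname, _, age in rows if age in nested_block(rows)]


once_result = outer_block_once_per_query(sailors)
once_calls = calls["nested"]

calls["nested"] = 0
per_tuple_result = outer_block_once_per_tuple(sailors)
per_tuple_calls = calls["nested"]

print(once_result, once_calls, per_tuple_calls)
```

Both strategies return the same names; only the number of times the inner block runs differs.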

1.17
Translating SQL to Relational Algebra
SELECT S.sid, MIN (R.day)
FROM Sailors S, Reserves R, Boats B
WHERE S.sid = R.sid AND R.bid = B.bid AND B.color = 'red'
AND S.rating = ( SELECT MAX (S2.rating) FROM Sailors S2)
GROUP BY S.sid
HAVING COUNT (*) >= 2

For each sailor with the highest rating (over all sailors), and at least two
reservations for red boats, find the sailor id and the earliest date on which the
sailor has a reservation for a red boat.

1.18
Translating SQL to Relational Algebra

SELECT S.sid, MIN (R.day)
FROM Sailors S, Reserves R, Boats B
WHERE S.sid = R.sid AND R.bid = B.bid AND B.color = 'red'
AND S.rating = ( SELECT MAX (S2.rating) FROM Sailors S2)
GROUP BY S.sid
HAVING COUNT (*) >= 2

The inner block ( SELECT MAX (S2.rating) FROM Sailors S2) is replaced by its value, val, in the translation of the outer block:

$\pi_{S.sid,\ MIN(R.day)}(HAVING_{COUNT(*) \geq 2}(GROUP\ BY_{S.sid}(\sigma_{B.color = 'red' \wedge S.rating = val}(Sailors \bowtie Reserves \bowtie Boats))))$

1.19
Relational Algebra Equivalences
• Allow us to choose different operator orders and to "push" selections and projections ahead of joins.
• Selections:
  $\sigma_{c_1 \wedge \dots \wedge c_n}(R) \equiv \sigma_{c_1}(\dots(\sigma_{c_n}(R))\dots)$ (Cascade)
  $\sigma_{c_1}(\sigma_{c_2}(R)) \equiv \sigma_{c_2}(\sigma_{c_1}(R))$ (Commute)
• Projections:
  $\pi_{a_1}(R) \equiv \pi_{a_1}(\dots(\pi_{a_n}(R))\dots)$ (Cascade, if $a_n$ includes $a_{n-1}$ includes … $a_1$)
• Joins:
  $R \bowtie (S \bowtie T) \equiv (R \bowtie S) \bowtie T$ (Associative)
  $(R \bowtie S) \equiv (S \bowtie R)$ (Commute)
  These two mean we can do joins in any order.
1.20
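
The equivalences above can be spot-checked on tiny relations. The sketch below is a check on made-up data, not a proof: relations are lists of dicts, the join is a natural join on shared attribute names, and results are compared as sets so that duplicates and row order are ignored. All helper names are hypothetical.

```python
def select(rel, pred):
    return [t for t in rel if pred(t)]


def project(rel, attrs):
    return [{a: t[a] for a in attrs} for t in rel]


def natural_join(r, s):
    """Join on all attributes the two tuples have in common."""
    out = []
    for a in r:
        for b in s:
            if all(a[k] == b[k] for k in a.keys() & b.keys()):
                out.append({**a, **b})
    return out


def as_set(rel):
    """Set semantics: ignore duplicates and row order."""
    return {frozenset(t.items()) for t in rel}


sailors = [{"sid": 1, "rating": 7}, {"sid": 2, "rating": 3}, {"sid": 3, "rating": 9}]
reserves = [{"sid": 1, "bid": 100}, {"sid": 2, "bid": 100}, {"sid": 3, "bid": 101}]
boats = [{"bid": 100, "color": "red"}, {"bid": 101, "color": "green"}]

c1 = lambda t: t["rating"] > 5
c2 = lambda t: t["sid"] < 3

# Selections: cascade and commute
cascade_ok = as_set(select(sailors, lambda t: c1(t) and c2(t))) == \
    as_set(select(select(sailors, c2), c1))
commute_ok = as_set(select(select(sailors, c2), c1)) == \
    as_set(select(select(sailors, c1), c2))

# Projections: cascade (the outer attribute list is contained in the inner one)
proj_ok = as_set(project(sailors, ["sid"])) == \
    as_set(project(project(sailors, ["sid", "rating"]), ["sid"]))

# Joins: associative and commutative
assoc_ok = as_set(natural_join(sailors, natural_join(reserves, boats))) == \
    as_set(natural_join(natural_join(sailors, reserves), boats))
join_commute_ok = as_set(natural_join(sailors, reserves)) == \
    as_set(natural_join(reserves, sailors))

# Pushing a selection ahead of a join
pushdown_ok = as_set(select(natural_join(reserves, sailors), c1)) == \
    as_set(natural_join(reserves, select(sailors, c1)))

print(cascade_ok, commute_ok, proj_ok, assoc_ok, join_commute_ok, pushdown_ok)
```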
