Lecture 9

Gradient search

Quasi-Newton methods

• Idea: Instead of using the true Hessian ∇²f(x_k) in the Newton direction, use an
  approximation B_k (symmetric positive definite):

      f(x_k + p) ≈ f(x_k) + ∇f(x_k)ᵀ p + ½ pᵀ B_k p

• B_k is constructed at each step from B_{k-1} and the gradient information
  ∇f(x_k) and ∇f(x_{k-1})
• It is common to develop the iteration formula directly in terms of B_k⁻¹ = A_k:

      A_{k+1} = A_k + ΔA_k   (ΔA_k is the update term)

• Direction:
      p_k = -A_k ∇f(x_k)
• Eventually B_k converges to ∇²f(x_k)
• Quasi-Newton methods give superlinear convergence (a generic sketch of the
  iteration in code follows below)
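To make the iteration concrete, here is a minimal Python/NumPy sketch of the generic quasi-Newton loop. It is my own illustration, not from the slides: the gradient `grad`, starting point `x0`, and `update` rule are supplied by the caller, and the fixed step `alpha` stands in for a proper line search.

```python
import numpy as np

def quasi_newton(grad, x0, update, alpha=1.0, tol=1e-6, max_iter=200):
    """Generic quasi-Newton loop using an inverse-Hessian approximation A_k.

    `update(A, dx, dg)` returns A_{k+1} (e.g. a DFP or BFGS update).
    `alpha` is kept fixed here for brevity; a real code would use a line search.
    """
    x = np.asarray(x0, dtype=float)
    A = np.eye(x.size)                       # A_0 = I
    g = grad(x)
    for _ in range(max_iter):
        if np.linalg.norm(g) < tol:          # stop when the gradient is small
            break
        p = -A @ g                           # search direction p_k = -A_k grad f(x_k)
        x_new = x + alpha * p                # take the step
        g_new = grad(x_new)
        A = update(A, x_new - x, g_new - g)  # build A_{k+1} from dx_k, dg_k
        x, g = x_new, g_new
    return x
```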

Quasi-Newton methods
• DFP (Davidon-Fletcher-Powell) method¹,²

      A_{k+1} = A_k + (Δx_k Δx_kᵀ)/(Δx_kᵀ Δg_k) - (A_k Δg_k Δg_kᵀ A_k)/(Δg_kᵀ A_k Δg_k)

• BFGS (Broyden-Fletcher-Goldfarb-Shanno) method³⁻⁶

      A_{k+1} = (Δx_k Δx_kᵀ)/(Δx_kᵀ Δg_k)
                + (I - (Δx_k Δg_kᵀ)/(Δx_kᵀ Δg_k)) A_k (I - (Δg_k Δx_kᵀ)/(Δx_kᵀ Δg_k))

with A_0 = I, Δx_k = x_{k+1} - x_k, Δg_k = ∇f(x_{k+1}) - ∇f(x_k)


1. Davidon WC, AEC Res. Develop. Rep. ANL-5990, 1959
2. Fletcher R and Powell MJD, Computer J., 163, 1963
3. Broyden CG, J. Inst. Math. Appl., 76-90, 1970
4. Fletcher R, Computer J., 317-322, 1970
5. Goldfarb D, Math. Prog., 94-110, 1977
6. Shanno DF, Math. Comput., 647-657, 1970
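As a companion to the loop sketched earlier, here is a NumPy rendering of the two inverse-Hessian updates above. It is a sketch of my own, assuming `dx` and `dg` are the NumPy arrays Δx_k and Δg_k.

```python
import numpy as np

def dfp_update(A, dx, dg):
    """DFP update of the inverse-Hessian approximation A_k."""
    Adg = A @ dg
    return A + np.outer(dx, dx) / (dx @ dg) - np.outer(Adg, Adg) / (dg @ Adg)

def bfgs_update(A, dx, dg):
    """BFGS update of the inverse-Hessian approximation A_k."""
    rho = 1.0 / (dx @ dg)
    V = np.eye(len(dx)) - rho * np.outer(dx, dg)   # I - dx dg^T / (dx^T dg)
    return V @ A @ V.T + rho * np.outer(dx, dx)
```

Either function can be passed as the `update` argument of the `quasi_newton` sketch above, with A_0 = I supplied automatically by the loop.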
Quasi-Newton methods
• Both updating formulas preserve symmetry and positive definiteness.
• They satisfy the descent property.
• They require storage of the N×N matrix A_k.
• The methods are robust and work on a wide variety of practical problems.
• The DFP method suffers from the practical difficulty of A_{k+1} becoming
  ill-conditioned. The work-around is to restart with A_k = I.
• BFGS requires fewer restarts and is less dependent on exact line searches.

Application to motivating example
DFP method
After 3 iterations: x* = [14.11; 8.58], f* = -7.36
Analytical solution: x* = [14.14; 8.58], f* = -7.36

1. Initial guess x_0 = [4; 4]
2. s_1 = -∇f(x_0) = [0.65; 0.39]
3. α_1 = 13.16
4. x_1 = [12.58; 9.14]
5. A_1 = [10.12 5.31; 5.31 4.04]
6. s_2 = [0.09; -0.10]
7. α_2 = 7.00
8. x_2 = [13.20; 8.43]
9. A_2 = [9.11 -2.14; -2.14 4.37]
10. s_3 = [0.27; 0.05]
11. α_3 = 3.35
12. x_3 = [14.11; 8.58]

(Figure: contour plot of the objective with the DFP iterates overlaid.)
Application to motivating example
BFGS method
After 3 iterations: x* = [14.11; 8.62], f* = -7.36
Analytical solution: x* = [14.14; 8.58], f* = -7.36

1. Initial guess x_0 = [4; 4]
2. s_1 = -∇f(x_0) = [0.65; 0.39]
3. α_1 = 13.16
4. x_1 = [12.58; 9.14]
5. A_1 = [10.13 5.29; 5.29 4.06]
6. s_2 = [0.09; -0.105]
7. α_2 = 6.79
8. x_2 = [13.20; 8.43]
9. A_2 = [18.62 0.07; 0.07 4.89]
10. s_3 = [0.68; 0.15]
11. α_3 = 1.33
12. x_3 = [14.11; 8.62]

(Figure: contour plot of the objective with the BFGS iterates overlaid.)
Selection of step length α_k
• Exact line search: Minimize f(x_k + α_k p_k) using univariate optimization
  (see the code sketch below)
  • Can be computationally expensive
• Inexact line search: Select α_k so that it achieves a 'reasonable' reduction
  in f along the search direction p_k
• Trade-off: substantial decrease in f versus the computational effort involved

Need conditions on α_k to select an appropriate step length
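For the exact-line-search option, one minimal sketch is to hand φ(α) = f(x_k + α p_k) to a univariate optimizer; here scipy.optimize.minimize_scalar is used, and the bracket (0, alpha_max) and the function names are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import minimize_scalar

def exact_line_search(f, x, p, alpha_max=20.0):
    """Exact line search: minimize phi(alpha) = f(x + alpha*p) over [0, alpha_max]."""
    phi = lambda alpha: f(x + alpha * p)
    result = minimize_scalar(phi, bounds=(0.0, alpha_max), method="bounded")
    return result.x                        # step length alpha_k
```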

Conditions on step length
• Simple condition: require a reduction, f(x_k + α_k p_k) < f(x_k)
• Armijo condition (AC): If p_k is a descent direction, α_k should give a
  sufficient decrease in f, i.e.

      f(x_k + α_k p_k) ≤ f(x_k) + c_1 α_k ∇f(x_k)ᵀ p_k

  for some c_1 ∈ (0, 1)
• The reduction in f should be proportional to both the step length and the
  directional derivative ∇f(x_k)ᵀ p_k
• In practice, c_1 is taken as a small value (say 10⁻⁴)
• The AC alone is not sufficient to ensure reasonable progress, since it is
  satisfied for all sufficiently small values of α_k

Armijo condition – graphical interpretation
• f(x) = (x_1 - 2)⁴ + (x_1 - 2)² x_2² + (x_2 + 1)²,  x_k = [1; 1],  c_1 = 0.2
• p_k = -∇f(x_k) = [6; -6]
• The Armijo condition is satisfied for α ≤ 0.32 (checked numerically in the
  sketch below)
• Any small step α → 0 satisfies this condition
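A quick numerical check of this slide's numbers (a sketch; the objective is the reconstruction written above, and the grid search is purely illustrative):

```python
import numpy as np

# Example objective and its gradient (as reconstructed above)
f = lambda x: (x[0] - 2)**4 + (x[0] - 2)**2 * x[1]**2 + (x[1] + 1)**2
grad = lambda x: np.array([4*(x[0] - 2)**3 + 2*(x[0] - 2)*x[1]**2,
                           2*(x[0] - 2)**2 * x[1] + 2*(x[1] + 1)])

x = np.array([1.0, 1.0])
p = -grad(x)                       # steepest-descent direction, [6, -6]
c1 = 0.2
slope = grad(x) @ p                # directional derivative (negative for descent)

alphas = np.linspace(1e-4, 1.0, 10001)
armijo = np.array([f(x + a*p) <= f(x) + c1*a*slope for a in alphas])
print("Armijo condition holds up to alpha ~", alphas[armijo].max())   # ~0.32
```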


Curvature condition

      ∇f(x_k + α_k p_k)ᵀ p_k ≥ c_2 ∇f(x_k)ᵀ p_k,   c_2 ∈ (c_1, 1)

• Rules out very small steps.
• Typically c_2 = 0.9 for Newton-type methods and 0.1 for conjugate gradient methods
• For the example above with c_1 = 0.2, c_2 = 0.4: the curvature condition gives
  α ≥ 0.07, so together with the Armijo condition 0.07 ≤ α ≤ 0.32 (verified in the
  sketch below)
• Exact line search: α* = 0.25
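The curvature bound and the combined interval quoted on this slide can be checked the same way. This is a self-contained sketch that repeats the example setup from above.

```python
import numpy as np

f = lambda x: (x[0] - 2)**4 + (x[0] - 2)**2 * x[1]**2 + (x[1] + 1)**2
grad = lambda x: np.array([4*(x[0] - 2)**3 + 2*(x[0] - 2)*x[1]**2,
                           2*(x[0] - 2)**2 * x[1] + 2*(x[1] + 1)])

x = np.array([1.0, 1.0]); p = -grad(x)
c1, c2 = 0.2, 0.4
slope0 = grad(x) @ p

alphas = np.linspace(1e-4, 1.0, 10001)
armijo    = np.array([f(x + a*p) <= f(x) + c1*a*slope0 for a in alphas])
curvature = np.array([grad(x + a*p) @ p >= c2*slope0 for a in alphas])
both = alphas[armijo & curvature]
print("curvature: alpha >= %.2f;  both: %.2f <= alpha <= %.2f"
      % (alphas[curvature].min(), both.min(), both.max()))   # ~0.07 and ~[0.07, 0.32]
```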
Wolfe conditions
• Armijo condition and curvature condition together:

      f(x_k + α_k p_k) ≤ f(x_k) + c_1 α_k ∇f(x_k)ᵀ p_k
      ∇f(x_k + α_k p_k)ᵀ p_k ≥ c_2 ∇f(x_k)ᵀ p_k

  with 0 < c_1 < c_2 < 1

• Strong Wolfe conditions: to bring α_k closer to a local minimizer

      f(x_k + α_k p_k) ≤ f(x_k) + c_1 α_k ∇f(x_k)ᵀ p_k
      |∇f(x_k + α_k p_k)ᵀ p_k| ≤ c_2 |∇f(x_k)ᵀ p_k|

  with 0 < c_1 < c_2 < 1

Small c_1 and large c_2 lead to a large bound on α (a generic check is sketched below)
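A compact check of the (strong) Wolfe conditions, written as a sketch of my own: `f`, `grad`, `x`, `p` are assumed to be supplied by the caller, and the defaults c_1 = 10⁻⁴, c_2 = 0.9 follow the values quoted earlier for Newton-type methods.

```python
import numpy as np

def wolfe_ok(f, grad, x, p, alpha, c1=1e-4, c2=0.9, strong=False):
    """Return True if `alpha` satisfies the Wolfe (or strong Wolfe) conditions."""
    slope0 = grad(x) @ p                                   # grad f(x_k)^T p_k, < 0 for descent
    sufficient = f(x + alpha*p) <= f(x) + c1*alpha*slope0  # Armijo condition
    slope_a = grad(x + alpha*p) @ p                        # grad f(x_k + alpha p_k)^T p_k
    if strong:
        curvature = abs(slope_a) <= c2 * abs(slope0)       # strong Wolfe curvature
    else:
        curvature = slope_a >= c2 * slope0                 # standard curvature condition
    return bool(sufficient and curvature)
```

With the example objective defined earlier, wolfe_ok(f, grad, x, p, 0.25, c1=0.2, c2=0.4) should return True, consistent with the exact-line-search step α* = 0.25.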

Strong Wolfe conditions
• For the example above with c_1 = 0.2, c_2 = 0.4: 0.07 ≤ α ≤ 0.31
• The strong Wolfe conditions eliminate the region 0.31 ≤ α ≤ 0.32, where
  ∇f(x_k + α_k p_k)ᵀ p_k is large and positive
Why c_1 < c_2?
• With c_1 = 0.5 and c_2 = 0.2 (i.e. c_1 > c_2), the Armijo condition requires
  α ≤ 0.12 while the curvature condition requires α ≥ 0.14
• No feasible α exists

Backtracking algorithm
• Based on the Armijo condition (see the sketch after this list)
1. Choose ᾱ > 0 and ρ, c_1 ∈ (0, 1)
2. Set α = ᾱ
3. Is f(x_k + α p_k) ≤ f(x_k) + c_1 α ∇f(x_k)ᵀ p_k satisfied?
   a. Yes: set α_k = α
   b. No: set α = ρα and go back to step 3
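A minimal sketch of the backtracking loop above; the defaults ᾱ = 1, ρ = 0.5, c_1 = 10⁻⁴ are illustrative choices, not prescribed by the slide, and a cap on the number of reductions is added as a safeguard.

```python
import numpy as np

def backtracking(f, grad, x, p, alpha_bar=1.0, rho=0.5, c1=1e-4, max_reductions=50):
    """Backtracking line search based on the Armijo condition."""
    alpha = alpha_bar
    slope = grad(x) @ p                                  # grad f(x_k)^T p_k
    for _ in range(max_reductions):
        if f(x + alpha*p) <= f(x) + c1*alpha*slope:      # Armijo satisfied?
            break
        alpha *= rho                                     # shrink the step and retry
    return alpha
```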

How to select ᾱ
• For Newton-type methods, ᾱ = 1
• Choice 1: Assume that the first-order change in the function across two
  iterates is the same:

      ᾱ ∇f(x_k)ᵀ p_k = α_{k-1} ∇f(x_{k-1})ᵀ p_{k-1}

      ᾱ = α_{k-1} ∇f(x_{k-1})ᵀ p_{k-1} / (∇f(x_k)ᵀ p_k)

• Choice 2: Fit a quadratic through f(x_{k-1}), f(x_k) and ∇f(x_k)ᵀ p_k, and
  take ᾱ as its minimizer (both choices are sketched in code below):

      ᾱ = 2 (f(x_k) - f(x_{k-1})) / (∇f(x_k)ᵀ p_k)
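Both choices can be written down directly. This is a sketch with my own argument names: `slope_prev` and `slope_curr` stand for ∇f(x_{k-1})ᵀ p_{k-1} and ∇f(x_k)ᵀ p_k.

```python
def initial_step(alpha_prev, slope_prev, slope_curr, f_prev=None, f_curr=None, choice=1):
    """Initial trial step for an inexact line search (use 1.0 for Newton-type methods)."""
    if choice == 1:
        # Choice 1: equate the first-order change in f across two iterates
        return alpha_prev * slope_prev / slope_curr
    # Choice 2: minimizer of the quadratic fit through f(x_{k-1}), f(x_k) and slope_curr
    return 2.0 * (f_curr - f_prev) / slope_curr
```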

Newton with inexact line search
After 4 iterations: x* = [14.13; 8.58], f* = -7.36
Analytical solution: x* = [14.14; 8.58], f* = -7.36

1. Initial guess x_0 = [4; 4]
2. α_0 = 0.25
3. x_1 = [9.64; 9.09]
4. α_1 = 1.00
5. x_2 = [12.39; 8.71]
6. α_2 = 1.00
7. x_3 = [13.83; 8.61]
8. α_3 = 1.00
9. x_4 = [14.13; 8.58]
