Herrero - Free extrema of two variables functions
Herrero - Free extrema of two variables functions
variables functions
2. Introduction
In many situations it is important to know where a function reaches a local
maximum or minimum value. For instance, if we consider the temperature
function, it can be interesting to know in which points the temperature is
higher or lower than at any point near them. This is what we are going to
consider in this work.
It is known that a one variable function has a local maximum or mi-
nimum when the growing behavior changes from increasing to decreasing
(maximum) or from decreasing to increasing (minimum). These situations
can be characterized using the first derivative of the function. In the same
way we generalize the concept of maximum and minimum value of a function
to functions of several variables.
1
and absolute extrema as happens in one variable functions.
f (x, y) ≤ f (x0 , y0 )
f (x, y) ≥ f (x0 , y0 )
Remark 1 The relative extreme points of a function are the points where
the function has a relative maximum or minimum. They are also called free
extreme points.
Remark 2 Note that the above definitions are also valid for functions of
several variables in general.
2
∂f
(x0 , y0 ) don’t exist or equals 0.
∂y
This concept can also be generalized to functions of more than two va-
riables. In fact, a point x~0 ∈ Rn is a critical point of a function f (~x), with
~x ∈ Rn , if all the first partial derivatives of the function at that point are
null or one of them doesn’t exist.
Note that, as in one variable functions, the first order partial derivatives
can vanish for points that are no extrema. That is, not all critical points are
extrema of f (x, y).
∂f
= 2y − 2x = 0
∂y
From second equation, x = y and substituting at the firs equation
So, x = 0 or x = 8/3. Then, the points (0, 0) and (8/3, 8/3) are critical
points of f (x, y).
3
In order to classify the critical points of a function of two variables we can
use a geometrical method (plot the function) or the second derivative test to
determine what kind of extrema is each critical point. The difference with
one variable function is that we don’t have a second derivative of the function
but four second partial derivatives. To consider all of them we construct a
square matrix with all of them and calculate its determinant.
Using the second partial derivatives and this determinant we obtain a way
of classifying the critical points to determine if they are minima, maxima or
saddle points.
∂2 f
If Hf (x0 , y0 ) > 0 and (x0 , y0 ) > 0 then f (x0 , y0 ) is a relative mini-
∂ x2
mum.
∂2 f
If Hf (x0 , y0 ) > 0 and (x0 , y0 ) < 0 then f (x0 , y0 ) is a relative ma-
∂ x2
ximum.
There are other equivalent versions of the previous test. For example,
∂ 2f
some texts may use instead of the corresponding derivative of f (x, y)
∂y 2
with respect to x twice.
4
Example 2 Classify the critical points of f (x, y) = x3 − 3x2 + y 2 − 2xy
Solution: The first partial derivatives of f (x, y) are
∂f ∂f
= 3x2 − 6x − 2y and = 2y − 2x
∂x ∂y
Considering their nullity, the points (0, 0) and (8/3, 8/3) are critical points
of f (x, y).
Now, the hessian of the function at any point (x, y) is
6x − 6 −2
Hf (x, y) = = 12x − 8.
−2 2
For the first critical point one has Hf (0, 0) = −8 < 0, then the function
has a saddle point at (0, 0).
For the second critical point one has
∂ 2f 8 8
8 8
Hf , = 24 > 0 and , = 10 > 0,
3 3 ∂x2 3 3
8 8
then, the function has a relative minimum at the point , and its value
3 3
8 8 256
is f , =− .
3 3 27
∂f ∂f
= 3x2 − 6y − 39 and = 2y − 6x + 18,
∂x ∂y
and equal them to zero. From the second equation y = 3x − 9 and substituting
at the first equation
So x = 5 or x = 1. Then, the points (5, 6) and (1, −6) are critical points of
f (x, y).
5
Now, the hessian of the function at any point (x, y) is
6x −6
Hf (x, y) = = 12x − 36.
−6 2
∂ 2f
Hf (5, 6) > 0 and (5, 6) = 30 > 0,
∂x2
then, the function has a relative minimum at the point (5, 6) and its value is
f (5, 6) = −86.
For the second critical point one has Hf (1, −6) < 0 and , then the function
has a saddle point at (1, −6).
3. Closing
We have studied relative extrema of a two variables function, how to
calculate and classify them. Extrema of a function are necessarily critical
points, so we can use the first partial derivatives of the function to find them.
Critical points are those points satisfying that the first partial derivatives of
the function are both null or at least one of them doesn’t exist. Once we have
the critical points we can use the second derivative test to determine if they
are maxima, minima or saddle points of the function. That is, we calculate
the hessian of the function at each critical point. If the hessian is negative
then the point represents a saddle point, if the hessian is positive then the
point is a maximum or a minimum. In some cases (when the hessian equals
zero) this test can not conclude what kind of extrema is the critical point.
In such cases we would have to use other methods (for example, geometrical
methods).
Definitions of relative maxima and minima can be extended to functions
with more than two variables. In this cases extrema are also critical points,
which can be obtained from the nullity of all the first partial derivatives of the
function or if some of them do not exist. Then, the Hessian is a determinant
of a square matrix with all the possible second derivatives (that is, of size n,
with n equal to the number of variables of the function). However the second
derivative test is different and it is not studied in this work.
6
4. Bibliography
There are many books studying several variables functions. Some of them
are: