Uploaded by UCBerkeley on 18.11.2009

Transcript:

Thank you for coming despite the terrible weather.

In some sense it is great weather, because when

it's sunny and warm you want to be outside.

But in this weather, I certainly don't

want to be outside.

So it's a perfect time to find absolute maxima and

minima of functions.

So why don't we do that?

I would like to pick up where I left off last time and discuss

the example of finding maxima and minima of a function on a

two-dimensional domain, which is a square.

And I think it's a good example to illustrate the main points

and also to make a transition to the next topic which we'll

discuss today, which is Lagrange multipliers.

So we have a function.

We have this function, and we have this domain

which I drew last time.

It is a square.

It is a square which is of length, each side is of length

2, which is situated on the plane like this.

So we need to find absolute maxima and minima

of this function.

And the way I explained this is you have to

follow an algorithm.

The easiest way to solve this problem is to

follow an algorithm.

And the first step is-- the first step of this algorithm

is to look in the interior of the domain.

OK?

Should I do this example or not?

OK, thanks.

All right.

I think I should.

I mean I will go quickly.

I know that this was in last week's homework and stuff and

even the quiz, I suppose but since I started doing it I

just wanted to go quickly over it again, OK?

So step 1 is-- step 1 is to look at the interior of the

domain and to calculate the partial derivatives.

I set the partial equal to zero.

OK?

I'm going to skip that.

I already did it last time.

But this is inside the domain.

So the next step-- that's very easy, right?

So you just get two equations and you solve them.

You find all the functions-- sorry, all the points which

satisfy this equations.

And this will be the first points on your list.

Next you look at each component of the boundary.

And on each component of the boundary, you can set one of

the variables to equal something.

So effectively your function restricted to that component

becomes a function of one variable.

So let's say if we take the part of the boundary which is

that top horizontal statement, that would y equal 1 and x

between negative 1 and 1.

When you set y equal 1 in the function, you effectively get a

new function-- let's call it g over x-- which is x squared

plus 1 plus x squared plus 4, which is 2x squared

plus 5, right?

squared plus 5.

And what you want to do now is to find the points, fine

the points, where g prime of x is equal to zero.

So this will be the points on the boundary which are

suspicious-- which are candidates for points of

maxima and minima, all right?

And I purposefully just look at the interior-- on the

interior of the interval.

So that means x is between negative 1 and 1 but

strict inequalities.

I didn't make this clear, I think, at the end of my last

lecture what exactly I wanted you to do at this step.

So I want to emphasize that's what I was talking about is

just finding the points with the derivative vanishes.

And then step 3 would be just to include the corners.

Now there is a different way to think about this-- there's a

different way to think about this process.

The way it is described in the book is there are two steps.

First step is exactly the same as what I explained.

But the second step would be to try to find the maxima and

minima of the function on each all of the components

of the boundary.

I mean they say on the entire boundary but effectively you

will have to split it into four parts, right, and do

it on each part separately.

Now, what would be involved in doing that?

So let's say when you look at this-- at my

component y equals 1.

When you look at y equals 1 you effectively get one-dimensional

problem, where you have the interval from negative 1 to 1.

And you want to walk find the absolute maxima and minima

of the function g over x equals 2x squared plus 5.

So you split this problem into two parts as well.

You look at the interior of the interval and you find points

where the derivative is zero.

Right?

And also you look at the end points.

So you have to look at the end points as part of the problem

of finding the maxima and minima on this interval, right?

So what I did in my presentation of this algorithm,

I have taken the end points and put them in separate list, or

sublist-- separate sublist-- so that in step 2 you would only

be doing part of the problem of finding the absolute maxima and

minima on this interval.

Namely the points where the derivative vanishes

in the interior.

And then you have to deal with the corners, corners being that

end points of those intervals anyway.

The reason I like this way of thinking about it is the

following: the same corner-- say this one-- will appear

twice as the boundary of this horizontal segment and

the boundary of this vertical segment.

So if you go by the book-- so to speak, right?-- so if you

actually literally solve the problem for each of these

segments you will have to deal-- you'll have to

include this point twice.

Whereas what I'm suggesting-- it's really a minor variation,

right-- but what I'm suggesting is that don't worry about the

corners from the beginning, just put them at the end.

Include them-- when I say include them I mean include

them in your list, your list of suspicious-- maybe let's not

use the word suspicious, let's use the word candidates--

candidates for maxima and minima, right?

So you include them anyway.

So let's-- in step 2 then I'm suggesting not to worry about

the corners, which could also be thought of as the end points

of the one-dimensional problems which you get on the boundary.

But instead just focus on the interiors of the boundary

components and just find the points in there where

the derivative is zero.

I think it's a slightly easier way to think about it.

But if you like, it's OK, you can do it in two

steps so to speak.

In other words, including the calculation of the

corners in step 2.

You just have to do the same work twice-- I mean you have

to deal with the same point twice in this example.

do you see what I mean?

Is that clear?

So that's what I meant.

I didn't really spell it out in detail.

That's what I meant at the end of my life when I

said there's three steps.

I don't mean by step 2 doing the entire problem of finding

the maximum and minimum on each interval.

I mean just doing part of that, namely finding the points

where the derivative is zero.

And then the end points you as a separate step.

So in this particular example, in this particular example, you

have a list of oh how many points?

Quite a few.

I mean I'm not sure I should actually write them down.

But you have some points which will have the real-- for which

some points where the partial derivatives will vanish

in the interior.

There will be some points in the interior of those segments

where the derivative vanishes.

And finally there will be the corners.

So I have what?

1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11-- I have 11 points.

But anyway, it's not a hundred.

It's going to be a small number of points and then what you do

is you just evaluate your function at those points and

you find for which points you get the maximum value and for

which points you get the minimum value.

It's possible there will be more than one point at which

the function will attain the same maximum value.

You just include both of them and say that the function takes

the maximum value at those two points or three

points or whatever.

OK?

Any questions about this?

All right.

So the reason why this problem was relatively easy was the

following: step 1 would be essentially the same

for all domains.

I mean, for step 1, no matter what your domain is, you'll

just have to calculate the points-- you'll just have to

find the points where the two partial derivatives vanish, and

then you have to make sure that that point belongs

to your domain.

So even if the domain has a complicated shape that's

not going to be so hard.

You just have to know which domain belong-- which points

belong to this domain and which points don't.

The problem would arise at step 2 if the domain is

not quite so simple.

So let me give you an example.

So another example is a domain where x and y are such that x

squared plus 4y squared is less than or equal to 1.

So let's sketch this domain.

First of all, when you sketch domains by the way, there is a

general rule which you probably have figured out already

a long time ago.

You first look-- you have an inequality-- you first look at

the corresponding equality.

So in this case you would have just x squared plus

4y squared equals 1.

And you sketch that.

And that, we know, that's an ellipse.

We've talked about ellipses.

So for instance, we have the point x equal 1, x equals

negative 1, and y equals zero.

And we also have the points y equals 1/2 and y equals

negative 1/2 two other points.

And the ellipse looks like this.

So it's kind of a circle which is squished.

So that's-- the equality gives you a curve.

And in fact we can recognize our old friend the level curve.

This is a level curve for the function x squared

plus 4y squared.

But if instead of equality you write inequality like this, it

means that in addition you include points for which the

values is not equal to strictly 1 but could be less than 1.

And that means that this points are going to be on 1 or the

other side of this curve.

In this case, it's easy to figure out which side.

Well first of all they're going to be on the same side.

And clearly, x equals zero and y equals zero satisfies this

equation-- this inequality for this formula.

And that's this point.

So clearly it should be inside and not outside.

That's how you figure out whether it's interior-- inside

or outside-- the interior part or the exterior part of the

plane, or inside the curve or outside the curve.

In this case it's inside.

And usually it's easy to figure this out.

So that's the domain.

That's what now plays the role of this square.

Let's suppose you're asked to find maxima and minima of

the following function.

F-- I'll make the function a little bit easier, linear

function-- 2x plus y.

So how would you solve this problem?

You approach it in exactly the same way.

Solve it in the same way.

Meaning that the first step would be to set f(x) equals

zero and f(y) equals zero and find all the points inside the

domain-- inside the domain, which satisfies

the two formulas.

So in this case it's very easy to see that there are no points

like this because in this case f sub x is equal to 2.

If 2 equals zero, no solutions.

And f(y) equals zero, also no solutions.

So in step 1 we don't get any contribution to the list of

candidates, list of suspicious points.

So everything boils down to the boundary.

And in fact there is no step 3-- no step 3 here because

there are no corners. the boundary is smooth.

So we'd like to understand this entire boundary

in sort of one shot.

How can we do this?

The idea in the previous analysis was to express the

restriction of our function to the boundary or maybe a

component of the boundary.

As a function of one variable.

Once it becomes a function of one variable, we could solve

the problem by just going back to the one variable calculus.

But in this case it's not so clear.

Because, well, in our problem it was very easy.

You just substituted y equals 1.

And of course, it's obvious that the function then becomes

a function of only one variable, namely x.

But here, what should you substitute?

Well one way to deal with this would be-- one possibility

would be to express one of the two variables on the boundary

in terms of the other.

So you could write, for example, x equals square

root of 1 minus 4y squared.

But there is a problem.

We have two solutions.

There's plus minus 1 minus 4y squared.

So in fact what we are doing is that we're taking this very

nice domain and we are artificially cutting

it here and here.

And so we're breaking it into two parts.

One would be where x is greater than zero or greater

than or equal to zero.

And the other one would be where x is less

than or equal to zero.

Once you do that, you have two components-- let me raise

this now-- two components.

One is x greater than or equal to zero and the other one

greater or equal to zero.

And then you can try to solve the problem of each component.

So you would have to-- you would get the function, say

g of y-- which is equal to 2 times square root of 1

minus 4y squared plus y.

This is in the case where x is greater than or equal to zero.

And a similar function with a negative sign, when x is

less than or equal to zero.

And you would have to find the points where

g prime of y is zero.

So you see that it was kind of nasty because the function's

not so nice and you split such a beautiful, a beautiful

ellipse into two parts.

And then you have to also deal with the end points.

There are no corners the way there were corners there,

because there they were actually not smooth.

They were singular, kind of points of singularity.

Here they are actually smooth points but we have sort of

artificially made them singular, made them look like

corners, because now they appear as the end points of

the-- they appear as the end points of these segments-- two

parts of the boundary.

So it is possible to solve the problem in this way but it just

not a very elegant solution.

So it is natural to try to find a more elegant solution.

And the more elegant solution-- so then there would

still be step 3.

Actually in this approach there will be a step 3 which will

correspond to points x equal zero and y equals plus/minus

1/2, a kind of fake corners.

So what I'm going to explain now is a more elegant

solution which is called Lagrange multipliers.

It's a more elegant solution which is applicable to even

more sophisticated functions.

In this case, actually we're dealing with a fairly

simple level curve.

Or with a level curve of a fairly simple function.

And that's why we can actually get away with expressing one

of the variables in terms of the other.

But let's say-- let's suppose-- you have instead of this, you

have something like x squared plus 4y squared plus xy plus--

you know some-- plus y plus square root of x.

So then try to express now y in terms of x or x in terms of y.

That would be very complicated.

That alone-- even if you can, then you have to

take derivatives and stuff like that.

So that's very complicated.

But it turns out that is actually a much more

straightforward and elegant method.

And that's called Lagrange multipliers.

And actually I would like to explain this method

first geometrically.

So let me look again at this picture.

This picture-- the domain D consists of two parts again.

It consists of the interior of the ellipse and

the ellipse itself.

But we have just seen that on the interior of the ellipse

there are no interesting points for us.

There are no candidates for maxima and minima because our

function is such that there are no points where both

partial derivatives vanish.

So actually we can forget about the interior.

The problem boils down to understanding the maxima and

minima of our function restricted to just

the boundary.

There's nothing interesting for us inside.

So that's why I'm raising-- I'm sort of raising this and I am

just focusing on the boundary.

And now I would like to approach this problem

geometrically.

In other words, we have discussed several

times level curves.

And for example we recognize this curve itself as a level

curve of some function.

And I said many times that if you know the level curves of a

given function you can understand much better the

shape of this function.

So let's see how it works in practice in this problem.

The boundary is given by an equation.

So the boundary is a level curve.

That's a given.

But we also have a function.

So you have to separate the two things that you have.

First of all, you have the equation-- this is the

equation of the boundary.

But you also have a function.

And so what we can do is we can also look at the level

curves of this function.

Here we are looking at just one level curve.

Why only one level curve?

Because this is the equation.

The equation is that this is equal to 1.

So we are just looking at one level curve for this function

f-- I'm sorry, for this function which let's call it,

let's call it something else-- let's call it h.

I don't want to use g because I already used g for this other.

So this is function h.

For the function h this is only one level curve which

shows up in this problem.

We don't care about other level curves.

But for the function f we would like to find maxima and minimum

values-- maximal and minimal values-- and we don't know

yet where there are.

So we should look at all level curves for that

second function f.

So what are- what do this level curves look like?

So the level curves 4f of xy equal 2x plus y are given by

equations 2x plus y equals k where k is some number.

So for instance, one possibility is say k is equal

to zero. 2x plus y equals zero.

eq question No?

All right.

2x plus y equals zero.

What is 2x plus y equals zero?

It's the same as y equals-- maybe it's easier if I write

it like this like this: y equals minus 2x.

That looks like this.

That's the line which goes through zero with a

slope with negative 2.

And then the if you look at other lines like

this, for example.

For instance, 2x plus y equals 1.

This is just y equals minus 2x plus 1.

So that's going to be a line which is parallel to this one

but goes through the point 1.

So it's actually in this case, it's going to cut

here at the point 1/2.

So I should do it more precisely like this.

And so on.

Well my lines are not perfect yet.

My lines are not perfectly aligned but I hope you see

what I'm doing anyway.

And what about other level curves?

Of course all other level curves are going to be

just parallel lines.

I mean we are so sophisticated now we even understand

questions of planes in 3-space.

For sure we understand equation of lines on the plane.

This looks like a typical equation-- well, if we had a z

that would be typical equation for a plane in 3-space.

Because we don't have a z it's even simpler.

So instead of a plane in 3-space we get a line in

two-dimensional space on the plane where 2,1

is a normal vector.

So when you vary this number k, you're going to get a system of

parallel with such normal vector.

So I've drawn two of them but now it I'll draw more.

Here's another one.

Yeah, i kind of messed it up.

We can do it again.

I do want to emphasize the fact that this is a

system of parallel lines.

OK.

Let's see if I can do it better this time.

So we got this, we got this, and this.

And now I want to draw a master line which is

going to be this one.

I keep hearing voices as if there is some color commentary,

sort of to my play-by-play.

I don't mind but I'm sure that people sitting next

to you probably mind it.

OK.

So you have a system of lines like this, OK?

So I think it's more clear now.

OK.

So let's say that this one goes through zero.

What is special about each of these lines?

What is special is that along each of these lines the value

of the function f is the same.

That's the function which we want to maximize, minimize.

We would like to find maximum, minimum values

of that function.

And now what we-- what this picture tells us, it tells

us the lines of equal value for that function.

So for instance, the function is going to be equal to

zero along this line.

The function is going to be equal to-- where

is my master line?

This is my master line-- the function is going to be

equal to 1 along this line.

The function is going to be equal to say

1/2 along this line.

Maybe 3/4 along this line, 1 1/4 along this line and so on.

Whatever line you want you can draw.

And it's going to be one of these parallel lines.

But this the level curves for the function on

the entire plane.

But for now we are interested in step 2, so we're looking--

because step 1 did not produce anything anyway.

So we're looking just at this ellipse.

So what we're really interested in is the values of the

function on the ellipse.

AUDIENCE: [INAUDIBLE]

Oh sorry.

Not this one, this one.

The one that goes through zero.

It's just bad drawing.

It's a bad drawing day.

But this one. you see what I mean. 1/2 would be then--

this is not 1/2, it's like 1/3-- sorry.

But you see what I mean.

It's not to scale, it's not parallel, it's not-- it's a

very qualatative picture, not very precise.

Kind of abstract like Picasso, the way Picasso

would approach it.

But I hope you see what I mean.

As you vary-- the point is when you go to the next one the

right, the value will increase.

When you go to the next one to the left, the value of

function will decrease.

But we are interested not in the values everywhere, we are

only interested in the value on and the values on that ellipse.

So in particular, we now know that here and here, that

these are the two points of intersection of the line f

equals zero with out ellipse.

Here and here the function is equal to zero.

Here and here, the function is equal to 1.

Here and here it's equal to 1/3 and so on.

So now how can we use this picture to find where the

function takes the maximum or minimum values.

Well the point is that for sure, we can see from this

picture it's quite clear that f equals zero is not.

It's neither maximum nor minimum.

Why?

Because this line we can move both to the right

and to the left.

When we move it to-- and we'll still have some points of

intersection with our domain-- not with our domain, sorry--

with the ellipse, with our curve.

If I move it even by 1/3-- I can move it even by 1/3 and

I still have intersection.

So that cannot be maximum value because here the value

is zero, here it's 1/3.

Likewise, say this one would be f equals negative 1/3.

So from here I could also move along this curve to the left.

So I can shift my parallel line to the left,

decreasing the value.

So here is neither maximum nor minimum.

Likewise this is neither maximum or minimum.

This one is not, this one is not.

So at first it looks like neither of them

maximum or minimum.

So the point is to find the points where this qualitative

argument doesn't work, where it breaks down.

We have to find the points where you could no longer move

it to the right or to the left.

In what sense?

Of course you could always move it to the right and to the

left, but you could not move it and still have a non-empty

intersection with your curve.

For example, you have a line like this, which is one

of their level curves.

But it's totally outside of the scope of our problem.

In other words, there's no intersection between this line

and this curve that we're interested in, right?

So those guys are excluded from our consideration anyway.

And so the point is to find the points where you can not move

to the right or to the left and still have non-empty

intersection.

And the point-- for most of these lines, for all of the

lines which I have drawn, you have two points

of intersection.

So as you move further and further and further to the

right-- let me just make it slightly more

realistic looking.

Did somebody have a question?

No?

OK.

As you move it, you still get two points, two points, two

points, two points, two points and then boom, there this one

point that something happens.

At this point, the intersection instead of getting two points,

you get just one point of intersection.

In other words, give up two points, two points, two points,

two points, but they get closer and closer to each other and

they converge at this point.

And suddenly you get a very special point.

This point is a point where the intersect is just this one

point and not two points.

Right?

You see what I mean?

It's just one point.

And likewise on this end, the same time.

You have intersection of two points and two points and the

next one's two points-- I'm just unable to do parallel

lines today-- two points, two points.

And then suddenly, boom!

There is this one line where it intersects with one point only.

And for these two points my previous argument

actually breaks down.

You see because when I was here I could argue that this is not

the maximum or minimum because I could move this way,

I could move that way.

And when I move a little bit, I would still have a non-zero--

non-empty intersection with my curve.

But when I'm here, I can certainly move it to the left

so I could decrease the value.

But I cannot move it to the right.

It's kind of frozen here.

What do I mean that I cannot move it to the right?

Of course I can.

I can move it, but if I move it by any small amount-- epsilon--

or any other small amount, there will still be a line like

this but it will not intersect my curve.

So that second argument breaks down.

So that means that this is the maximum value.

Because I cannot move it anymore to the right.

You see what I mean?

And likewise this is the minimum value because

I cannot move it any more one to the left.

So what are these special points?

Well, now we know enough about level curves and things like

that to realize that these are precisely the points where

this line is tangent to this level curve, to this curve.

So this is the case when this is a tangent line.

And likewise, this is the case where this is a tangent line.

And what does it mean that this is a tangent line?

It means that the normal vector-- the normal vector

to this level curve is proportional to the normal

vector of this line.

But what on that the maxima and minima are the points where the

level curve of the function f of xy, which in this

case is 2x plus y.

In this case, the level curve are lines.

But in general, they're not going to be lines, they're

going to be some curves.

Where the level curve of the function which we want to

maximize and minimize for f is function that we want

to maximize or minimize.

OK?

If it's tangent to our curve, which is this h

of xy which equals 1.

So I think that x squared plus 4y squared equals 1.

So that's the given curve.

So the point is in this exercise we are given two

functions, but they play very different roles.

There's a function f and there's a function h.

Function f is the function we want to maximize and minimize.

Therefore we kind of look at all level curves of that.

h is the second function.

It's the function which defines the curve and also defines the

domain if our original problem was about the interior of that

curve, the way I started talking about it.

For this function, we all need this in one level curve.

That's our-- that's the curve where we want to

maximize/minimize f.

But the interesting thing happens the places where

the two curves are tangent to each other.

And what does it mean that they are tangent to each other?

Well, in this example, the level curves of f are lines, so

it's clear what we mean by saying that it's tangent

because we talked about tangent lines.

But in general, if you have two different curves, there is also

an obvious notion of being tangent.

It just means that tangent lines to them coincide.

Or in other words, normal lines to both of them coincide.

So this means, in other words, the normal lines-- normal

lines-- so we are points where the level curve, curve of f,

is tangent to our curve.

Maybe I should say these are the point-- these are the

points where the normal lines to f of xy equals k and h of

xy equals 1, are the same.

Coincide.

So how do we check that?

Well we know that the normal lines are given by the

direction vector-- the normal vector to that curve,

which is a gradient.

So equivalently, we can say that the two gradient vectors

should be proportional to each other.

These are the points x,zero y,zero such that nabla f at

x,zero y,zero is equal to lambda-- that's nabla--

h for x,zero y, zero.

And that's the equation that will give us those points.

What is the meaning of this equation in our case?

In our case, at this point we have a normal

vector to the line f.

f equals k.

That's vector That's the vector 2, this is nabla f.

Actually, nabla f is very easy to find, it's 2,1.

It's the vector 2,1.

There are two parts to the derivative.

So our f are 2 and 1.

Let me do it in red so that everything isn't

the same color.

This is nabla f at this point.

And nabla h is the vector which is normal to the curve.

So in this case, the curve goes like this, so the normal vector

will go like this, right?

So that's nabla g.

Clearly at this point they are not proportional to each other.

And they shouldn't be because these are not the points where

the two level curves are tangent to each other.

They intersect with a certain angle.

They are not tangent.

But at this point, the two normal vectors suddenly become

proportional to each other.

So this is one and this is the other.

They become proportional you see.

So that, and likewise same thing here.

That's why at these points we get maxima/minima values.

So let's find these points.

Are there any questions about this first of all? of all?

What's lambda?

Very good point.

So lambda is the coefficient of proportionality.

We don't know it a priori.

this is, by the way, a very important point because

some of you made this mistake on the midterm.

If you have two vectors, each vector spans the line, right?

So what is the property of the two vectors for the

lines to be the same?

ehy It's not that they are equal, right?

They should be proportional.

If you have a vector you vet a line.

But if you take twice that vector you get

the same line, right?

So the condition is much weaker.

It's a condition is that the two vectors are proportional

to each other.

And then the two line coincide.

And what does it mean that they are proportional?

Proportional means that one of them is equal to some

number times the other.

And we call that number lambda.

So that's the coefficient of proportionality.

Which in this case is called Lagrange multiplier.

La-- I'm thinking of another name-- Lagrange multiplier.

And this method of finding these external points is called

Lagrange multiplier method, after a great French

mathematician, Lagrange.

So, any other questions?

You'll probably see it better when we do it in this example.

So let's see what this equation look like in this example.

So what is nabla f?

Nabla f consists of two partial derivatives of f.

And those are easy to find it, they are just 2 and 1.

And what's nabla h?

These are the two partial derivatives of that function,

x squared plus 4y squared.

So it's going to be hx and hy are going to be 2x and 8y.

And now the equation that we want is that 2,1-- that's the

left hand side-- is equal to lambda times 2x 8y.

More precisely, since here I wrote x,zero y,zero, let

me stick to that and let's write also x,zero y,zero.

But of course when you do your calculations you will

oftentimes just write x and y without worrying about

the index zero.

Here I write x,zero and y,zero to emphasize that we are

talking about a particular point x,zero y,zero and not

about general values x,y.

So, but this is not just one equation.

This is a system of two equations because you have two

components of these vectors.

So there are two equations. 2 equals lambda times 2x and 1

equals lambda times x,zero y,zero.

So let's remember one thing which is that

lambda is unknown.

We are saying that they're proportional to each other,

but we don't know what the coefficient of

proportionality is.

So what we need to do here is to solve these equations for

x,zero y,zero and also lambda.

But you shouldn't forget that actually you are solving it not

on the entire plane but you are solving it on that curve only.

We are interested only in that curve now.

That's very important.

So far, I've written the equation as though x,zero and

y,zero were an arbitrary point.

But in fact, it should be a point on that curve.

So we have to add one more question, which is the

equation of that curve.

Which is x squared-- x,zero in this case plus 4y

squared equals 1.

So in fact you have three equations.

Three equations on three variables. x,zero

y,zero and lambda.

If you just tried these two equations, it would mean

that you look everywhere.

You have to constrain x and y, or x,zero y,zero by

imposing various original equations on this curve.

OK, so now it looks more meaningful because you've

got three equations and three variables.

And usually, generically, if you have three equations with

three variables, you're going to get one solution or you

get fininite many solutions.

That's what's going to happen here.

Let's find a solution.

So first we can express x,zero and y,zero in terms of lambda

and then we can substitute x,zero and y,zero into the

third equation and we'll get one equation in lambda.

That's the simplest way to solve it.

So from the first equation we find that x,zero

is 1 over lambda.

For the second question we find that y,zero is 1 over 8 lambda.

And now we put them into the third equation and we get one

over lambda squared plus 4 times 1 over 8 lambda

squared equals 1.

So what that means 1 over lambda squared times 1 plus

4 divided by 64 equals 1.

And that's going to be what? so this is 1/16 so that's 17/16.

And from this you find that lambda squared

is equal to 16/17.

Right?

Is that right?

17/16.

Right, because you take it to the other side. 17/16.

And that means that there are two solutions to lambda.

Lambda is equal to plus or minus square root

of 17 divided by 4.

Once we know what lambda is, we can find x,zero

and y,zero, right?

So now we find x,zero is equal to 4 over of square root of 17.

Well, there are two points.

First of all there are two points.

So for each, let me do it slowly-- let

me not in mess it up. each solution for lambda

will correspond to a particular point.

And of course we kind of know that we should get two points,

we expect to get two points.

One will be the maximum and the other one will be the minimum.

So the first point is lambda equals square root of 17 over

4. x,zero is 4 over square root of 17.

And y,zero is 1 over 8 of that.

So it's 1 over 2 square root of 17.

The second point, so that's this point, that's this point.

And the other point is going to be with the minus sign.

4 over square root of 17 minus 1 over 2 square root of 17.

So lambda equals minus square root of 17 over 4, x,zero is

minus 4 square root of 17, y,zero is minus 1 over

2 square root of 17.

And well in this particular case it's clear which one is

the maximum and which one is the minimum because we can see

it clearly on the picture.

But in general, it may not be so obvious.

So what you should do is just substitute them

into your function and calculate the value.

And the one for which the value is bigger will be the maximum

and the one for which the value is smaller it will

be the minimum.

So in this case, f is going to be you have to substitute in 2x

plus y, so that's going to be 2x plus y, so it's like 8--

it's going to be 1 over square root of 17 and here

will be 8 plus 1/2.

So that's going to be 17-- square root of 17 over 2.

So you've got 17 over that's the square root.

And here the function will be just-- actually, in

this case-- ah, but here see I was too fast.

I kind of-- Oh no, it is correct. that's right.

So it will be just minus of that.

Minus square root of 17 over 2.

So it's clear that that's the maximum, that's the minimum.

So that's that solution.

Yes?

AUDIENCE: [INAUDIBLE]

That's right.

They point in the same direction.

So lambda is positive.

Right.

When it's the minimum they should be opposite.

Exactly.

It's always like this.

If they are pointing in the same direction,

that's the maximum.

So that's actually a very good question.

That we can actually tell by looking at in which direction

the two vectors are pointing. or, in other words, what

is the sign of lambda?

If the sign of lambda is positive, it's the maximum.

If the sign of lambda is negative it's the minimum.

And geometrically actually, I did not draw the picture, but

the picture would be like this: this would be the-- y would be

the gradient vector for the function g.

But now the red one will now go this way, but

red is always the same.

It's 2,1.

The So that's nabla f.

And here they are opposite whereas here they are pointing

in the same direction.

Any other questions about this?

AUDIENCE: [INAUDIBLE]

Right.

That's right.

So-- that's right.

So you're talking about-- the question is about the level

curve, the red ones, right?

So the red lines are the level curves of the function

f, which is 2x plus y.

So we could look at the equation f equals

2x plus y in 3D.

That would be z equals 2x plus y.

So in fact, because this function is linear, the

graph of this function would be a plane.

And that plane actually can be very easily

illustrated right here.

So let's say it's this plane.

So now, what are the level curves-- I'm going back to

the same drawing which I did a couple of lectures ago.

What are the level curves?

Well you can draw-- draw level curves are just the sections

which are parallel to the floor, sections by planes which

are parallel to the floor.

So here's one of them, here's another.

I should have drawn in red, but you see what I mean.

This is will be-- so this is the level curve

for the function f.

But again, you think of the level curves as

being part of the graph.

But you can also think of them of their projections

into the x-y plane.

So what I had drawn on that picture are the projections

onto the x-y plane.

So I'm writing them as equations f equals something.

For example, f equals zero, or f equals 1/3 or f equals

negative 1/3, f equals 1 and so on, right?

If you like, you can think of the plane, x is the graph of

this function, as one which is obtained by assembling those

lines but in the kind of-- each time you have to lift it by

the height which is equal to the value of the function.

So only this one-- this one, the one for which x is equal

to zero, would remain on the x,y plane.

And this one will be elevated and those guys will be

lower, just like here.

So let's say the x-y plane-- if the x-y plane were so that this

one was-- let's say this one is f equals zero.

So then this one-- let's say this is f equal 1/3 and so on.

So they're all elevated.

Thnk of like sort of a staircase.

I mean it's not like a staircase, but if you think of

just a step 1/3, then this is a collection of lines.

But it doesn't make sense to draw them in 3D because they

are given by equation on just x and y.

So that's why I'm drawing their projections into the

floor, onto the x-y plane.

You see?

AUDIENCE: [INAUDIBLE]

That's right.

That's a very good question.

So, so far we were lucky that the domain, or more precisely,

the boundary of the domain, was presented by one equation only.

So now I would like to for the last example, look at kind of a

mixed case where you could have some components in the boundary

and some of them could be linear and some of them

could be curved like this.

So let's say a very simple generalization of this example

would be-- a very simple generalization of this

example would be if your domain was like this.

So you could say, in this case the domain would be x,y such

that x squared plus 4y squared is equal To 1.

And x is greater than or equal to zero.

Oftentimes you will have domains like this.

So in this case, we actually do have two corners.

And we actually have two natural components

in the boundary.

This is one component-- the first component-- and this

is the second component.

So how would we solve the problem of finding maxima and

minima for function f if our domain is like this.

So let's say we're given some function f of x,y. of x,y.

I don't want to specify what this function is.

It could be this function 2x plus y or it coule be a

more complicated function.

Well, very simple.

Now we know everything that we need know to know for this.

y, that's right.

You're right.

Thank you.

y greater or equal to zero.

X greater or equal to zero would be this part.

That's right.

So you approach it the same way as before.

Three steps.

The first step would be to find the points in the interior

where both partial derivatives vanish.

So step 1, you said f sub x is zero, f sub y is zero inside.

That's just like before.

Well of course if our function is just 2x plus y then there

are no such points so we don't have to worry about it.

But in general there might be some points in the interior.

So we would collect those points.

So we don't do anything yet.

We just collect them and put-- so we start a list.

And these points will be on the list.

And then we do step 2.

Now in step 2, you have two components.

So you want to do the first component, means that you

do Lagrange multiplier.

But out of all the points we find we only pick those points

for which y is greater or equal to zero.

Let's start with the second component.

Sorry.

Right.

The curvy component.

AUDIENCE: [INAUDIBLE]

No, I meant that as a component.

This is the second component.

I'm just doing it out of order.

Instead of doing the first component first and second

component second, I do second component first.

Because I don't want to switch the board.

But now I've spent more time than I would have just

switching the board than changing the first and second.

Anyway, I think you know what I mean now, right?

So we do this one.

And this is exactly the same calculation as

we've done so far.

Well, it would have been exactly the same if our

function were just 2x plus y.

But we to have to take care of the fact that now, the curve

that we're talking about is not the entire as before

just the top part.

So we are only interested in the points we fine which

belong to this part.

So if you find something here, you discard it.

It's not part of our problem.

So in this case, if the function were indeed 2x plus y,

instead of finding two points we would find just one

point, which is this one.

Only this one.

And this one we discard.

So it means that at the end of your calculation

you have to check.

When you find your point, you have to check.

Is it true that y,zero is greater or equal to zero?

For this one it's true but for this one it's not.

So then we discard this one.

So that's what you do for the second component.

For the first component, by which I mean just the segment

from negative 1 to 1, you do it in the old fashioned way.

You just substitute the function-- you just substitute

y equals zero into your function.

You get a function of one variable, you

find the derivative.

You find the points where the derivative vanishes.

So the first component is like this.

From 1 to negative 1.

And you just substitute y equals zero and get a

function of one variable.

Function g of x and then solve the g prime over x equals zero

when x is between negative 1 and 1.

And finally you have step 3, where because I ignored in my

intermediate calculations up to now, I ignored

the two end points.

I have ignored the end points.

For the same reasons that I explained before.

I don't have to ignore it.

I could also take them into account, but then I might

have to include each of them twice in my analysis.

So I personally would just take the two corners separately.

So that's x equals 1, y equals zero and x equals negative

1 and y equals zero.

So put them on your list.

And finally you assemble this list.

So You've got points from step 1, step 2, each of the

two components in step 3.

And then you calculate the value of your function at each

of these points and you find which ones correspond to

the maximum and minimum.

And that's how you solve it.

That's right.

So the question is, does Lagrange work for the

inside of the domain?

Lagrange is specifically for the boundary, for the curve.

So it doesn't tell us anything about the inside.

Because this argument would break down it being.

Because for being tangent you would to have two curves.

And one of the curves would be the level curve of function f

and the other one would be your boundary.

If you weren't looking at the boundary then we wouldn't be

able to argue in this way, right?

But you have to realize that there are two ways in

which Lagrange methods may enter a given problem.

Essentially there are problems of two types.

The problems where you have to maximizer and minimize

functions on the two-dimensional

domain like this.

So this is usually domains which will be given by some

inequalities like x squared plus 4y squared less than or

equal to 1 or that coupled with y greater than

or equal to zero.

Such a problem consists of several steps.

You have to look at inside and you have to look

on the boundary.

But sometimes there could be a problem where you're just asked

to maximize or minimize your function on the boundary only.

In other words your set is given not by the

inequalities, by equalities.

Like you could be asked specifically to find the

maximum/miminum of your function 2x plus y on that

curve, on that ellipse.

So in that case if in the problem there is no reference

to the interior of that curve, you don't need

to do the first step.

You can skip the first step because you don't need

to know what the partial derivatives and so on.

So you just start with step 2.

You just look at the corresponding curve.

AUDIENCE: [INAUDIBLE]

This function?

You mean g prime as a constant or g as a constant?

If g prime is a constant it means that there's

a linear function.

So your question is about function one variable, right?

What does it mean if you have function one variable whose

derivative is a constant?

It means that the function is a linear function.

ax plus b.

And of course that your function does not have

any extrema points.

Any points of local maxima and minima because the derivative

is nowhere vanishing.

So in that case step 2 will be vacuous.

So if you that as your function, the maxima and

minima it will obtain will happen at the end points.

And the end points are the ones we're including anyway.

AUDIENCE: [INAUDIBLE]

That's right, that's right.

In fact, in this particular case that's what will happen.

If f were-- actually 2x plus y and we said y equals zero, in f

is 2x plus y and then we said y equals zero then g becomes 2x.

So it is going to be a linear function like this.

So in that case, there will be no points coming from

this segment-- from the interior of this segment.

The only points will come from the end points of that segment.

In the way I approach this, those points will show

up in the third step.

But if you like it, you could include them in step 2 as well.

So now I would like to discuss one possible application

of all of this.

The application of a kind of practical problem where you

have to-- a kind of optimization problem where you

would like to optimize-- maximize or minimize

certain quantity under various constraints.

So here is a very representative example of that.

So we are building a rectangular box from cardboard.

And we would like to maximize the volume of the interior of

that box given that we can use so much cardboard.

So it's a very natural question one could ask.

So build a rectangular cardboard-- a rectangular

box without a lid.

From 12 square meters of cardboard so as to

maximize its volume.

OK, so here you are asked to build a box like this.

And what does it mean build a box?

You have to decide what the dimensions will be.

So this will be x, this will be z, and this will be y.

You have to decide what the dimensions are given that

you have so much cardboard.

So that means that you have to maximize-- we translate it into

mathematics, we say that means we must maximize the volume.

And volume is a function of x, y, z, which are the

dimensions of the box.

And it's just x times y times z.

And we have a constraint.

So we have function h of x,y,z which is the amount of cardbox.

So how much cardbox do we use?

Well, this box has six sides with cardboard everywhere

except for the top.

So we'll get two rectangles of which size for

the vertical ones.

But the horizontal-- there will be only one

horizontal rectangle of dimensions x and y.

So the function will be x times y.

That's the area of this rectangle and it's not doubled

because we don't have the lid as we are asked

in the exercise.

But for the other guys we'll have to double. yz and xz.

So it's xy plus 2xz plus 2yz.

That is a total area of cardbox that is used for this, such

a box, and that has to be equal to 12.

So this is a very-- we get exactly in the situation

of the problem about Lagrange multipliers.

Except now we have three variables to begin with.

In my previous problem we only had two variables, x and y.

So we had a function f in two variables and we had a

function h in two variables.

Now the role of x is played by the function v and the role of

h is played by this function.

H and three variables.

But we will approach this problem in

exactly the same way.

Namely, we will say-- we will apply Lagrange method.

OK by Lagrange method-- Lagrange multiplier-- that

means that we are looking for points x,y,z.

I will skip x,zero y-zero z-zero.

Just call them x y z to simplify notation.

Such that nabla v-- the gradient of the function

v-- is proportional the gradient of the function h.

I remember the [UNINTELLIGIBLE]

equation h equals 12.

So remember it's not enough to write down the equation of

proportionality-- I'm sorry, I should have written nabla.

It's not enough to write down this equation, which is the

equation of proportionality of the two normal vectors or

the two gradient vectors.

You also have to write down the equation you

constrained separately.

Otherwise you will not find unique solution or you will not

find fundamental solution.

So what do these equations look like?

Well, nabla v is obtained by taking all partial derivatives

of that function.

So what are they?

Well, with respect to x it's yz, with respect to y it's xz,

with respect to z it's xy.

And here we have lambda times the partial derivative of h.

And what is the partial derivative of h?

It's 2xz plus xy.

What am I -- what am I writing?

Sorry, I'm looking in the wrong place.

So it's y-- where did I write this?

2z-- OK.

2z plus y-- 2z plus y, 2z plus x and 2x plus 2y.

OK?

So now of this means that we have a bunch

of equations actually.

Let me see.

How much time do I have?

OK, great.

We have just enough time.

OK, so this is going to be three equations.

Yz equals lambda times 2z plus y. xz equals lambda times

2z plus x. xy is equal to lambda times 2x plus 2y.

Let's not forget this equation too.

So that will be 2xz plus 2yz plus xy equals 12.

So we've got a system of four equations with the variables

x, y, z and lambda.

So that's good.

We have four occasionals and four variables.

So at first it looked kind of intimidating.

But actually-- but actually it's not so bad because, for

example, the way we can approach this is the following.

For example, note that from this formula we know that x,

y, z equals lambda x times 2z plus y.

But also from this formula x, y, z-- so maybe I should try

it like this-- is lambda times y 2z plus x.

From this formula now I multiply by z.

This line I multiply by x, this line I multiply by y and

this line I multiple by z.

And the left hand side is the same.

But on the right hand side I get different formulas.

So here I get lambda z times 2x plus 2y.

OK?

So for this example, we can then equa-- we can look at

this equation, for example.

So we have then 2x-- so we got lambda times 2x plus-- 2xz plus

xy is equal to lambda times 2yz plus xy.

2yz plus xy, right?

So now the point is that actually we can assume that--

we can show that lambda is not equal to zero.

Because if lambda were equal to zero, then

nabla v would be zero.

And nabla v being zero means that one of the three

variables is zero.

But of course the cardbox has non-zero comp-- dimensions.

All of the three dimensions are non-zero.

So lambda is non-zero since nabla v is

assumed to be non-zero.

So therefore we divide but lambda, we could move lambda.

And so now you see what happens is that now xy appears

and we get 2xz and 2yz.

And again, since z is not equal to zero because of course

otherwise it's not a box if z is equal to zero.

This implies that there actually x is equal to y.

So that's the first formula that you have.

And then if you substitute, in a similar way we also find

that's why is equal to 2z.

So that means that you've got two relations between x, y, z.

x is equal to y and y is 2z.

So that means that x is equal to y and x is equal to 2z.

So now you can substitute them into this last equation.

Put this too in the last equation.

You can express all of them in terms of z.

Find z, find x, find y.

That's the solution.

So the end result is x is 2, y is 2, z is 1.

And that's the box that you get.

All right, see you on Thursday. the on Thursday

In some sense it is great weather, because when

it's sunny and warm you want to be outside.

But in this weather, I certainly don't

want to be outside.

So it's a perfect time to find absolute maxima and

minima of functions.

So why don't we do that?

I would like to pick up where I left off last time and discuss

the example of finding maxima and minima of a function on a

two-dimensional domain, which is a square.

And I think it's a good example to illustrate the main points

and also to make a transition to the next topic which we'll

discuss today, which is Lagrange multipliers.

So we have a function.

We have this function, and we have this domain

which I drew last time.

It is a square.

It is a square which is of length, each side is of length

2, which is situated on the plane like this.

So we need to find absolute maxima and minima

of this function.

And the way I explained this is you have to

follow an algorithm.

The easiest way to solve this problem is to

follow an algorithm.

And the first step is-- the first step of this algorithm

is to look in the interior of the domain.

OK?

Should I do this example or not?

OK, thanks.

All right.

I think I should.

I mean I will go quickly.

I know that this was in last week's homework and stuff and

even the quiz, I suppose but since I started doing it I

just wanted to go quickly over it again, OK?

So step 1 is-- step 1 is to look at the interior of the

domain and to calculate the partial derivatives.

I set the partial equal to zero.

OK?

I'm going to skip that.

I already did it last time.

But this is inside the domain.

So the next step-- that's very easy, right?

So you just get two equations and you solve them.

You find all the functions-- sorry, all the points which

satisfy this equations.

And this will be the first points on your list.

Next you look at each component of the boundary.

And on each component of the boundary, you can set one of

the variables to equal something.

So effectively your function restricted to that component

becomes a function of one variable.

So let's say if we take the part of the boundary which is

that top horizontal statement, that would y equal 1 and x

between negative 1 and 1.

When you set y equal 1 in the function, you effectively get a

new function-- let's call it g over x-- which is x squared

plus 1 plus x squared plus 4, which is 2x squared

plus 5, right?

squared plus 5.

And what you want to do now is to find the points, fine

the points, where g prime of x is equal to zero.

So this will be the points on the boundary which are

suspicious-- which are candidates for points of

maxima and minima, all right?

And I purposefully just look at the interior-- on the

interior of the interval.

So that means x is between negative 1 and 1 but

strict inequalities.

I didn't make this clear, I think, at the end of my last

lecture what exactly I wanted you to do at this step.

So I want to emphasize that's what I was talking about is

just finding the points with the derivative vanishes.

And then step 3 would be just to include the corners.

Now there is a different way to think about this-- there's a

different way to think about this process.

The way it is described in the book is there are two steps.

First step is exactly the same as what I explained.

But the second step would be to try to find the maxima and

minima of the function on each all of the components

of the boundary.

I mean they say on the entire boundary but effectively you

will have to split it into four parts, right, and do

it on each part separately.

Now, what would be involved in doing that?

So let's say when you look at this-- at my

component y equals 1.

When you look at y equals 1 you effectively get one-dimensional

problem, where you have the interval from negative 1 to 1.

And you want to walk find the absolute maxima and minima

of the function g over x equals 2x squared plus 5.

So you split this problem into two parts as well.

You look at the interior of the interval and you find points

where the derivative is zero.

Right?

And also you look at the end points.

So you have to look at the end points as part of the problem

of finding the maxima and minima on this interval, right?

So what I did in my presentation of this algorithm,

I have taken the end points and put them in separate list, or

sublist-- separate sublist-- so that in step 2 you would only

be doing part of the problem of finding the absolute maxima and

minima on this interval.

Namely the points where the derivative vanishes

in the interior.

And then you have to deal with the corners, corners being that

end points of those intervals anyway.

The reason I like this way of thinking about it is the

following: the same corner-- say this one-- will appear

twice as the boundary of this horizontal segment and

the boundary of this vertical segment.

So if you go by the book-- so to speak, right?-- so if you

actually literally solve the problem for each of these

segments you will have to deal-- you'll have to

include this point twice.

Whereas what I'm suggesting-- it's really a minor variation,

right-- but what I'm suggesting is that don't worry about the

corners from the beginning, just put them at the end.

Include them-- when I say include them I mean include

them in your list, your list of suspicious-- maybe let's not

use the word suspicious, let's use the word candidates--

candidates for maxima and minima, right?

So you include them anyway.

So let's-- in step 2 then I'm suggesting not to worry about

the corners, which could also be thought of as the end points

of the one-dimensional problems which you get on the boundary.

But instead just focus on the interiors of the boundary

components and just find the points in there where

the derivative is zero.

I think it's a slightly easier way to think about it.

But if you like, it's OK, you can do it in two

steps so to speak.

In other words, including the calculation of the

corners in step 2.

You just have to do the same work twice-- I mean you have

to deal with the same point twice in this example.

do you see what I mean?

Is that clear?

So that's what I meant.

I didn't really spell it out in detail.

That's what I meant at the end of my life when I

said there's three steps.

I don't mean by step 2 doing the entire problem of finding

the maximum and minimum on each interval.

I mean just doing part of that, namely finding the points

where the derivative is zero.

And then the end points you as a separate step.

So in this particular example, in this particular example, you

have a list of oh how many points?

Quite a few.

I mean I'm not sure I should actually write them down.

But you have some points which will have the real-- for which

some points where the partial derivatives will vanish

in the interior.

There will be some points in the interior of those segments

where the derivative vanishes.

And finally there will be the corners.

So I have what?

1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11-- I have 11 points.

But anyway, it's not a hundred.

It's going to be a small number of points and then what you do

is you just evaluate your function at those points and

you find for which points you get the maximum value and for

which points you get the minimum value.

It's possible there will be more than one point at which

the function will attain the same maximum value.

You just include both of them and say that the function takes

the maximum value at those two points or three

points or whatever.

OK?

Any questions about this?

All right.

So the reason why this problem was relatively easy was the

following: step 1 would be essentially the same

for all domains.

I mean, for step 1, no matter what your domain is, you'll

just have to calculate the points-- you'll just have to

find the points where the two partial derivatives vanish, and

then you have to make sure that that point belongs

to your domain.

So even if the domain has a complicated shape that's

not going to be so hard.

You just have to know which domain belong-- which points

belong to this domain and which points don't.

The problem would arise at step 2 if the domain is

not quite so simple.

So let me give you an example.

So another example is a domain where x and y are such that x

squared plus 4y squared is less than or equal to 1.

So let's sketch this domain.

First of all, when you sketch domains by the way, there is a

general rule which you probably have figured out already

a long time ago.

You first look-- you have an inequality-- you first look at

the corresponding equality.

So in this case you would have just x squared plus

4y squared equals 1.

And you sketch that.

And that, we know, that's an ellipse.

We've talked about ellipses.

So for instance, we have the point x equal 1, x equals

negative 1, and y equals zero.

And we also have the points y equals 1/2 and y equals

negative 1/2 two other points.

And the ellipse looks like this.

So it's kind of a circle which is squished.

So that's-- the equality gives you a curve.

And in fact we can recognize our old friend the level curve.

This is a level curve for the function x squared

plus 4y squared.

But if instead of equality you write inequality like this, it

means that in addition you include points for which the

values is not equal to strictly 1 but could be less than 1.

And that means that this points are going to be on 1 or the

other side of this curve.

In this case, it's easy to figure out which side.

Well first of all they're going to be on the same side.

And clearly, x equals zero and y equals zero satisfies this

equation-- this inequality for this formula.

And that's this point.

So clearly it should be inside and not outside.

That's how you figure out whether it's interior-- inside

or outside-- the interior part or the exterior part of the

plane, or inside the curve or outside the curve.

In this case it's inside.

And usually it's easy to figure this out.

So that's the domain.

That's what now plays the role of this square.

Let's suppose you're asked to find maxima and minima of

the following function.

F-- I'll make the function a little bit easier, linear

function-- 2x plus y.

So how would you solve this problem?

You approach it in exactly the same way.

Solve it in the same way.

Meaning that the first step would be to set f(x) equals

zero and f(y) equals zero and find all the points inside the

domain-- inside the domain, which satisfies

the two formulas.

So in this case it's very easy to see that there are no points

like this because in this case f sub x is equal to 2.

If 2 equals zero, no solutions.

And f(y) equals zero, also no solutions.

So in step 1 we don't get any contribution to the list of

candidates, list of suspicious points.

So everything boils down to the boundary.

And in fact there is no step 3-- no step 3 here because

there are no corners. the boundary is smooth.

So we'd like to understand this entire boundary

in sort of one shot.

How can we do this?

The idea in the previous analysis was to express the

restriction of our function to the boundary or maybe a

component of the boundary.

As a function of one variable.

Once it becomes a function of one variable, we could solve

the problem by just going back to the one variable calculus.

But in this case it's not so clear.

Because, well, in our problem it was very easy.

You just substituted y equals 1.

And of course, it's obvious that the function then becomes

a function of only one variable, namely x.

But here, what should you substitute?

Well one way to deal with this would be-- one possibility

would be to express one of the two variables on the boundary

in terms of the other.

So you could write, for example, x equals square

root of 1 minus 4y squared.

But there is a problem.

We have two solutions.

There's plus minus 1 minus 4y squared.

So in fact what we are doing is that we're taking this very

nice domain and we are artificially cutting

it here and here.

And so we're breaking it into two parts.

One would be where x is greater than zero or greater

than or equal to zero.

And the other one would be where x is less

than or equal to zero.

Once you do that, you have two components-- let me raise

this now-- two components.

One is x greater than or equal to zero and the other one

greater or equal to zero.

And then you can try to solve the problem of each component.

So you would have to-- you would get the function, say

g of y-- which is equal to 2 times square root of 1

minus 4y squared plus y.

This is in the case where x is greater than or equal to zero.

And a similar function with a negative sign, when x is

less than or equal to zero.

And you would have to find the points where

g prime of y is zero.

So you see that it was kind of nasty because the function's

not so nice and you split such a beautiful, a beautiful

ellipse into two parts.

And then you have to also deal with the end points.

There are no corners the way there were corners there,

because there they were actually not smooth.

They were singular, kind of points of singularity.

Here they are actually smooth points but we have sort of

artificially made them singular, made them look like

corners, because now they appear as the end points of

the-- they appear as the end points of these segments-- two

parts of the boundary.

So it is possible to solve the problem in this way but it just

not a very elegant solution.

So it is natural to try to find a more elegant solution.

And the more elegant solution-- so then there would

still be step 3.

Actually in this approach there will be a step 3 which will

correspond to points x equal zero and y equals plus/minus

1/2, a kind of fake corners.

So what I'm going to explain now is a more elegant

solution which is called Lagrange multipliers.

It's a more elegant solution which is applicable to even

more sophisticated functions.

In this case, actually we're dealing with a fairly

simple level curve.

Or with a level curve of a fairly simple function.

And that's why we can actually get away with expressing one

of the variables in terms of the other.

But let's say-- let's suppose-- you have instead of this, you

have something like x squared plus 4y squared plus xy plus--

you know some-- plus y plus square root of x.

So then try to express now y in terms of x or x in terms of y.

That would be very complicated.

That alone-- even if you can, then you have to

take derivatives and stuff like that.

So that's very complicated.

But it turns out that is actually a much more

straightforward and elegant method.

And that's called Lagrange multipliers.

And actually I would like to explain this method

first geometrically.

So let me look again at this picture.

This picture-- the domain D consists of two parts again.

It consists of the interior of the ellipse and

the ellipse itself.

But we have just seen that on the interior of the ellipse

there are no interesting points for us.

There are no candidates for maxima and minima because our

function is such that there are no points where both

partial derivatives vanish.

So actually we can forget about the interior.

The problem boils down to understanding the maxima and

minima of our function restricted to just

the boundary.

There's nothing interesting for us inside.

So that's why I'm raising-- I'm sort of raising this and I am

just focusing on the boundary.

And now I would like to approach this problem

geometrically.

In other words, we have discussed several

times level curves.

And for example we recognize this curve itself as a level

curve of some function.

And I said many times that if you know the level curves of a

given function you can understand much better the

shape of this function.

So let's see how it works in practice in this problem.

The boundary is given by an equation.

So the boundary is a level curve.

That's a given.

But we also have a function.

So you have to separate the two things that you have.

First of all, you have the equation-- this is the

equation of the boundary.

But you also have a function.

And so what we can do is we can also look at the level

curves of this function.

Here we are looking at just one level curve.

Why only one level curve?

Because this is the equation.

The equation is that this is equal to 1.

So we are just looking at one level curve for this function

f-- I'm sorry, for this function which let's call it,

let's call it something else-- let's call it h.

I don't want to use g because I already used g for this other.

So this is function h.

For the function h this is only one level curve which

shows up in this problem.

We don't care about other level curves.

But for the function f we would like to find maxima and minimum

values-- maximal and minimal values-- and we don't know

yet where there are.

So we should look at all level curves for that

second function f.

So what are- what do this level curves look like?

So the level curves 4f of xy equal 2x plus y are given by

equations 2x plus y equals k where k is some number.

So for instance, one possibility is say k is equal

to zero. 2x plus y equals zero.

eq question No?

All right.

2x plus y equals zero.

What is 2x plus y equals zero?

It's the same as y equals-- maybe it's easier if I write

it like this like this: y equals minus 2x.

That looks like this.

That's the line which goes through zero with a

slope with negative 2.

And then the if you look at other lines like

this, for example.

For instance, 2x plus y equals 1.

This is just y equals minus 2x plus 1.

So that's going to be a line which is parallel to this one

but goes through the point 1.

So it's actually in this case, it's going to cut

here at the point 1/2.

So I should do it more precisely like this.

And so on.

Well my lines are not perfect yet.

My lines are not perfectly aligned but I hope you see

what I'm doing anyway.

And what about other level curves?

Of course all other level curves are going to be

just parallel lines.

I mean we are so sophisticated now we even understand

questions of planes in 3-space.

For sure we understand equation of lines on the plane.

This looks like a typical equation-- well, if we had a z

that would be typical equation for a plane in 3-space.

Because we don't have a z it's even simpler.

So instead of a plane in 3-space we get a line in

two-dimensional space on the plane where 2,1

is a normal vector.

So when you vary this number k, you're going to get a system of

parallel with such normal vector.

So I've drawn two of them but now it I'll draw more.

Here's another one.

Yeah, i kind of messed it up.

We can do it again.

I do want to emphasize the fact that this is a

system of parallel lines.

OK.

Let's see if I can do it better this time.

So we got this, we got this, and this.

And now I want to draw a master line which is

going to be this one.

I keep hearing voices as if there is some color commentary,

sort of to my play-by-play.

I don't mind but I'm sure that people sitting next

to you probably mind it.

OK.

So you have a system of lines like this, OK?

So I think it's more clear now.

OK.

So let's say that this one goes through zero.

What is special about each of these lines?

What is special is that along each of these lines the value

of the function f is the same.

That's the function which we want to maximize, minimize.

We would like to find maximum, minimum values

of that function.

And now what we-- what this picture tells us, it tells

us the lines of equal value for that function.

So for instance, the function is going to be equal to

zero along this line.

The function is going to be equal to-- where

is my master line?

This is my master line-- the function is going to be

equal to 1 along this line.

The function is going to be equal to say

1/2 along this line.

Maybe 3/4 along this line, 1 1/4 along this line and so on.

Whatever line you want you can draw.

And it's going to be one of these parallel lines.

But this the level curves for the function on

the entire plane.

But for now we are interested in step 2, so we're looking--

because step 1 did not produce anything anyway.

So we're looking just at this ellipse.

So what we're really interested in is the values of the

function on the ellipse.

AUDIENCE: [INAUDIBLE]

Oh sorry.

Not this one, this one.

The one that goes through zero.

It's just bad drawing.

It's a bad drawing day.

But this one. you see what I mean. 1/2 would be then--

this is not 1/2, it's like 1/3-- sorry.

But you see what I mean.

It's not to scale, it's not parallel, it's not-- it's a

very qualatative picture, not very precise.

Kind of abstract like Picasso, the way Picasso

would approach it.

But I hope you see what I mean.

As you vary-- the point is when you go to the next one the

right, the value will increase.

When you go to the next one to the left, the value of

function will decrease.

But we are interested not in the values everywhere, we are

only interested in the value on and the values on that ellipse.

So in particular, we now know that here and here, that

these are the two points of intersection of the line f

equals zero with out ellipse.

Here and here the function is equal to zero.

Here and here, the function is equal to 1.

Here and here it's equal to 1/3 and so on.

So now how can we use this picture to find where the

function takes the maximum or minimum values.

Well the point is that for sure, we can see from this

picture it's quite clear that f equals zero is not.

It's neither maximum nor minimum.

Why?

Because this line we can move both to the right

and to the left.

When we move it to-- and we'll still have some points of

intersection with our domain-- not with our domain, sorry--

with the ellipse, with our curve.

If I move it even by 1/3-- I can move it even by 1/3 and

I still have intersection.

So that cannot be maximum value because here the value

is zero, here it's 1/3.

Likewise, say this one would be f equals negative 1/3.

So from here I could also move along this curve to the left.

So I can shift my parallel line to the left,

decreasing the value.

So here is neither maximum nor minimum.

Likewise this is neither maximum or minimum.

This one is not, this one is not.

So at first it looks like neither of them

maximum or minimum.

So the point is to find the points where this qualitative

argument doesn't work, where it breaks down.

We have to find the points where you could no longer move

it to the right or to the left.

In what sense?

Of course you could always move it to the right and to the

left, but you could not move it and still have a non-empty

intersection with your curve.

For example, you have a line like this, which is one

of their level curves.

But it's totally outside of the scope of our problem.

In other words, there's no intersection between this line

and this curve that we're interested in, right?

So those guys are excluded from our consideration anyway.

And so the point is to find the points where you can not move

to the right or to the left and still have non-empty

intersection.

And the point-- for most of these lines, for all of the

lines which I have drawn, you have two points

of intersection.

So as you move further and further and further to the

right-- let me just make it slightly more

realistic looking.

Did somebody have a question?

No?

OK.

As you move it, you still get two points, two points, two

points, two points, two points and then boom, there this one

point that something happens.

At this point, the intersection instead of getting two points,

you get just one point of intersection.

In other words, give up two points, two points, two points,

two points, but they get closer and closer to each other and

they converge at this point.

And suddenly you get a very special point.

This point is a point where the intersect is just this one

point and not two points.

Right?

You see what I mean?

It's just one point.

And likewise on this end, the same time.

You have intersection of two points and two points and the

next one's two points-- I'm just unable to do parallel

lines today-- two points, two points.

And then suddenly, boom!

There is this one line where it intersects with one point only.

And for these two points my previous argument

actually breaks down.

You see because when I was here I could argue that this is not

the maximum or minimum because I could move this way,

I could move that way.

And when I move a little bit, I would still have a non-zero--

non-empty intersection with my curve.

But when I'm here, I can certainly move it to the left

so I could decrease the value.

But I cannot move it to the right.

It's kind of frozen here.

What do I mean that I cannot move it to the right?

Of course I can.

I can move it, but if I move it by any small amount-- epsilon--

or any other small amount, there will still be a line like

this but it will not intersect my curve.

So that second argument breaks down.

So that means that this is the maximum value.

Because I cannot move it anymore to the right.

You see what I mean?

And likewise this is the minimum value because

I cannot move it any more one to the left.

So what are these special points?

Well, now we know enough about level curves and things like

that to realize that these are precisely the points where

this line is tangent to this level curve, to this curve.

So this is the case when this is a tangent line.

And likewise, this is the case where this is a tangent line.

And what does it mean that this is a tangent line?

It means that the normal vector-- the normal vector

to this level curve is proportional to the normal

vector of this line.

But what on that the maxima and minima are the points where the

level curve of the function f of xy, which in this

case is 2x plus y.

In this case, the level curve are lines.

But in general, they're not going to be lines, they're

going to be some curves.

Where the level curve of the function which we want to

maximize and minimize for f is function that we want

to maximize or minimize.

OK?

If it's tangent to our curve, which is this h

of xy which equals 1.

So I think that x squared plus 4y squared equals 1.

So that's the given curve.

So the point is in this exercise we are given two

functions, but they play very different roles.

There's a function f and there's a function h.

Function f is the function we want to maximize and minimize.

Therefore we kind of look at all level curves of that.

h is the second function.

It's the function which defines the curve and also defines the

domain if our original problem was about the interior of that

curve, the way I started talking about it.

For this function, we all need this in one level curve.

That's our-- that's the curve where we want to

maximize/minimize f.

But the interesting thing happens the places where

the two curves are tangent to each other.

And what does it mean that they are tangent to each other?

Well, in this example, the level curves of f are lines, so

it's clear what we mean by saying that it's tangent

because we talked about tangent lines.

But in general, if you have two different curves, there is also

an obvious notion of being tangent.

It just means that tangent lines to them coincide.

Or in other words, normal lines to both of them coincide.

So this means, in other words, the normal lines-- normal

lines-- so we are points where the level curve, curve of f,

is tangent to our curve.

Maybe I should say these are the point-- these are the

points where the normal lines to f of xy equals k and h of

xy equals 1, are the same.

Coincide.

So how do we check that?

Well we know that the normal lines are given by the

direction vector-- the normal vector to that curve,

which is a gradient.

So equivalently, we can say that the two gradient vectors

should be proportional to each other.

These are the points x,zero y,zero such that nabla f at

x,zero y,zero is equal to lambda-- that's nabla--

h for x,zero y, zero.

And that's the equation that will give us those points.

What is the meaning of this equation in our case?

In our case, at this point we have a normal

vector to the line f.

f equals k.

That's vector That's the vector 2, this is nabla f.

Actually, nabla f is very easy to find, it's 2,1.

It's the vector 2,1.

There are two parts to the derivative.

So our f are 2 and 1.

Let me do it in red so that everything isn't

the same color.

This is nabla f at this point.

And nabla h is the vector which is normal to the curve.

So in this case, the curve goes like this, so the normal vector

will go like this, right?

So that's nabla g.

Clearly at this point they are not proportional to each other.

And they shouldn't be because these are not the points where

the two level curves are tangent to each other.

They intersect with a certain angle.

They are not tangent.

But at this point, the two normal vectors suddenly become

proportional to each other.

So this is one and this is the other.

They become proportional you see.

So that, and likewise same thing here.

That's why at these points we get maxima/minima values.

So let's find these points.

Are there any questions about this first of all? of all?

What's lambda?

Very good point.

So lambda is the coefficient of proportionality.

We don't know it a priori.

this is, by the way, a very important point because

some of you made this mistake on the midterm.

If you have two vectors, each vector spans the line, right?

So what is the property of the two vectors for the

lines to be the same?

ehy It's not that they are equal, right?

They should be proportional.

If you have a vector you vet a line.

But if you take twice that vector you get

the same line, right?

So the condition is much weaker.

It's a condition is that the two vectors are proportional

to each other.

And then the two line coincide.

And what does it mean that they are proportional?

Proportional means that one of them is equal to some

number times the other.

And we call that number lambda.

So that's the coefficient of proportionality.

Which in this case is called Lagrange multiplier.

La-- I'm thinking of another name-- Lagrange multiplier.

And this method of finding these external points is called

Lagrange multiplier method, after a great French

mathematician, Lagrange.

So, any other questions?

You'll probably see it better when we do it in this example.

So let's see what this equation look like in this example.

So what is nabla f?

Nabla f consists of two partial derivatives of f.

And those are easy to find it, they are just 2 and 1.

And what's nabla h?

These are the two partial derivatives of that function,

x squared plus 4y squared.

So it's going to be hx and hy are going to be 2x and 8y.

And now the equation that we want is that 2,1-- that's the

left hand side-- is equal to lambda times 2x 8y.

More precisely, since here I wrote x,zero y,zero, let

me stick to that and let's write also x,zero y,zero.

But of course when you do your calculations you will

oftentimes just write x and y without worrying about

the index zero.

Here I write x,zero and y,zero to emphasize that we are

talking about a particular point x,zero y,zero and not

about general values x,y.

So, but this is not just one equation.

This is a system of two equations because you have two

components of these vectors.

So there are two equations. 2 equals lambda times 2x and 1

equals lambda times x,zero y,zero.

So let's remember one thing which is that

lambda is unknown.

We are saying that they're proportional to each other,

but we don't know what the coefficient of

proportionality is.

So what we need to do here is to solve these equations for

x,zero y,zero and also lambda.

But you shouldn't forget that actually you are solving it not

on the entire plane but you are solving it on that curve only.

We are interested only in that curve now.

That's very important.

So far, I've written the equation as though x,zero and

y,zero were an arbitrary point.

But in fact, it should be a point on that curve.

So we have to add one more question, which is the

equation of that curve.

Which is x squared-- x,zero in this case plus 4y

squared equals 1.

So in fact you have three equations.

Three equations on three variables. x,zero

y,zero and lambda.

If you just tried these two equations, it would mean

that you look everywhere.

You have to constrain x and y, or x,zero y,zero by

imposing various original equations on this curve.

OK, so now it looks more meaningful because you've

got three equations and three variables.

And usually, generically, if you have three equations with

three variables, you're going to get one solution or you

get fininite many solutions.

That's what's going to happen here.

Let's find a solution.

So first we can express x,zero and y,zero in terms of lambda

and then we can substitute x,zero and y,zero into the

third equation and we'll get one equation in lambda.

That's the simplest way to solve it.

So from the first equation we find that x,zero

is 1 over lambda.

For the second question we find that y,zero is 1 over 8 lambda.

And now we put them into the third equation and we get one

over lambda squared plus 4 times 1 over 8 lambda

squared equals 1.

So what that means 1 over lambda squared times 1 plus

4 divided by 64 equals 1.

And that's going to be what? so this is 1/16 so that's 17/16.

And from this you find that lambda squared

is equal to 16/17.

Right?

Is that right?

17/16.

Right, because you take it to the other side. 17/16.

And that means that there are two solutions to lambda.

Lambda is equal to plus or minus square root

of 17 divided by 4.

Once we know what lambda is, we can find x,zero

and y,zero, right?

So now we find x,zero is equal to 4 over of square root of 17.

Well, there are two points.

First of all there are two points.

So for each, let me do it slowly-- let

me not in mess it up. each solution for lambda

will correspond to a particular point.

And of course we kind of know that we should get two points,

we expect to get two points.

One will be the maximum and the other one will be the minimum.

So the first point is lambda equals square root of 17 over

4. x,zero is 4 over square root of 17.

And y,zero is 1 over 8 of that.

So it's 1 over 2 square root of 17.

The second point, so that's this point, that's this point.

And the other point is going to be with the minus sign.

4 over square root of 17 minus 1 over 2 square root of 17.

So lambda equals minus square root of 17 over 4, x,zero is

minus 4 square root of 17, y,zero is minus 1 over

2 square root of 17.

And well in this particular case it's clear which one is

the maximum and which one is the minimum because we can see

it clearly on the picture.

But in general, it may not be so obvious.

So what you should do is just substitute them

into your function and calculate the value.

And the one for which the value is bigger will be the maximum

and the one for which the value is smaller it will

be the minimum.

So in this case, f is going to be you have to substitute in 2x

plus y, so that's going to be 2x plus y, so it's like 8--

it's going to be 1 over square root of 17 and here

will be 8 plus 1/2.

So that's going to be 17-- square root of 17 over 2.

So you've got 17 over that's the square root.

And here the function will be just-- actually, in

this case-- ah, but here see I was too fast.

I kind of-- Oh no, it is correct. that's right.

So it will be just minus of that.

Minus square root of 17 over 2.

So it's clear that that's the maximum, that's the minimum.

So that's that solution.

Yes?

AUDIENCE: [INAUDIBLE]

That's right.

They point in the same direction.

So lambda is positive.

Right.

When it's the minimum they should be opposite.

Exactly.

It's always like this.

If they are pointing in the same direction,

that's the maximum.

So that's actually a very good question.

That we can actually tell by looking at in which direction

the two vectors are pointing. or, in other words, what

is the sign of lambda?

If the sign of lambda is positive, it's the maximum.

If the sign of lambda is negative it's the minimum.

And geometrically actually, I did not draw the picture, but

the picture would be like this: this would be the-- y would be

the gradient vector for the function g.

But now the red one will now go this way, but

red is always the same.

It's 2,1.

The So that's nabla f.

And here they are opposite whereas here they are pointing

in the same direction.

Any other questions about this?

AUDIENCE: [INAUDIBLE]

Right.

That's right.

So-- that's right.

So you're talking about-- the question is about the level

curve, the red ones, right?

So the red lines are the level curves of the function

f, which is 2x plus y.

So we could look at the equation f equals

2x plus y in 3D.

That would be z equals 2x plus y.

So in fact, because this function is linear, the

graph of this function would be a plane.

And that plane actually can be very easily

illustrated right here.

So let's say it's this plane.

So now, what are the level curves-- I'm going back to

the same drawing which I did a couple of lectures ago.

What are the level curves?

Well you can draw-- draw level curves are just the sections

which are parallel to the floor, sections by planes which

are parallel to the floor.

So here's one of them, here's another.

I should have drawn in red, but you see what I mean.

This is will be-- so this is the level curve

for the function f.

But again, you think of the level curves as

being part of the graph.

But you can also think of them of their projections

into the x-y plane.

So what I had drawn on that picture are the projections

onto the x-y plane.

So I'm writing them as equations f equals something.

For example, f equals zero, or f equals 1/3 or f equals

negative 1/3, f equals 1 and so on, right?

If you like, you can think of the plane, x is the graph of

this function, as one which is obtained by assembling those

lines but in the kind of-- each time you have to lift it by

the height which is equal to the value of the function.

So only this one-- this one, the one for which x is equal

to zero, would remain on the x,y plane.

And this one will be elevated and those guys will be

lower, just like here.

So let's say the x-y plane-- if the x-y plane were so that this

one was-- let's say this one is f equals zero.

So then this one-- let's say this is f equal 1/3 and so on.

So they're all elevated.

Thnk of like sort of a staircase.

I mean it's not like a staircase, but if you think of

just a step 1/3, then this is a collection of lines.

But it doesn't make sense to draw them in 3D because they

are given by equation on just x and y.

So that's why I'm drawing their projections into the

floor, onto the x-y plane.

You see?

AUDIENCE: [INAUDIBLE]

That's right.

That's a very good question.

So, so far we were lucky that the domain, or more precisely,

the boundary of the domain, was presented by one equation only.

So now I would like to for the last example, look at kind of a

mixed case where you could have some components in the boundary

and some of them could be linear and some of them

could be curved like this.

So let's say a very simple generalization of this example

would be-- a very simple generalization of this

example would be if your domain was like this.

So you could say, in this case the domain would be x,y such

that x squared plus 4y squared is equal To 1.

And x is greater than or equal to zero.

Oftentimes you will have domains like this.

So in this case, we actually do have two corners.

And we actually have two natural components

in the boundary.

This is one component-- the first component-- and this

is the second component.

So how would we solve the problem of finding maxima and

minima for function f if our domain is like this.

So let's say we're given some function f of x,y. of x,y.

I don't want to specify what this function is.

It could be this function 2x plus y or it coule be a

more complicated function.

Well, very simple.

Now we know everything that we need know to know for this.

y, that's right.

You're right.

Thank you.

y greater or equal to zero.

X greater or equal to zero would be this part.

That's right.

So you approach it the same way as before.

Three steps.

The first step would be to find the points in the interior

where both partial derivatives vanish.

So step 1, you said f sub x is zero, f sub y is zero inside.

That's just like before.

Well of course if our function is just 2x plus y then there

are no such points so we don't have to worry about it.

But in general there might be some points in the interior.

So we would collect those points.

So we don't do anything yet.

We just collect them and put-- so we start a list.

And these points will be on the list.

And then we do step 2.

Now in step 2, you have two components.

So you want to do the first component, means that you

do Lagrange multiplier.

But out of all the points we find we only pick those points

for which y is greater or equal to zero.

Let's start with the second component.

Sorry.

Right.

The curvy component.

AUDIENCE: [INAUDIBLE]

No, I meant that as a component.

This is the second component.

I'm just doing it out of order.

Instead of doing the first component first and second

component second, I do second component first.

Because I don't want to switch the board.

But now I've spent more time than I would have just

switching the board than changing the first and second.

Anyway, I think you know what I mean now, right?

So we do this one.

And this is exactly the same calculation as

we've done so far.

Well, it would have been exactly the same if our

function were just 2x plus y.

But we to have to take care of the fact that now, the curve

that we're talking about is not the entire as before

just the top part.

So we are only interested in the points we fine which

belong to this part.

So if you find something here, you discard it.

It's not part of our problem.

So in this case, if the function were indeed 2x plus y,

instead of finding two points we would find just one

point, which is this one.

Only this one.

And this one we discard.

So it means that at the end of your calculation

you have to check.

When you find your point, you have to check.

Is it true that y,zero is greater or equal to zero?

For this one it's true but for this one it's not.

So then we discard this one.

So that's what you do for the second component.

For the first component, by which I mean just the segment

from negative 1 to 1, you do it in the old fashioned way.

You just substitute the function-- you just substitute

y equals zero into your function.

You get a function of one variable, you

find the derivative.

You find the points where the derivative vanishes.

So the first component is like this.

From 1 to negative 1.

And you just substitute y equals zero and get a

function of one variable.

Function g of x and then solve the g prime over x equals zero

when x is between negative 1 and 1.

And finally you have step 3, where because I ignored in my

intermediate calculations up to now, I ignored

the two end points.

I have ignored the end points.

For the same reasons that I explained before.

I don't have to ignore it.

I could also take them into account, but then I might

have to include each of them twice in my analysis.

So I personally would just take the two corners separately.

So that's x equals 1, y equals zero and x equals negative

1 and y equals zero.

So put them on your list.

And finally you assemble this list.

So You've got points from step 1, step 2, each of the

two components in step 3.

And then you calculate the value of your function at each

of these points and you find which ones correspond to

the maximum and minimum.

And that's how you solve it.

That's right.

So the question is, does Lagrange work for the

inside of the domain?

Lagrange is specifically for the boundary, for the curve.

So it doesn't tell us anything about the inside.

Because this argument would break down it being.

Because for being tangent you would to have two curves.

And one of the curves would be the level curve of function f

and the other one would be your boundary.

If you weren't looking at the boundary then we wouldn't be

able to argue in this way, right?

But you have to realize that there are two ways in

which Lagrange methods may enter a given problem.

Essentially there are problems of two types.

The problems where you have to maximizer and minimize

functions on the two-dimensional

domain like this.

So this is usually domains which will be given by some

inequalities like x squared plus 4y squared less than or

equal to 1 or that coupled with y greater than

or equal to zero.

Such a problem consists of several steps.

You have to look at inside and you have to look

on the boundary.

But sometimes there could be a problem where you're just asked

to maximize or minimize your function on the boundary only.

In other words your set is given not by the

inequalities, by equalities.

Like you could be asked specifically to find the

maximum/miminum of your function 2x plus y on that

curve, on that ellipse.

So in that case if in the problem there is no reference

to the interior of that curve, you don't need

to do the first step.

You can skip the first step because you don't need

to know what the partial derivatives and so on.

So you just start with step 2.

You just look at the corresponding curve.

AUDIENCE: [INAUDIBLE]

This function?

You mean g prime as a constant or g as a constant?

If g prime is a constant it means that there's

a linear function.

So your question is about function one variable, right?

What does it mean if you have function one variable whose

derivative is a constant?

It means that the function is a linear function.

ax plus b.

And of course that your function does not have

any extrema points.

Any points of local maxima and minima because the derivative

is nowhere vanishing.

So in that case step 2 will be vacuous.

So if you that as your function, the maxima and

minima it will obtain will happen at the end points.

And the end points are the ones we're including anyway.

AUDIENCE: [INAUDIBLE]

That's right, that's right.

In fact, in this particular case that's what will happen.

If f were-- actually 2x plus y and we said y equals zero, in f

is 2x plus y and then we said y equals zero then g becomes 2x.

So it is going to be a linear function like this.

So in that case, there will be no points coming from

this segment-- from the interior of this segment.

The only points will come from the end points of that segment.

In the way I approach this, those points will show

up in the third step.

But if you like it, you could include them in step 2 as well.

So now I would like to discuss one possible application

of all of this.

The application of a kind of practical problem where you

have to-- a kind of optimization problem where you

would like to optimize-- maximize or minimize

certain quantity under various constraints.

So here is a very representative example of that.

So we are building a rectangular box from cardboard.

And we would like to maximize the volume of the interior of

that box given that we can use so much cardboard.

So it's a very natural question one could ask.

So build a rectangular cardboard-- a rectangular

box without a lid.

From 12 square meters of cardboard so as to

maximize its volume.

OK, so here you are asked to build a box like this.

And what does it mean build a box?

You have to decide what the dimensions will be.

So this will be x, this will be z, and this will be y.

You have to decide what the dimensions are given that

you have so much cardboard.

So that means that you have to maximize-- we translate it into

mathematics, we say that means we must maximize the volume.

And volume is a function of x, y, z, which are the

dimensions of the box.

And it's just x times y times z.

And we have a constraint.

So we have function h of x,y,z which is the amount of cardbox.

So how much cardbox do we use?

Well, this box has six sides with cardboard everywhere

except for the top.

So we'll get two rectangles of which size for

the vertical ones.

But the horizontal-- there will be only one

horizontal rectangle of dimensions x and y.

So the function will be x times y.

That's the area of this rectangle and it's not doubled

because we don't have the lid as we are asked

in the exercise.

But for the other guys we'll have to double. yz and xz.

So it's xy plus 2xz plus 2yz.

That is a total area of cardbox that is used for this, such

a box, and that has to be equal to 12.

So this is a very-- we get exactly in the situation

of the problem about Lagrange multipliers.

Except now we have three variables to begin with.

In my previous problem we only had two variables, x and y.

So we had a function f in two variables and we had a

function h in two variables.

Now the role of x is played by the function v and the role of

h is played by this function.

H and three variables.

But we will approach this problem in

exactly the same way.

Namely, we will say-- we will apply Lagrange method.

OK by Lagrange method-- Lagrange multiplier-- that

means that we are looking for points x,y,z.

I will skip x,zero y-zero z-zero.

Just call them x y z to simplify notation.

Such that nabla v-- the gradient of the function

v-- is proportional the gradient of the function h.

I remember the [UNINTELLIGIBLE]

equation h equals 12.

So remember it's not enough to write down the equation of

proportionality-- I'm sorry, I should have written nabla.

It's not enough to write down this equation, which is the

equation of proportionality of the two normal vectors or

the two gradient vectors.

You also have to write down the equation you

constrained separately.

Otherwise you will not find unique solution or you will not

find fundamental solution.

So what do these equations look like?

Well, nabla v is obtained by taking all partial derivatives

of that function.

So what are they?

Well, with respect to x it's yz, with respect to y it's xz,

with respect to z it's xy.

And here we have lambda times the partial derivative of h.

And what is the partial derivative of h?

It's 2xz plus xy.

What am I -- what am I writing?

Sorry, I'm looking in the wrong place.

So it's y-- where did I write this?

2z-- OK.

2z plus y-- 2z plus y, 2z plus x and 2x plus 2y.

OK?

So now of this means that we have a bunch

of equations actually.

Let me see.

How much time do I have?

OK, great.

We have just enough time.

OK, so this is going to be three equations.

Yz equals lambda times 2z plus y. xz equals lambda times

2z plus x. xy is equal to lambda times 2x plus 2y.

Let's not forget this equation too.

So that will be 2xz plus 2yz plus xy equals 12.

So we've got a system of four equations with the variables

x, y, z and lambda.

So that's good.

We have four occasionals and four variables.

So at first it looked kind of intimidating.

But actually-- but actually it's not so bad because, for

example, the way we can approach this is the following.

For example, note that from this formula we know that x,

y, z equals lambda x times 2z plus y.

But also from this formula x, y, z-- so maybe I should try

it like this-- is lambda times y 2z plus x.

From this formula now I multiply by z.

This line I multiply by x, this line I multiply by y and

this line I multiple by z.

And the left hand side is the same.

But on the right hand side I get different formulas.

So here I get lambda z times 2x plus 2y.

OK?

So for this example, we can then equa-- we can look at

this equation, for example.

So we have then 2x-- so we got lambda times 2x plus-- 2xz plus

xy is equal to lambda times 2yz plus xy.

2yz plus xy, right?

So now the point is that actually we can assume that--

we can show that lambda is not equal to zero.

Because if lambda were equal to zero, then

nabla v would be zero.

And nabla v being zero means that one of the three

variables is zero.

But of course the cardbox has non-zero comp-- dimensions.

All of the three dimensions are non-zero.

So lambda is non-zero since nabla v is

assumed to be non-zero.

So therefore we divide but lambda, we could move lambda.

And so now you see what happens is that now xy appears

and we get 2xz and 2yz.

And again, since z is not equal to zero because of course

otherwise it's not a box if z is equal to zero.

This implies that there actually x is equal to y.

So that's the first formula that you have.

And then if you substitute, in a similar way we also find

that's why is equal to 2z.

So that means that you've got two relations between x, y, z.

x is equal to y and y is 2z.

So that means that x is equal to y and x is equal to 2z.

So now you can substitute them into this last equation.

Put this too in the last equation.

You can express all of them in terms of z.

Find z, find x, find y.

That's the solution.

So the end result is x is 2, y is 2, z is 1.

And that's the box that you get.

All right, see you on Thursday. the on Thursday