Physics 101
Physics 101
AN INTRODUCTION TO PHYSICS
I. GENERAL INFORMATION
Purpose: This course aims at providing the student a good understanding of physics
at the elementary level. Physics is essential for understanding the modern world, and
is a definite part of its culture.
Background: It will be assumed that the student has taken physics and mathematics
at the F.Sc level, i.e. the 12th year of schooling. However, B.Sc students are also
likely to find the course useful. Calculus is not assumed and some essential concepts
will be developed as the course progresses. Algebra and trigonometry are essential.
However, for physics, the more mathematics one knows the better.
Scope and Duration: The course has 45 lectures, each of somewhat less than one
hour duration. All main fields of physics will be covered, together with several
applications in each.
Language: For ease of communication, all lectures are in Urdu. However, English or
Latin technical terms have been used where necessary. The student must remember
that further study and research in science is possible only if he or she has an adequate
grasp of English.
Assignments: At the end of every lecture summary you will find a few questions that
you should answer. The book you choose to consult will have many more. Those
students who are seriously interested in the subject are advised to work out several of
the questions posed there. In physics you cannot hope to gain mastery of the subject
without extensive problem solving.
Tutors: Their duty is to help you, and they will respond to all genuine questions.
However, please do not overload them as they have to deal with a large number of
students. Happy studying!
2. It is thought that the laws of physics do not change from place to place. This is
why experiments carried out in different countries by different scientists – of any
religion or race – have always led to the same results if the experiments have been
done honestly and correctly. We also think that the laws of physics today are the
same as they were in the past. Evidence, contained in the light that left distant
stars billions of years ago, strongly indicates that the laws operating at that time
were no different than those today. The spectra of different elements then and
now are impossible to tell apart, even though physicists have looked very
carefully.
Speed LT −1
Acceleration LT −2
Force MLT −2
Energy ML2T −2
Pressure ML−1T −2
You cannot add quantities that have different dimensions. So force can be added
to force, but force can never be added to energy, etc. A formula is definitely
wrong if the dimensions on the left and right sides of the equal sign are different.
6. Do not confuse units and dimensions. We can use different units to measure the
same physical quantity. So, for example, you can measure the mass in units of
kilograms, pounds, or even in sair and chatak! In this course we shall always use
the MKS or Metre-Kilogram-Second system. When you want to convert from one
hsystem to another, be methodical as in the example below:
mi mi ft 1 m 1 hr m
= 1 × 5280 ×
1 × = 0.447
hr hr mi 3.28 ft 3600 s s
When you write it out in this manner, note that various quantities cancel out
cleanly in the numerator and denominator. So you never make a mistake!
7. A good scientist first thinks of the larger picture and then of the finer details. So,
estimating orders of magnitude is extremely important. Students often make the
mistake of trying to get the decimal points right instead of the first digit – which
obviously matters the most! So if you are asked to calculate the height of some
building using some data and you come up with 0.301219 metres or 4.01219 × 106
metres, then the answer is plain nonsense even though you may have
miraculously got the last six digits right. Physics is commonsense first, so use
your intelligence before submitting any answer.
8. Always check your equations to see if they have the same dimensions on the left
side as on the right. So, for example, from this principle we can see the equation
v 2 = u 2 + 2at is clearly wrong, whereas v 2 = u 2 + 13a 2t 2 could possibly be a
correct relation. (Here v and u are velocities, a is acceleration, and t is time.) Note
here that I use the word possibly because the dimensions on both sides match up
in this case.
9. Whenever you derive an equation that is a little complicated, see if you can find a
special limit where it becomes simple and transparent. So, sometimes it is helpful
to imagine that some quantity in it is very large or very small. Where possible,
make a “mental graph” so that you can picture an equation. So, for example, a
formula for the distribution of molecular speeds in a
2 2
gas could look like f (v) = ve-(v-v0 ) / a . Even without
knowing the value of a you can immediately see that
a) f (v) goes to zero for large values of v, and v = 0.
b) The maximum value of f (v) occurs at v0 and the
function decreases on both side of this value. v0
1. Scientists are told to doubt everything and not believe anything that is not
provable. Is this a strength or weakness of science?
2. According to the philosopher of science, Sir Karl Popper, even the most well-
established and popular scientific theory can never be proved – it can only be
disproved. Discuss this strange sounding claim.
3. Suppose we measure time by using hour glasses filled with sand. Discuss the
various errors that would exist if we tried to use this as a world standard for time.
Give a rough estimate for the error over a 24 hour period.
5. Find how much time is taken for a telephone signal to go from your mobile to
your friend’s mobile. Assume that the relevant satellite is orbiting the earth at a
distance of 250 kilometres, and that the electronic circuits have a delay of 300
microseconds. Give your answer in microseconds.
1. x(t) is called displacement and it denotes the position of a body at time. If the
displacement is positive then that body is to the right of the chosen origin and
if negative, then it is to the left.
2. If a body is moving with average speed v then in time t it will cover a distance
d=vt. But, in fact, the speed of a car changes from time to time and so one
should limit the use of this formula to small time differences only. So, more
accurately, one defines an average speed over the small time interval ∆t as:
x2
∆x
x1
t1 ∆t t2
5. Some students are puzzled by the fact that a body can have a very large
acceleration but can be standing still at a given time. In fact, it can be moving
in the opposite direction to its acceleration. There is actually nothing strange
here because position, velocity, and acceleration are independent quantities.
This means that specifying one does not specify the other.
7. For constant acceleration and a body that starts from rest at t = 0, v increases
linearly with time, v ∝ t (or v = at ). If the body has speed v0 at t = 0, then at
time t , v = at + v0 .
8. We know in (6) above how far a body moving at constant speed moves in
time t. But what if the body is changing its speed? If the speed is increasing
linearly (i.e. constant acceleration), then the answer is particularly simple: just
use the same formula as in (6) but use the average speed: (v0 + v0 + at ) / 2 . So
we get x = x0 + (v0 + v0 + at )t / 2 = x0 + v0t + 12 at 2 . This formula tells you how
far a body moves in time t if it moves with constant acceleration a, and if
started at position x0 at t=0 with speed v0 .
9. We can eliminate the time using (7), and arrive at another useful formula that
tells us what the final speed will be after the body has traveled a distance
equal to x − x0 after time t , v 2 = v02 + 2a ( x − x0 ).
10. Vectors: a quantity that has a size as well as direction is called a vector. So,
for example, the wind blows with some speed and in some direction. So the
wind velocity is a vector.
11. If we choose axes, then a vector is fixed by its components along those axes.
In one dimension, a vector has only one component (call it the x-component).
In two dimensions, a vector has both x and y components. In three dimensions,
the components are along the x,y,z axes.
r
12. If we denote a vector r = ( x, y ) then, rx = x = r cos θ , and ry = y = r sin θ .
Note that x 2 + y 2 = r 2 . Also, that tan θ = y / x.
13. Two vectors can be added together geometrically. We take any one vector, move
it without changing its direction so that both vectors start from the same point, and
then make a parallelogram. The diagonal of the parallelogram is the resultant.
r r r
C = A+ B
14. Two vectors can also be added algebraically. In this case, we simply add the
components of the two vectors along each axis separately. So, for example,
Two vectors can be put together as (1.5,2.4) + (1, −1) = (2.5,1.4).
v0
T t→ 4T T
5 5
4. Find the resultant of the two velocity vectors v1 = (3, 2) and v 2 = (4,6) and the angle
that the resultant makes with the first vector.
5. a) Is it always true that the resultant of two vectors is bigger than either one of the
vectors?
b) Is it possible to define a function of two vectors that is a scalar? Give an example.
c) Is it possible to define a function of two vectors that is a vector? Give an example.
6. Show that the magnitude of the resultant of two vectors is always less than or equal
to the sum of the magnitudes of the two vectors. In what situation are the two equal?
1. The concept of the derivative of a function is exceedingly important. The derivative shows
how fast a function changes when its argument is changed. (Remember that for f ( x) we
say that f is a function that depends upon the argument x. You should think of f as a
machine that gives you the value f when you input x.)
2. Functions do not always have to be written as f ( x). x(t ) is also a function. It tells
us where a body is at different times t.
∆x
∆t
4. Let's see how to calculate the derivative of a simple function like x(t ) = t 2 . We must first
calculate the difference in x at two slightly different values, t and t + ∆t , while
remembering that we choose ∆t to be extremely small:
∆x = ( t + ∆t ) − t 2
2
= t 2 + ( ∆t ) + 2t ∆t − t 2
2
∆x ∆x dx
= ∆t + 2t ⇒ lim = =2
∆t ∆t → 0 ∆t dt
5. In exactly the same way you can show that if x(t ) = t n then:
dx ∆x
≡ lim = nt n −1
dt ∆t → 0 ∆t
This is an extremely useful result.
7. A stone dropped from rest increases its speed in the downward direction according
dv
to = g ≈ 9.8 m/sec. This is true provided we are fairly close to the earth, otherwise
dt
the value of g decreases as we go further away from the earth. Also, note that if we
measured distances from the ground up, then the acceleration would be negative.
dv d ⎛ dx ⎞ d 2 x d 2x
8. A useful notation: write = ⎜ ⎟ = 2 . We call 2 the second derivative
dt dt ⎝ dt ⎠ dt dt
of x with respect to t , or the rate of rate of change of x with respect to t.
9. It is easy to extend these ideas to a body moving in both the x and y directions. The
position and velocity in 2 dimensions are:
r
r = x(t )iˆ + y (t ) ˆj
r
r dr dx ˆ dy ˆ
v= = i+ j
dt dt dt
= v xiˆ + v y ˆj
Here the unit vectors iˆ and ˆj are fixed, meaning that they do not depend upon time.
r r
10. The scalar product of two vectors A and B is defined as:
r r
A ⋅ B = AB cosθ
You can think of:
r r
A ⋅ B = ( A)( B cosθ )
r r r
= (length of A) × (projection of B on A)
OR,
r r
A ⋅ B = ( B)( A cosθ )
r r r
= (length of B) × (projection of A on B).
Remember that for unit vectors iˆ ⋅ iˆ = ˆj ⋅ ˆj = 1 and iˆ ⋅ ˆj = 0.
2. Suppose the position of a particle changes with respect to time according to the relation
b0 + b1t 3
x(t ) = a0 + a1t + a2t 2 + + c sin dt
t
a) What are the dimensions of a0 , a1 , a2 , b0 , b1 , c, d ?
d
b) Find x(t ).
dt
d2
c) Find 2 x(t ).
dt
3. We shall find the derivative of a product of two functions in this problem. Suppose that
df g (t + ∆t )h(t + ∆t ) − g (t )h(t )
f (t ) = g (t )h(t ). Start from the definition, = lim . Now
dt ∆t → 0 ∆t
dg
write g (t + ∆t ) = g (t ) + ∆t and a similar equation for f (t + ∆t ). Hence show that:
dt
df dg dh
a) = h+g
dt dt dt
d (1/ g ) 1 dg
b) =− 2
dt g dt
d ( f / g) f dg 1 df
c) =− 2 +
dt g dt g dt
1. Ancient view: objects tend to stop if they are in motion; force is required to keep
something moving. This was a natural thing to believe in because we see objects
stop moving after some time; frictionless motion is possible to see only in rather
special circumstances.
2. Modern view: objects tend to remain in their initial state; force is required to
change motion. Resistance to changes in motion is called inertia. More inertia
means it is harder to make a body accelerate or decelerate.
3. Newton’s First Law: An object will remain at rest or move with constant velocity
unless acted upon by a net external force. (A non-accelerating reference frame is
called an inertial frame; Newton’s First Law holds only in inertial frames.)
5. The greater the mass of a body, the harder it is to change its state of motion. More
mass means more inertia. In other words, more mass leads to less acceleration:
1
⇒ a∝
m
Combine both the above observations to conclude that:
F
a∝
m
F
6. Newton's Second Law: a = (or, if you prefer, write as F = ma ).
m
9. Forces can be internal or external. For example the mutual attraction of atoms
within a block of wood are called internal forces. Something pushing the wood
11. Gravity acts directly on the mass of a body - this is a very important experimental
observation due to Newton and does not follow from F = ma. So a body of mass
m1 experiences a force F1 = m1 g while a body of mass m2 experiences a force
F2 = m2 g , where g is the acceleration with which any body (big or small) falls under
the influence of gravity. (Galileo had established this important fact when he dropped
different masses from the famous leaning tower of Pisa!)
12. The weight of a body W is the force which gravity exerts upon it, W = mg. Mass and
weight are two completely different quantities. So, for example, if you used a spring
balance to weigh a kilo of grapes on earth, the same grapes would weigh only 1/7 kilo
on the moon.
13. Newton's Third Law: for every action there is an equal and opposite reaction. More
precisely, FAB = − FBA , where FAB is the force exerted by body B upon A whereas FBA
is the force exerted by body A upon B. Ask yourself what would happen if this was not
true. In that case, a system of two bodies, even if it is completely isolated from the
surroundings, would have a net force acting upon it because the net force acting upon
both bodies would be FAB + FBA ≠ 0.
14. If action and reaction are always equal, then why does a body accelerate at all? Students
are often confused by this. The answer: in considering the acceleration of a body you must
consider only the (net) force acting upon that body. So, for example, the earth pulls a stone
towards it and causes it to accelerate because there is a net force acting upon the stone. On
the other hand, by the Third Law, the stone also pulls the earth towards it and this causes the
earth to accelerate towards the stone. However, because the mass of the earth is so large, we
are only able to see the acceleration of the stone and not that of the earth.
Q.2 Two astronauts are in outer space connected with a rope in a state of complete
weightlessness. Each pulls one end of a rope with his hands. Describe what will
happen as time goes on.
r
Q.3 Add together the forces F1 = 3xˆ + 5 yˆ − zˆ and F2 = −2 xˆ − 3 yˆ + 2 zˆ and obtain the
magnitude of the resultant force.
Q.4 A cube of constant density ρ is pushed with a pressure P from one side. The cube
is place on a smooth level surface. Find the acceleration of the cube, and the distance
covered after time t.
Q.5 Describe all the forces acting upon a ladder that is leaning against a wall. Both
the floor and the wall are rough.
Q.6 In the above situation, suppose the floor and wall suddenly become perfectly smooth.
What will the net force on the ladder become, and what will be the acceleration in the
horizontal and vertical directions?
3. Examples of systems out of equilibrium: a stone thrown upwards that is at its highest
point; a plane diving downwards; a car at rest whose driver has just stepped on the
car's accelerator.
4. If you know the acceleration of a body, it is easy to find the force that causes it to
accelerate. Example: An aircraft of mass m has position vector,
r
r = (at + bt 3 )iˆ + (ct 2 + dt 4 ) ˆj
What force is acting upon it?
r d 2x d2y
SOLUTION: F = m 2 iˆ + m 2 ˆj
dt dt
= 6 b mt iˆ + m(2 c + 12d t 2 ) ˆj
5. The other way around is not so simple: suppose that you know F and you want to
find x. For this you must solve the equation,
d 2x F
=
dt 2 m
This may or may not be easy, depending upon F (which may depend upon both x as
well as t if the force is not constant).
6. Ropes are useful because you can pull from a distance to change the direction of a
force. The tension, often denoted by T , is the force you would feel if you cut the
rope and grabbed the ends. For a massless rope (which may be a very good
approximation in many situations) the tension is the same at every point along the
rope. Why? Because if you take any small slice of the rope it weighs nothing (or
very little). So if the force on one side of the slice was any different from the force
force on the other side, it would be accelerating hugely. All this was for the "ideal
rope" which has no mass and never breaks. But this idealization if often good enough.
F m1 T T m2
We want to find the tension and acceleration: The total force on the first mass is F − T
and so F − T = m1a. The force on the second mass is simply T and so T = m2 a. Solving the
m2 F F
above, we get: T = and a = .
m1 + m2 m1 + m2
10. There is a general principle by which you solve equilibrium problems. For equilibrium,
the sum of forces in every direction must vanish. So Fx = Fy = Fz = 0. You may always
choose the x, y, z directions according to your convenience. So, for example, as in the
lecture problem dealing with a body sliding down an inclined plane, you can choose the
directions to be along and perpendicular to the surface of the plane.
Q.3 In (9), a frictionless surface was considered. Now assume that the coefficient of
friction is µ. What will be the acceleration and the tension now. Assume that the
body is pulled with sufficient force so that it moves.
M g sin α
Mg cos α
α
Mg
1. As a body moves through a body it displaces the fluid. it has to exert a force
on the fluid to push it out of the way. By Newton's third law, the fluid pushes
back on the body with an equal and opposite force. The direction of the fluid
resistance force on a body is always opposite to the direction of the body's velocity
relative to the fluid.
2. The magnitude of the fluid resistance force usually increases with the speed of the
body through the fluid. Typically, f = kv (an empirical law!). Imagine that you
drop a ball bearing into a deep container filled with oil. After a while the ball bearing
will approach its maximum (terminal) speed when the forces of gravity and friction
balance each other: mg = kv from which vfinal = mg / k .
3. The above was a simple example of equilibrium under two forces. In general, while
solving problems you should a)draw a diagram, b)define an origin for a system of
coordinates, c)identify all forces (tension, normal, friction, weight, etc) and their x and
y components, d)Apply Newton's law separately along the x and y axes. e) find the
accelerations, then velocities, then displacements. This sounds very cook-book, and in
fact it will occur to you naturally how to do this while solving actual problems.
4. Your weight in a lift: suppose you are in a lift that is at rest or moving at constant
velocity. In either case a=0 and the normal force N and the force due to gravity are
exactly equal, N − Mg = 0 ⇒ N = Mg. But if the lift is accelerating downwards then
Mg − N = Ma or N = M ( g − a) . So now the normal force (i.e. the force with which the
floor of the lift is pushing on you) is decreased. Note that if the lift is accelerating
downwards with acceleration a (which it will if the cable breaks!) then N=0 and you
will experience weightlessness just like astronauts in space do. Finally, if the lift is
accelerating upwards then a is negative and you will feel heavier.
5. Imagine that you are in a railway wagon and want to know how much you are
accelerating. You are not able to look out of the windows. A mass is hung from the
roof. Find the acceleration of the car from the angle made by the mass.
6. Friction is a funny kind of force. It doe not make up its mind which way to act until
some other force compels it to decide. Imagine a block lying on the floor. If you push
it forward, friction will act backward. And if you push it to the left, friction will act
to the right. In other words, the direction of the frictional force is always in the
opposite direction to the applied force.
7. Let us solve the following problem: a rope of total length L and mass per unit length m
is put on a table with a length l hanging from one edge. What should be l such that the
rope just begins to slip? N
(L − l )
l µN ml g
m( L − l ) g
To solve this, look at the balance of forces in the diagram below: in the vertical direction,
the normal force balances the weight of that part of the rope that lies on the table:
N = m( L − l ) g . In the horizontal direction, the rope exerts a force mlg to the right, which
is counteracted by the friction that acts to the left. Therefore µ N = mlg . Substituting N
µL
from the first equation we find that l = . Note that if µ isvery small then even a small
µ +1
piece of string that hangs over the edge will cause the entire string to slip down.
Q.2 For a particular liquid and a certain shape of body the relation between the frictional
resistance and velocity is of the form F = av + bv 2 .
a) What are the dimensions of a and b?
b) If a body of mass M is dropped into a deep vessel containing the liquid, what is the
maximum speed that it will attain?
c) What will be the initial acceleration just as it enters the liquid?
θ
r
d
3. Suppose you lift a mass of 20 kg through a distance of 2 metres. Then the work you
do is 20 kg × 9.8 Newtons × 2 metres = 39.2 Joules. On the other hand, the force
of gravity is directed opposite to the force you exert and the work done by gravity
is - 39.2 Joules.
4. What if the force varies with distance (say, a spring pulls harder as it becomes
longer). In that case, we should break up the distance over which the force acts into
small pieces so that the force is approximately constant over each bit. As we make the
pieces smaller and smaller, we will approach the exact result:
0 ∆x x→
Now add up all the little pieces of work:
N
W = ∆W1 + ∆W2 + ⋅ ⋅ ⋅ + ∆WN = F1∆x + F2 ∆x + ⋅ ⋅ ⋅ + FN ∆x ≡ ∑ Fn ∆x
n =1
∞
To get the exact result let ∆x → 0 and the number of intervals N → ∞ : W = lim ∑ Fn ∆x
∆x → 0
n =1
xf
∞
Definition: W = lim ∑ Fn ∆x ≡ ∫ F ( x)dx is called the integral of F with respect to x from
∆x → 0
n =1 xi
5. Just to check what our result looks like for a constant force, let us calculate W if F = b,
F
F =b
1 2 3 4
0 ∆x = a / 4 a x
a
1 1 1 1
a(b) + a(b) + a(b) + a(b) = ab ∴ ∫ Fdx = ab
4 4 4 4 0
6. Now for a less trivial case: suppose that F=kx, i.e. the force increases linearly with x.
F ( a, k a )
4
3
2
1
0 a x→
∆x = a / 4
a
1 a2 a2
Area of shaded region = (a )(ka ) = k
2 2
∴ ∫0 Fdx = k 2
8. Accepting the fact that energy is conserved, let us derive an expression for the kinetic
energy of a body. Suppose a constant force accelerates a mass m from speed 0 to speed
v over a distance d. What is the work done by the force? Obviously the answer is:
mv 2 1
W = Fd . But F = ma and v = 2ad . This gives W = (ma)d =
2
d = mv 2 . So, we
2d 2
conclude that the work done by the force has gone into creating kinetic energy. and that
the amount of kinetic energy possessed by a body moving with speed v is 1
2 mv 2 .
9. The work done by a force is just the force multiplied by the distance – it does not
depend upon time. But suppose that the same amount of work is done in half the time.
We then say that the power is twice as much. We define:
Work F ∆x
If the force does not depend on time: = = F v. Therefore, Power = F v.
Time ∆t
10.Let's work out an example. A constant force accelerates a bus (mass m) from speed v1
to speed v 2 over a distance d. What work is done by the engine?
Recall that for constant acceleration, v22 − v12 = 2a ( x2 − x1 ) where: v2 = final velocity,
v 22 − v12
x 2 = final position, v1 = initial velocity, x1 = initial position. Hence, a = . Now
2d
v 22 − v12 1 1
calculate the work done: W = Fd = mad = m d = m v 2 2 − m v12 . So the
2d 2 2
1
work done has resulted in an increase in the quantity m v 2 , which is kinetic energy.
2
Q.1 A stone tied to a string is whirled around. Suppose the string has tension T. How
much work will be done when the stone goes around one complete revolution?
Q.2 In (6) above, calculate the areas 1,2,3,4 separately and then add them up. Is your
a2
answer equal to k ?
2
Q.3 A 1000 kg trolley is pulled up a 45° inclined plane at 1.5 m/sec. How much power is
is needed?
Q. 4 In the previous question, if the coefficient of friction is µ = 0.5 what will be the
power needed now?
Q.5 Suppose the air friction acting on a car increases as kv2. What is the engine power
needed to keep the car moving at speed v?
Q.6 In the example solved in (10) above, calculate the power of the bus engine.
1. Potential energy is, as the word suggests, the energy “locked up” up somewhere and
which can do work. Potential energy kam karnay ki salahiat hai! Potential energy can
be converted into kinetic energy, 12 mv 2 . As I showed you earlier, this follows directly
from Newton’s Laws.
2. If you lift a stone of mass m from the ground up a distance x, you have to do work against
gravity. The (constant) force is mg , and so W = mgx. By conservation of energy,
the work done by you was transformed into gravitational potential energy whose
values is exactly equal to mgx. Where is the energy stored? Answer: it is stored neither
in the mass or in the earth - it is stored in the gravitational field of the combined system
of stone+earth.
3. Suppose you pull on a spring and stretch it by an amount x away from its normal
(equilibrium) position. How much energy is stored in the spring? Obviously, the
spring gets harder and harder to pull as it becomes longer. When it is extended by
length x and you pull it a further distance dx, the small amount of work done is
dW = Fdx = kxdx. Adding up all the small bits of work gives the total work:
x x
1 2
W = ∫ Fdx = ∫ k xdx = kx
0 0
2
This is the work you did. Maybe you got tired working so hard. What was the result
of your working so hard? Answer: this work was transformed into energy stored in the
spring. The spring contains energy exactly equal to 1
2 k x2 .
4. Kinetic energy obviously depends on the frame you choose to measure it in. If you are
running with a ball, it has zero kinetic energy with respect to you. But someone who is
standing will see that it has kinetic energy! Now consider the following situation: a box
of mass 12kg is pushed with a constant force so that so that its speed goes from zero to
1.5m/sec (as measured by the person at rest on the cart) and it covers a distance of 2.4m.
Assume there is no friction.
v =1.5m/ s
2.4m
15m / s
mass of box = 12 kg
5. Now suppose there is somebody standing on the ground, and that the trolley moves at
15 m/sec relative to the ground:
15 m / s 16.5m / s
50.4m
15m / s
48.0m
6. The total mechanical energy is: Emech = KE + PE. If there is no friction then Emech is
conserved. This means that the sum does not change with time. For example: a ball
is thrown upwards at speed v0 . How high will it go before it stops? The loss of potential
1 2 v2
energy is equal to the gain of potential energy. Hence, mv0 = mg h ⇒ h = 0 .
2 2g
Now look at the smooth, frictionless motion of a car over the hills below:
A v0
B
C
h h
h/2
D
1. If energy is conserved, then why do you get tired simply standing in one place?
2. Suppose that the motion of the earth around the sun suddenly stopped. Using the
conservation of energy, discuss what will happen to the earth after that.
3. In point 5 above, find the work done by the force (as seen by the ground observer)
and show that it is equal to 284 J. So, even though work and energy are different in
different frames, the law of conservation of energy holds in every frame.
1. Momentum is the "quantity of motion" possessed by a body. More precisely, it is defined as:
Mass of the body × Velocity of the body.
The dimensions of momentum are MLT -1 and the units of momentum are kg-m/s.
r
2. Momentum is a vector quantity and has both magnitude and direction, p = mv. We can
easily see that Newton's Second Law can be reexpressed in terms of momentum. When
r
r r dv r
I wrote it down originally, it was in the form ma = F . But since = a, this can also be
dt
r
dp r
written as = F (new form). In words, the rate of change of momentum of a body equals
dt
the total force acting upon it. Of course, the old and new are exactly the same,
r r r
d p d (m v) dv r r
= =m = ma = F.
dt dt dt
r
3. When there are many particles, then the total momentum P is,
r r r r
P = p1 + p 2 + ⋅ ⋅ ⋅ p N
r r r
d r d p1 d p 2 dp
P= + + ⋅⋅⋅ N
dt dt dt dt
r r r r
= F1 + F2 + ⋅ ⋅ ⋅ FN = F
This shows that when there are several particles, the rate at which the total momentum
changes is equal to the total force. It makes sense!
4. A very important conclusion of the above is that if the sum of the total external forces
r d r
vanishes, then the total momentum is conserved, ∑ Fext = 0 ⇒ P = 0. This is quite
dt
independent of what sort of forces act between the bodies - electric, gravitational, etc. -
or how complicated these are. We shall see why this is so important from the following
examples.
5. Two balls, which can only move along a straight line, collide with each other. The initial
momentum is Pi = m1u1 + m2u2 and the final momentum is Pf = m1v1 + m2 v 2 . Obviously one
ball exerts a force on the other when they collide, so its momentum changes. But, from the
fact that there is no external force acting on the balls, Pi = Pf , or m1u1 + m2u2 = m1v1 + m2 v 2 .
6. A bomb at rest explodes into two fragments. Before the explosion the total momentum is
8. When you hit your thumb with a hammer it hurts, doesn't it? Why? Because a large
amount of momentum has been destroyed in a short amount of time. If you wrap your
thumb with foam, it will hurt less. To understand this better, remember that force is the
dp
rate of change of momentum: F = ⇒ dp = Fdt. Now define the impulse I as:
dt
force × time over which the force acts.
If the force changes with time between the limits , then one should define I as,
t2 t2 pf
equals the impulse, which is equal to the area under the curve of force versus time. Even
if you wrap your thumb in foam, the impulse is the same. But the force is definitely not!
Q.1 A stream of bullets, each of mass m, is fired horizontally with a speed v into a large
wooden block of mass M that is at rest on a horizontal table. If the coefficient of
friction is µ , how many bullets must be fired per second so that the block just begins
to move? [Hint, calculate the momentum destroyed in one second.]
Q. 2 In point 5, we have one equation but two unknowns v1 ,v 2 . Obviously more information
needs to be supplied. So consider two extreme cases. In both cases, find the final
velocities:
a) The collision is perfectly elastic, meaning that the sum of initial kinetic energies is
exactly equal to the sum of final kinetic energies.
b) The collision results in the two bodies sticking together and moving off as one body.
Q.3 a) Would you rather land with your legs bending or stiff ?
b) Why do cricket fielders move their hands backwards when catching a fast ball?
c) Why do railway carriages have dampers at the front and back?
Q.4 A rocket of mass M 0 is at rest in space. Then at t = 0 it starts to eject hot gas at speed
v from the nozzle and the mass of the rocket is m(t ) = M 0 - α t . Find the acceleration
of the rocet at time t = 0.
Q.5 Sand drops onto an open railway carriage at ρ kg/sec. If the engine pulling it is working
at power P0 watts at time t = 0, find the additional power of the engine pulling it such
that the speed of the train remans constant at v0 .
1. Collisions are extremely important to understand because they happen all the time -
electrons collide with atoms, a bat with a ball, cars with trucks, star galaxies with other
galaxies,...In every case, the sum of the initial momenta equals the sum of the final
momenta. This follows directly from Newton's Second Law, as we have already seen.
2. Take the simplest collision: two bodies of mass m1 and m2 moving with velocities u1 and u2 .
After the collision they are moving with velocities v1 and v 2 . From momentum conservation,
m1u1 + m2u2 = m1v1 + m2 v 2 ⇒ m1 (u1 − v1 ) = m2 (v 2 − u2 )
This is as far as we can go. There are two unknowns but only one equation. However, if the
collision is elastic then,
1
2
m1u12 + 12 m2u22 = 1
2
m1v12 + 12 m2 v 22 ⇒ 12 m1 (u12 − v12 ) = 12 m2 (v 2 2 − u2 2 ).
Combine the two equations above,
u1 + v1 = v 2 + u2 ⇒ u1 − u2 = v 2 − v1.
In words, this says that in an elastic collision the relative speed of the incoming particles
equals the relative speed of the outgoing particles.
3. One can solve for v1 and v 2 (please do it!) easily and find that:
m1 − m 2 2m 2
v1 = ( )u1 + ( )u2
m1 + m 2 m1 + m 2
2m1 m 2 − m1
v2 = ( )u1 + ( )u2
m1 + m 2 m1 + m 2
Notice that if m1 = m2 , then v1 = u2 and v 2 = u1. So this says that after the collision, the
bodies will just reverse their velocities and move on as before.
4. What if one of the bodies is much heavier than the other body, and the heavier body is at
rest? In this case, m2 >> m1 and u2 = 0. We can immediately see that v1 = −u1 and v 2 = 0.
This makes a lot of sense: the heavy body continues to stay at rest and the light body just
bounces back with the same speed. In the lecture, you saw a demonstration of this!
5. And what if the lighter body (rickshaw) is at rest and is hit by the heavier body (truck)? In
this case, m2 << m1 and u2 = 0. From the above equation we see that v1 = u1 and v 2 = 2u1.
So the truck's speed is unaffected, but the poor rickshaw is thrust in the direction of the
truck at twice the truck's speed!
6. Sometimes we wish to slow down particles by making them collide with other particles.
7. A bullet with mass m, is fired into a block of wood with mass M, suspended like a
pendulum and makes a completely inelastic collision with it. After the impact, the
block swings up to a maximum height y. What is the initial speed of the bullet?
M +m
m
v M
Solution:
By conservation of momentum in the direction of the bullet,
(m + M )
mv = (m + M )V , ⇒ v = V
m
The block goes up by distance y, and so gains potential energy. Now we can use the
1
conservation of energy to give, (m + M )V 2 = (m + M ) gy, where V is the velocity
2
acquired by the block+bullet in the upward direction just after the bullet strikes. Now
(m + M )
use V = 2 gy . So finally, the speed of the bullet is: v = 2 gy .
m
9. What happens to energy in an inelastic collision? Let's say that one body smashes into
another body and breaks it into 20 pieces. To create 20 pieces requires doing work
against the intermolecular forces, and the initial kinetic energy is used up for this.
Q.1 Give two examples of nearly elastic collisions, and two examples of nearly inelastic
collisions that are not from the lecture.
Q.2 In the lecture demonstration you saw that there are 3 balls of equal mass placed in a
straight line. The first strikes the second and comes to rest. The second moves and hits
the third and comes to rest. The third one then moves off alone.
a) What would happen if the second one was twice the mass of the first or the third,
and if the collision was elastic?
b) What would happen if the second was infinitely heavy, and the collision was elastic?
c) What would happen if the collision of the first and second resulted in these two
getting stuck to each other? With what speed would they strike the third one?
Q.3 Suppose a ball loses 10% of its energy when it is bounced off the ground. If it dropped
from a height h initially, find the maximum height it reaches after
a) The first bounce
b) The second bounce
c) The n ' th bounce
Q.4 In the example in point 7 above, suppose that the bullet bounces back with speed v/2
instead of lodging itself into the wooden block. In that case, calculate how high the
block will rise.
Q.5 The figure shows an elastic collision of two bodies A,B on a frictionless table. A has
mass mA = 0.5 kg, and B has mass mB = 0.3 kg. A has an initial velocity of 4 m/s in the
the positive x-direction and a final velocity of 2 m/s in an unknown direction. B is
initially at rest. Find the final velocity of B and the angles in the figure.
v2
y
mB φ2
u
φ1 x
mA
v1
3. Suppose that there is a particle located at the tip of the radius vector. Now we wish to
describe the rotational kinematics of this particle, i.e. describe its motion as goes around
the circle. So, suppose that the particle moves from angle φ1 to φ2 in time t2 − t1. Then, the
φ2 − φ1 ∆φ
average angular speed ω is defined as, ω = = . Suppose that we look at ω
t2 − t1 ∆t
∆φ dφ
over a very short time. Then, ω = lim = is called the instantaneous angular speed.
∆t → 0 ∆t dt
4. To familarize ourselves with the notion of angular speed, let us compute ω for a clock
second, minute and hour hands:
2π
ωsecond = = 0.105 rad / s,
60
2π
ωminute = = 1.75 × 10−3 rad / s,
60 × 60
2π
ωhour = = 1.45 × 10−4 rad / s.
60 × 60 × 12
5. Just as we defined accelaration for linear motion, we also define acceleration for
circular motion:
ω − ω1 ∆ω
α≡ 2 = (average angular speed)
t2 − t1 ∆t
∆ω dω d dφ d 2φ
Hence, α = lim becomes α = = = (angular acceleration). Let us
∆t → 0 ∆t dt dt dt dt 2
see what this means for the speed with which a particle goes around. Now use s = rφ .
ds dφ
Differentiate with respect to time t: = r . The rate of change of arc length s is clearly
dt dt
Why are they almost identical even though they describe two totally different physical
situations. Answer: because the mathematics is identical!
7. The angular speed of a car engine is increased from 1170 rev/min to 2880 rev/min in
12.6 s. a)Find the average angular acceleration in rev/min 2 . (b) How many revolutions
does the engine make during this time?
SOLUTION: this is a straightforward application of the formulae in point 5 above.
ω f − ωi 1
α= = 8140 rev/min 2 , φ = ωi t + α t 2 = 425rev.
t 2
rC
rA
A B C
SOLUTION: Obviously every part of the chain moves with the same speed and so
rω
v A = vC . Hence rAω A = rCωC ⇒ ω A = C C . From the definition of acceleration,
rA
ωA − 0 ω A rCωC
α= . From this, t = = = 16.4 s.
t α rAα
9. Imagine a disc going around. All particles on the disc will have same 'ω ' and 'α '
but different 'v ' and 'a' . Clearly 'ω ' and 'α ' are simpler choices !!
v2
v1
P2
r
θ P1
v 2 − v1
C r v1
v2
12. Vector Cross Products: The vector crossproduct of two vectors is defined as:
r r r r
A × B = AB sin θ nˆ where nˆ is a unit vector that is perpendicular to both A and B.
∧ ∧ ∧ ∧ ∧ ∧ ∧ ∧ ∧
Apply this definition to unit vectors in 3-dimensions: i × j = k , k × i = j , j× k = i .
14. The cross product is only definable in 3 dimensions and has no meaning in 2-d. This is
unlike the dot product which as a meaning in any number of dimensions.
Q.2 The Moon revolves about the Earth, making a complete revolution in 27.3 days. Assume
that the orbit is circular and has a radius of 238,000 miles. What is the magnitude of the
acceleration of the Moon towards the Earth?
Q.3 When a car engine is turned off, the angular speed of the flywheel decreases linearly
with time from 1200 rev/min to zero.
(a) Find the average angular acceleration.
(b) How many revolutions does the flywheel make before topping?
Q.4 Calculate the acceleration of a car that is travelling at a constant speed of 100 km per
hour on a straight road. The radius R of the Earth is 6370 km. Why do we not feel this
acceleration?
Q.5 A boy ties a stone to a string and whirls it around. Just as the stone reaches the highest
point, the string breaks. What will be the values of aT and aR at the very moment?
What will be the path of the stone before it hits the ground?
1. A body is made of a collection of particles. We would like to think of this body having
r r
r m1r1 + m2 r2
a "centre". For two masses the "centre of mass" is defined as: rcm ≡ .
r m m 1 + m2
2
r2
r m1
r1
2. Example: one mass is placed at x = 2cm and a second mass, equal to the first, is placed
at x = 6cm. The cm position lies halfway between the two as you can see from:
mx + mx2 2m + 6m
xcm = 1 = = 4cm.
m+m 2m
Note that there is no physical body that is actually located at xcm = 4cm! So the centre
of mass can actually be a point where there is no matter. Now suppose that the first
mass is three times bigger than the first:
(3m) x1 + mx2 2(3m) + 6m
xcm = = = 3cm
3m + m 4m
This shows that the cm lies closer to the heavier body. This is always true.
3. For N masses the obvious generalization of the centre of mass position is the following:
r r r
r m1r1 + m2 r2 + ⋅ ⋅ ⋅ ⋅ + mN rN 1 ⎛ n= N r ⎞
rcm = = ⎜ ∑ mn rn ⎟ .
m1 + m2 + ⋅ ⋅ ⋅ ⋅ + mN M ⎝ n =1 ⎠
r r r
In words, this says that the following: choose any origin and draw vectors r1 , r2 , ⋅ ⋅ ⋅ rN
that connect to the masses m1 , m2 , ⋅ ⋅ ⋅mN . Heavier masses get more importance in the sum.
r
r m2 r2 r
So suppose that m2 is much larger than any of the others. If so, rcm ≈ = r2 . Hence, the
m2
cm is very close to the position vector of m2 .
4. For symmetrical objects, it is easy to see where the cm position lies: for a sphere or circle
it lies at the centre; for a cylinder it is on the axis halfway between the two faces, etc.
5. Our definition of the cm allows Newton's Second Law to be written for entire collection
6. Consider rotational motion now for a rigid system of N particles. Rigid means that
all particles have a fixed distance from the origin. The kinetic energy is,
1 1 1
K=
m1v12 + m2 v 22 + m3 v32 + ⋅ ⋅ ⋅ ⋅
2 2 2
1 1 1
= m1r12ω 2 + m2 r22ω 2 + m3r32ω 2 + ⋅ ⋅ ⋅ ⋅
2 2 2
1
= ∑ mi ri 2 ω 2
2
( )
Now suppose that we define the "moment of inertia" I ≡ ∑ mi ri 2 . Then clearly the
1
kinetic energy is K = 1 I ω 2 . How similar this is to K = Mv 2 !
2 2
7. To familiarize ourselves with I, let us consider the following: Two particles m1 and m2
are connected by a light rigid rod of length L. Neglecting the mass of the rod, find the
rotational inertia I of this system about an axis perpendicular to the rod and at a
distance x from m1.
x
m1 m2
L
Answer: I = m1 x 2 + m2 ( L − x ) . Of course, this was quite trivial. Now we can ask
2
a more interesting question: For what x is I the largest? Now, near a maximum, the
dI
slope of a function is zero. So calculate and then put it equal to zero:
dx
dI m2 L
∴ = 2m1 x − 2m2 ( L − x ) = 0 ⇒ xmax = .
dx m1 + m2
9. A simple application: suppose there is a hoop with mass distributed uniformly over it.
The moment of inertia is: I = ∫ r 2 dm = R 2 ∫ dm = MR 2 .
10. A less trivial application: instead of a hoop as above, now consider a solid plate:
I = ∫ r 2 dm (dm = 2π rdr ρ0 )
R
1 1
= ∫ 2π r 3dr ρ0 = (π R 2 ρ0 ) R 2 = M R 2
0
2 2
11. You have seen that it is easier to turn things (e.g. a nut, when changing a car's tyre
after a puncture) when the applied force acts at a greater distance. This is because
r r r
the torque τ is greater. We define τ = r × F from the magnitude is τ = r F sin θ .
Here θ is the angle between the radius vector and the force.
r r
12. Remember that when a force F acts through a distance dr it does an amount of work
r r
equal to F ⋅ dr . Now let us ask how much work is done when a torque acts through a
certain angle as in the diagram below:
z r r
dW = F ⋅ ds
y
dφ r
x F r
ds
13.A comparison between linear and rotational motion quantities and formulae:
LINEAR ROTATIONAL
x, M φ, I
dx dφ
v= ω=
dt dt
dv dω
a= α=
dt dt
F = Ma τ = Iα
1 1
K= Mv 2 K = Iω 2
2 2
W = ∫ Fdx W = ∫ τ dφ
14. Rotational and translational motion can occur simultaneously. For example a car's
wheel rotates and translates. In this case the total kinetic energy is clearly the sum of
1 1
the energies of the two motions: K = M vcm 2
+ I cm ω 2 .
2 2
15. It will take a little work to prove the following fact that I simply stated above: for a
system of N particles, the total kinetic energy divides up neatly into the kinetic energy
r r
of rotation and translation. Start with the expression for kinetic energy and write vcm + v′i
r
where v′i is the velocity of a particle with respect to the cm frame,
1 1 r r
K = ∑ mi vi 2 = ∑ mi vi ⋅ vi
2 2
1 r r r r
= ∑ mi ( vcm + v′i ) ⋅ ( vcm + v′i )
2
1
= ∑ mi vcm
2
2
(
r r
+2v cm .v′i + v′i 2 )
Q.1 A triangle has 3 equal masses placed at each of its vertices. Locate the centre of mass
if: a)All 3 angles are equal, b)Two sides are equal and there is one right angle.
Q.2 Suppose that two concentric rings, each with equal mass per unit length are joined
so that they rotate together. Find the moment of inertia.
Q.3 A wheel of mass M, radius R, and moment of inertia I is on a surface and moves
towards the right as shown.
v cm
a) What is the kinetic energy if the wheel slips and does not rotate?
b) What is the kinetic energy if it does not slip? Write your answer only in terms of the
quantities specified above.
Q.4 Find the moment of inertia of a solid cylinder that rotates about the (long) axis of
symmetry.
2. Let us use this definition to calculate the angular momentum of a projectile thrown from
the ground at an angle θ . Obviously, initial angular momentum is zero (why?).
vy
v
v0
y vx
θ
O x
We know what the projectile's coordinates will be at time t after launch,
1
x = ( v0 cosθ ) t , y = ( v0 sin θ ) t − gt 2
2
as well as the velocity components,
v x = v0 cos θ , v y = v0 sin θ − gt.
r r r
( ) ( )
Hence, L = r × p = x iˆ + y ˆj × v x iˆ + v y ˆj m = m ( x v x − y v y ) v x kˆ
⎛1 ⎞ m
= m ⎜ gt 2 v0 cosθ − gt 2 v0 cosθ ⎟ kˆ = − gt 2 v0 cosθ kˆ.
⎝2 ⎠ 2
In the above, kˆ = iˆ × ˆj is a unit vector perpendicular to the paper. You can see here
that the angular momentum increases as t 2 .
2. Momentum changes because a force makes it change. What makes angular momentum
r r r
change? Answer: torque. Here is the definition again: τ = r × F . Now let us establish a
very important relation between torque and rate of change of L.
Begin:
r r r r r r r r r
L = r × p. At a slightly later time, L + ∆ L = ( r + ∆ r ) × ( p + ∆p )
r r r r r r r r
= r × p + r × ∆p + ∆ r × p + ∆ r × ∆ p
r r r r r r r
r r r r r ∆L r × ∆ p + ∆ r × p r ∆ p ∆ r r
By subtracting, ∆L = r × ∆p + ∆r × p = =r× + × p.
∆t ∆t ∆t ∆t
∆φ
L
∆L
θ
O O
4. Now consider the case of many particles. Choose any origin with particles moving with
respect to it. We want to write down the total angular momentum,
N
L = L1 + L2 + ⋅ ⋅ ⋅ + LN = ∑ Ln
n =1
N
dL dL1 dL2 dL dL
= + + ⋅⋅⋅ + N = ∑ n
dt dt dt dt n =1 dt
d Ln dL N
Since = τ n , it follows that = ∑τ n . Thus the time rate of change of the total
dt dt n =1
angular momentum of a system of particles equals the net torque acting on the system.
I showed earlier that internal forces cancel. So also do internal torques, as we shall see.
⎛ MR 2 2 ⎞ 1
Hence, ω f = ωi ⎜ × ⎟ = ωi .
⎝ 2 3MR 2 ⎠ 3
7. You shoud be aware of the similarities and differences between the equations for linear
r
r d pr r dL
and rotational motion: F = ⇔τ = . One big difference is that for momentum,
dt dt
r r r
p = m v, it does not matter where you pick the origin. But L definitely depends on the
r r r r r
choice of the origin. So changing r to c + r changes L to L′ :
r r r r r r r r r
L′ = r ′ × p = ( c + r ) × p = c × p + L.
Q1. A car goes along a section of road shaped like the letter S. The speed of the
car stays constant. Discuss how the acceleration changes, and make a rough
plot.
Q2. All bodies dropped together fall to the ground in the same amount of time. But if
two spheres roll down an inclined plane, one may take more time than the other.
Explain why.
Q3. A solid cylinder of mass M, radius R, and moment of inertia I rolls without
slipping down an inclined plane of length L and height h. Find the speed of its
centre of mass when the cylinder reaches the bottom.
Q4. If the radius of the earth, assumed to be a perfect sphere, suddenly shrinks to half
its present value, the mass of the Earth remaining unchanged, what will be the
duration of one day?
Q5. A uniform solid cylinder of radius R = 12cm and mass M = 3.2kg is given an
initial clockwise angular velocity ω0 of 15rev/s and then lowered on to a flat
horizontal surface. The coefficient of kinetic friction between the surface and
the cylinder is µ = 0.21. Initially, the cylinder slips as it moves along the surface,
but after a time t pure rolling without slipping begins.
(a) What is the velocity vcm?
(b) What is the value of t ?
[Hint: find the acceleration, and hence the force. When slipping stops, the frictional
force produces acceleration]
1. A rigid body is one where all parts of the body are fixed relative to each other (for
example, a pencil). Fluids and gases are non-rigid.
2. The translational motion of the centre of mass of a rigid body is governed by:
r
dP r r r
= F where F = ∑ Fext is the net external force.
dt
r
dL r r r
Similarly, for rotational motion, = τ where τ = ∑τ ext is the net external torque.
dt
r
3. A rigid body is in mechanical equilibrium if both
r the linearrmomentum P and angular
r dP dL
momentum L have a constant value. i.e., = 0 and = 0. Static equilibrium
dt dt
r r
refers to P = 0 and L = 0.
We want to find the forces F1 and F2 with which the supports push on the rod
in the upwards direction. First, balance forces in the vertical y direction:
∑F y = F1 + F2 − Mg − mg = 0
Now demand that the total torque vanishes:
∑τ y = ( F1 )( 0 ) + ( F2 )( L ) − ( Mg )( L / 4 ) − ( mg )( L / 2 ) = 0
From these two conditions you can solve for F1 and F2 ,
F1 =
( 3M + 2m ) g , and F2 =
( M + 2m ) g .
4 4
5. Angular momentum and torque depend on where you choose the origin of your
coordinates. However, I shall now prove that for a body in equilibrium, the choice of
origin does not matter. Let's start with the origin O and calculate the torque about O,
r r r r r r r r r r
τ O = τ 1 + τ 2 + ⋅ ⋅ ⋅ + τ N = r1 × F1 + r2 × F2 + ⋅ ⋅ ⋅ + rN × FN
Now, if we take a second point P, then all distances will be measured from P and each
L L
x x
mg m1 g m2 g mg
Taking the torques about the knife edge in the two cases, we have:
mg x = m1 g ( L − x ) and m2 g x = mg ( L − x )
m m1
⇒ = or m = m1m2 .
m2 m
Remarkably, we do not need the values of x or L.
7. Centre of Gravity. The centre of gravity is the average location of the weight of an
object. This is not quite the same as the centre of mass of a body (see lecture 12) but
r
suppose the gravitational acceleration g has the same value at all points of a body.
r
Then: 1) The weight is equal to M g , and 2)the centre of gravity coincides with
the centre of mass. Remember that weight is force, so the CG is really the centre of
gravitational force acting on the body. The net force on the whole body = sum of forces
r r r
over all individual particles, ∑ F = ∑ mi g . If g
y mi g
has the same value at all points of the body, then
r r r CG
∑ = g ∑ mi = M g. So the net torque about
F
r r r r r ri
the origin O is ∑τ = ∑ ( ri × mi g ) = ∑ ( mi ri × g )
r r r r r rCG
Hence, ∑τ = M rcm × g = rcm × M g . So the torque Mg
O
due to gravity about the centre of mass of a body x
r z
(i.e. at rcm = 0 ) is zero !!
+
CM
− Wx + (T2 cos φ ) L = 0 ⇒ x =
(T2 cos φ ) L =
L cos φ
W sin (θ + φ )
10.Here is another problem of the same kind: find the least angle θ at which the rod can lean
to the horizontal without slipping.
µ1R1
Solution : Considering the translational equilibrium of
the rod, R1 = µ2 R2 and R2 + µ1R1 = W . This gives, A R1
W C
R2 = . Now consider rotational equilibrium
(1 + µ1µ2 ) R2
about the point A: R2 × OB = W × OD + µ2 R2 × OA
θ
D
AB cosθ O
or, R2 × AB cosθ = W × + µ 2 R2 × AB sin θ .
2
W W
µ2 R2
B
R2 −
⎛ W⎞ 2 with R = W
This gives cosθ ⎜ R2 − ⎟ = µ2 R2 sin θ from which tanθ = .
⎝ 2⎠ µ2 R2 2
(1 + µ1µ2 )
1-µ1µ2
Using this value of R2 , we get tanθ = .
2 µ2
10. Types of Equilibrium. In the lecture you heard about:
a)Stable equilibrium: object returns to its original position if displaced slightly.
b)Unstable equilibrium: object moves farther away from its original position if
displaced slightly.
c)Neutral equilibrium: object stays in its new position if displaced slightly.
Q.1 Give three examples of each of the following that are not given in either the
lecture or these notes:
a) Static equilibrium
b) Dynamic equilibrium
c) Stable equilibrium
d) Neutral equilibrium
e) Unstable equilibrium
Q.2 Ship designers want to keep the CG of their ships as low as possible. Why? Discuss
why there is a contradiction between this requirement, and other requirements.
F
Q.3 Work through the example given in the lecture
r−h W r
where you are asked to find the minimum force
F applied horizontally at the axle of the wheel in h x
order to raise it over an obstacle of height h.
2. Why does a system oscillate? It does so because a force is always directed towards
a central equilibrium position. In other words, the force always acts to return the
object to its equilibrium position. So the object will oscillate around the equilibrium
position. The restoring force depends on the displacement Frestore = − k ∆x, where ∆x
is the distance away from the equilibrium point, the negative sign shows that the
force acts towards the equilibrium point, and k is a constant that gives the strength of
the restoring force.
4. In order to solve the SHO equation, we shall first learn how to differentiate some
d
elementary trignometric functions. So let us first learn how to calculate cos ωt
dt
starting from the basic definition of a derivative:
6. Having done all the work above, now you can easily see that any function of the
d 2x
form x(t ) = a cos ωt + b sin ωt satisfies2
= −ω 2 x. But what do ω , a, b represent ?
dt
2π
a) The significance of ω becomes clear if your replace t by t + in either sin ωt
ω
⎛ 2π ⎞
or cos ωt. You can see that cos ω ⎜ t + = cos (ωt + 2π ) = cos ωt . That is, the
⎝ ω ⎟⎠
function merely repeats itself after a time 2π / ω. So 2π / ω is really the period of the
2π m
motion T , T = = 2π . The frequency ν of the oscillator is the number of
ω k
1 1 k 2π k
complete vibrations per unit time: ν = = so ω = 2πν = = .
T 2π m T m
Sometimes ω is also called the angular frequency. Note that dim [ω ] = T −1 , from
it is clear that the unit of ω is radian/second.
b) To understand what a and b mean let us note that from x(t ) = a cos ωt + b sin ωt it
d
follows that x(0) = a and that x(t ) = −ω a sin ωt + ωb cos ωt = ωb (at t = 0). Thus,
dt
a is the initial position, and b is the initial velocity divided by ω.
7. Energy of simple harmonic motion. Put φ = 0 for convenience, and so imagine a mass
whose position oscillates like x = xm cos ωt. Let us first calculate the potential energy:
1 2 1 2
U= kx = kxm cos 2 ωt.
2 2
Now calculate the kinetic energy:
2
1 2 1 ⎛ dx ⎞ 1 1
mv = m ⎜ ⎟ = mω 2 xm2 sin 2 ωt = kxm2 sin 2 ωt
K=
2 2 ⎝ dt ⎠ 2 2
The sum of potential + kinetic is:
1 1
E = K + U = kxm2 cos 2 ωt + kxm2 sin 2 ωt
2 2
1 1
(
= kxm2 cos 2 ωt + sin 2 ωt = kxm2 .
2 2
)
Note that this is independent of time and energy goes from kinetic to potential, then
back to kinetic etc.
dx k 2
8. From the above, you can see that v =
dt
=±
m
( )
xm − x 2 . From this it is clear that
9. Putting two springs in parallel makes it harder to stretch them, and keff = k1 + k2 . In
⎛ kk ⎞
series they are easier to stretch, and keff = ⎜ 1 2 ⎟ . So a mass will oscillate faster in
⎝ k1 + k2 ⎠
the first case as compared to the second.
Q.1 A bottle is half filled with water (so that it floats upright) and then pushed a little
into the water. As you can see, it oscillates up and down.
a) Where does the restoring force come from?
b) Suppose that you filled the bottle 3/4 full. What would happen to the oscillation
frequency?
c) Why does the bottle eventually stop oscillating?
Q.3 Referring to the figure below, you can see that the coordinates of a particle going
around a circle are given by (x, y ) = ( R cosθ , R sin θ ) where θ = ωt.
a) On the same axes, plot x and y as a function of time. Obviously, here is a case of
two harmonic oscillations. What is the phase difference between the two?
dx
b) Find x& 2 + y& 2 , where x& is (this is a very popular way of denoting time derivatives
dt
because it is short, so you should be familiar with it.
x 2 + &&
c) Repeat the above for && y 2 , where we now have second derivatives instead.
y
v
R
θ
O x
1. In this chapter we shall continue with the concepts developed in the previous chapter
that relate to simple harmonic motion and the simple harmonic oscillator (SHO). It is
really very amazing that the SHO occurs again and again in physics, and in so many
different branches.
dθ
2
inertia and α is the angular acceleration, α = . Hence, we have
dt 2
d 2θ d 2θ ⎛ Mgd ⎞
I 2
= − Mgdθ , or, 2
= −⎜ ⎟θ . From this we immediately
dt dt ⎝ I ⎠
Mgd
see that the oscillation frequency is ω = . Of course, we have
I
used the small angle approximation over here again. Since all variables
except I are known, we can use this formula to tell us what I is about
any point. Note that we can choose to put the pivot at any point on the
body. However, if you put the pivot exactly at the centre of mass then
it will not oscillate. Why? Because there is no restoring force and the
torque vanishes at the cm position, as we saw earlier.
5. Sum of two simple harmonic motions of the same period along the same line:
x1 = A1 sin ωt and x2 = A2 sin (ωt + φ )
Let us look at the sum of x1 and x2 ,
x = x1 + x2 = A1 sin ωt + A2 sin (ωt + φ )
= A1 sin ωt + A2 sin ωt cos φ + A2 sin φ cos ωt
= sin ωt ( A1 + A2 cos φ ) + cos ωt ( A2 sin φ )
Let A1 + A2 cos φ = R cos θ and A2 sin φ = R sin θ . Using some simple trigonometry,
you can put x in the form, x = R sin (ωt + θ ) . It is easy to find R and θ :
A2 sin φ
R= A12 + A22 + A1 A2 cos φ and tan θ = .
A1 + A2 cos φ
( A1 + A2 )
2
Note that if φ = 0 then R = A12 + A22 + A1 A2 = = A1 + A2 and tan θ = 0
⇒ θ = 0. So we get x = ( A1 + A2 ) sin ωt. This is an example of constructive
( A1 − A2 )
2
interference. If φ = π then R = A12 + A22 − A1 A2 = = A1 − A2 and tan θ = 0
⇒ θ = 0. Now we get x = ( A1 − A2 ) sin ωt. This is destructive interference.
6. Composition of two simple harmonic motions of the same period but now at right
angles to each other:
Suppose x = A sin ωt and y = B sin (ωt + φ ) . These are two independent motions. We
x
can write sin ωt = and cos ωt = 1 − x 2 / A2 .
A
y x
From this, = sin ωt cos φ + sin φ cos ωt = cos φ + sin φ 1 − x 2 / A2 . Now square and
B A
rearrange terms to find:
x2 y2 xy
2
+ 2 −2 cos φ = sin 2 φ
A B AB
This is the equation for an ellipse (see questions at the end of this section).
8. Damped harmonic motion: Typically the frictional force due to air resistance, or in a
dx
liquid, is proportional to the speed. So suppose that the damping force = −b (why
dt
dx d 2x
negative sign?). Now apply Newton's law to a SHO that is damped: − kx − b = m 2
dt dt
2
d x dx
Rearrange slightly to get the equation for a damped SHO: m 2 + b + kx = 0.
dt dt
k
m
2
k ⎛ b ⎞
Its solution for ≥⎜ ⎟ is x = xm e
− bt / 2 m
cos (ω ' t + φ ) . The frequency is now
m ⎝ 2m ⎠
2
k ⎛ b ⎞
changed: ω ' = −⎜ ⎟ . The damping causes the amplitude to decrease with
m ⎝ 2m ⎠
time and when bt / 2m = 1, the amplitude is 1/ e ≈ 1/ 2.7 of its initial value.
Q.1 For the equation derived in point 6, sketch the curves on an x-y plot for:
a)φ = 0, and, b)φ = π / 2. Take A = 1, B = 2.
Q.2 A light rod of length L has two masses M and 3M attached to it as shown in the
diagrams below.
a) In each case, calculate the frequency of small oscillations.
b) In each case, calculate the centre of gyration.
L 2L
2 3
L
2 L
3
Q.3 In each of the two cases below, eliminate the time t. In other words, find a relation
between x and y which does not involve t.
a) x = sin t , y = 2sin t
b) x = sin t , y = cos t
c) x = cos t , y = sin 2t
Q.4 Verify that x (t ) = xme − bt / 2 m cos (ω ' t + φ ) is a solution of the damped SHO equation.
Plot x(t ) from t = 0 to t = 2 for the following case: xm = 1, b = 2, m = 1, φ = 0.
Q.5 A SHO is driven by a force F (t ) that depends upon time and obeys the equation,
d 2x k
m + k x = F (t ). Suppose that F (t ) = F0 cos 2ω0t + 2 F0 sin 3ω0t , where ω0 = .
dt 2 m
a) Find the general solution x(t ).
b) Find that particular solution which has x(t = 0) = 0 and x& (t = 0) = 1.
1. Elasticity : the property by virtue of which a body tends to regain its original shape and
size when external forces are removed. If a body completely recovers its original shape
and size , it is called perfectly elastic. Quartz, steel and glass are very nearly elastic.
2. Plasticity : if a body has no tendency to regain its original shape and size , it is called
perfectly plastic. Common plastics, kneaded dough, solid honey, etc are plastics.
3. Stress characterizes the strength of the forces causing the stretch, squeeze, or twist. It
is defined usually as force/unit area but may have different definitions to suit different
situations. We distinguish between three types of stresses:
a) If the deforming force is applied along some linear dimension of a body, the
stress is called longitudinal stress or tensile stress or compressive stress.
b) If the force acts normally and uniformly from all sides of a body, the stress is called
volume stress.
c) If the force is applied tangentially to one face of a rectangular body, keeping the other
face fixed, the stress is called tangential or shearing stress.
4. Strain: When deforming forces are applied on a body, it undergoes a change in shape or
size. The fractional (or relative) change in shape or size is called the strain.
change in dimension
Strain =
original dimension
Strain is a ratio of similar quantities so it has no units. There are 3 different kinds of
strain:
a) Longitudinal (linear) strain is the ratio of the change in length (∆L) to original
∆l
length (l ), i.e., the linear strain = .
l
b) Volume strain is the ratio of the change in volume (∆V ) to original volume (V )
∆V
Volume strain = .
V
c) Shearing strain : The angular deformation (θ ) in radians is called shearing stress.
∆x
For small θ the shearing strain ≡ θ ≈ tan θ = .
l
∆x F
θ
l
(b) Bulk Modulus (B) for volume strain: Let a body of volume V be subjected to a
uniform pressure ∆P on its entire surface and let ∆V be the corresponding
decrease in its volume. Then,
Volume Stress ∆P
B ≡ =− .
Volume Strain ∆V / V
1/ B is called the compressibility. A material having a small value of B can be
compressed easily.
(c) Shear Modulus (η ) for shearing strain: Let a force F produce a strain θ as in the
diagram in point 4 above. Then,
shearing stress F/A F Fl
η≡ = = = .
shearing strain θ A tan θ A∆x
6. When a wire is stretched, its length increases and radius decreases. The ratio of the
∆r / r
lateral strain to the longitudinal strain is called Poisson's ratio, σ = . Its value lies
∆l / l
between 0 and 0.5.
7. We can calculate the work done in stretching a wire. Obviously, we must do work
against a force. If x is the extension produced by the force F in a wire of length l ,
YA
then F = x. The work done in extending the wire through ∆l is given by,
l
YA ( ∆l )
∆l ∆l 2
YA
W = ∫ Fdx = ∫ xdx =
0
l 0 l 2
YA ( ∆l )
2
1 ⎛ Y ∆l ⎞⎛ ∆l ⎞ 1
= = ( Al ) ⎜ ⎟⎜ ⎟ = × volume × stress × strain
l 2 2 ⎝ l ⎠⎝ l ⎠ 2
1
Hence, Work / unit volume = × stress × strain. We can also write this as,
2
1 ⎛ YA∆l ⎞ 1
W= ⎜ ⎟ ∆l = × load × extension.
2⎝ l ⎠ 2
∆F
9. The normal force per unit area is called pressure, P = . Pressure is a scalar quantity.
∆A
∆m
Its unit is Newtons/metre 2 , or Pascal (Pa). Another scalar is density, ρ = , where
∆V
∆m is the mass of a small piece of the material and ∆V is the volume it occupies.
Note that we are taking the origin (y = 0) at the bottom of the liquid. Therefore as the
elevation increases (dy positive), the pressure decreases (dp negative). The quantity ρ g
is the weight per unit volume of the fluid. For liquids, which are nearly incompressible,
ρ is practically constant.
dp ∆p p2 − p1
∴ ρ g = constant ⇒ = = = − ρ g ⇒ p2 − p1 = − ρ g ( y2 − y1 ) .
dy ∆y y2 − y1
Q.1 Give 3 examples (other than those given in the lecture or the summary above) of
a) Nearly elastic materials.
b) Nearly plastic materials.
Would you consider glass to be elastic or plastic? Why?
Q.2 In the summary above, I wrote that "the forces acting on a fluid at rest have to be
normal to the surface. This implies that the free surface of a liquid at rest, under gravity,
in a container, is horizontal.". Be sure that you understand this, and then explain what
happens as:
a) You slowly tilt a glass. Why does the surface stay horizontal?
b) You stir a cup of tea with a spoon. Why does the surface of the liquid rise up as
you move away from the centre?
Q.3 To what height should a cylindrical vessel be filled with a homogeneous liquid to
make the force with which the liquid presses on the sides of the wall equal to the force
exerted by the liquid on the bottom of the vessel? [Hint: the force exerted by the liquid
1
on the bottom = ( hg ρ ) π r 2 ; mean pressure on the wall = ( hg ρ ) . Answer: the liquid
2
should be filled up to the height equal to the radius of the cylinder. Work out the
remaining details!]
Q.4 As discussed in the lecture, Archimedes Principle states that a body wholly or partially
immersed in a fluid is buoyed up by a force (or upthrust) equal in magnitude to the weight
of the fluid displaced by the body. Show how this follows for a cube immersed in water
using the equation derived earlier, namely, p2 − p1 = − ρ g ( y2 − y1 ) .
1. A fluid is matter that has no definite shape and adjusts to the container that it is
placed in. Gases and liquids are both fluids. All fluids are made of molecules. Every
molecules attracts other molecules around it.
2. Liquids exhibit surface tension. A liquid has the property that its free surface
tends to contract to minimum possible area and is therefore in a state of tension.
The molecules of the liquid exerts attractive forces on each other, which is called
cohesive forces. Deep inside a liquid, a molecule is surrounded by other molecules
in all directions. Therefore there is no net force on it. At the surface, a molecule is
surrounded by only half as many molecules of the liquid, because there are no
molecules above the surface.
5. Let's ask how much work is done when we stretch the skin of a liquid. If we move the
sliding wire through a displacement ∆x, the work done is F ∆x. Now F is a conservative
force, so there is potential energy ∆U = F ∆x = γ L∆x where L is the length of the surface
∆U
layer L∆x = ∆A = change in area of the surface. Thus γ = . So we see that surface
∆A
tension is the surface potential energy per unit area !
6. When liquids come into contact with a solid surface, the liquid's molecules are attracted
by the solid's molecules (called "adhesive" forces). If these adhesive forces are stronger
than the cohesive forces, the liquid's molecules are pulled towards the solid surface and
8. From the fact that liquids are incompressible, equal volumes of liquid must flow in both
sections in time t, i.e. V1 / t = V2 / t ⇒ V1 = V2 . But you can see that V1 = A1L1 = A1v1t and
similarly that V2 = A2 L2 = A2 v 2t. Hence A1v1 = A2 v 2 . This means that liquid will flow
faster when the tube is narrower, and slower where it is wider.
L1 = v1t
L2 = v2t
1 2
10. Let us apply this to water flowing in a pipe whose crossection decreases along its
length. (A1 is area of the wide part, etc).There is no change in the height so y1 = y2 and
v1
v2
This is exactly why an aircraft flies: the wing shape is curved so that when the aircraft
moves through the air, the air moves faster on the top part of the wing than on the
lower part. Thus, the air pressure is lower on the top compared to the bottom and there
is a net pressure upwards. This is called lift.
Q.2 When water is poured on to a concrete floor from a height of 10 cm, it breaks up
into many drops. About how many more drops will be made if it is poured from 30
cms instead? Assume that the size of the drops is the same in both cases.
Q.3 Does it take more energy to create two bubbles of radius of radius 0.5cm or one
bubble of radius 1.0cm? Find the ratio of the two energies.
Q.4 Apply Bernoulli's equation to find the difference in pressures between the lower and
upper parts of the tube below. An incompressible fluid flows in it. [Hint: v1 = v2 . Why?]
v2
v1
h
2. To measure the intensity of sound, we use decibels as the unit. Decibels (db) are a relative
measure to compare the intensity of different sounds with one another,
I
R ≡ relative intensity of sound I = log10 (decibels)
I0
Typically, on a street without traffic the sound level is about 30db, a pressure horn creates
about 90db, and serious ear damage happens around 120db.
λ vO
r1
Moving observer, source at rest. If the observer was at rest, the number of waves she
would receive in time t would be t / T (or v t / λ ). But if she is moving towards the
source with speed v 0 (as in the above figure), the additional number of waves received is
obviously v 0 t / λ . By definition,
number of waves received
ν ′ = frequency actually heard =
unit time
vt v 0t
+
∴ν ′ = λ λ = v + v0 = v + v0 = ν v + v0
t λ v /ν v
⎛ v ⎞
We finally conclude that the frequency actuall heard is ν ′ = ν ⎜ 1 + 0 ⎟ . So as the
⎝ v⎠
observer runs towards the source, she hears a higher frequency (higher pitch).
Moving source, observer at rest : As the source runs towards the observed, more
vs
waves will have to packed together. Each wavelength is reduced by . So the
ν
v vs
wavelength seen by the observer is λ ′ = − . From this, the frequency that she
ν ν
v v v
hears is ν ′ = = =ν .
λ ( v − vs ) /ν ( v − vs )
v + vo
Moving source and moving observer : ν ′ = ν . As you can easily see, the
v − vs
above two results are special cases of this.
Q.1 A cubical box of side 5 metres is filled uniformly with sound of 40 db intensity. What
is the total power that a loudspeaker must have to maintain this intensity?
Q.2 On the same piece of graph paper plot the following functions for θ between 0 and 2π :
a) sinθ
1
b) sin 2θ
2
1
c) sinθ + sin 2θ
2
Q.3 Why are the Doppler shift formulae different for the two cases: moving observer and
static source as compared to static observer and moving source? Explain without using
any formula. Why is knowing the relative velocity of source and observer not
sufficient for knowing the frequency received by the observer?
Q.4 Suppose that two trucks are racing towards each other with the same speed. One is
blowing its horn with frequency 1000 Hz (cycles per second). The other truck driver
hears a frequency of 1050 Hz. How fast were the trucks moving in km/hour before they
collided? [The speed of sound in air is about 340 m/sec]
Q.5 In a musical instrument like the sitar, how does the player control the relative amplitude
of the harmonics present?
1. Wave motion is any kind of self-repeating (periodic, or oscillatory) motion that transports
energy from one point to another. Waves are of two basic kinds:
(a) Longitudinal Waves: the oscillation is parallel to the direction of wave travel.
Examples: sound, spring, "P-type" earthquake waves.
(b) Transverse Waves: the oscillation is perpendicular to the direction of wave travel.
Examples: radio or light waves, string, "S-type" earthquake waves.
2. Waves transport energy, not matter. Taking the vibration of a string as an example, each
segment of the string stays in the same place, but the work done on the string at one end is
transmitted to the other end. Work is done in lifting the mass at the other end below.
3. The height of a wave is called the amplitude. The average power (or intensity) in a wave
is proportional to the square of the amplitude. So if a(t ) = a0 sin(ωt − kx) is a wave of some
kind, then a0 is the amplitude and I ∝ a02 .
4. A sound source placed at the origin will radiate sound waves in all directions equally. These
1 1
are called spherical waves. For spherical waves the amplitude ∝ and so the power ∝ 2 .
r r
We can easily see why this is so. Consider a source of sound and draw two spheres:
Let P1 be the total radiated power and I1 the intensity at r1 , etc. All the power (and energy)
that crosses r1 also crosses r2 since none is lost in between the two. We have that,
I1 r22 1
4π r12 I1 = P1 and 4π r22 I 2 = P2 . But P1 = P2 = P, and so = 2 or I ∝ 2 .
I 2 r1 r
r2
r1
r1
6. When two sources are present the total amplitude at any point is the sum of the two
separate amplitudes, y ( x, t ) = y1 ( x, t ) + y2 ( x, t ) . Now you remember that the power is
proportional to the square of the amplitude, so P ∝ ( y1 + y2 )2 . This is why interference
happens. In the following we shall see why. Just to make things easier, suppose the two
waves have equal amplitude. So lets take the two waves to be :
y1 ( x, t ) = ym sin ( kx − ωt − φ1 ) and y2 ( x, t ) = ym sin ( kx − ωt − φ2 )
The total amplitudes is: y ( x, t ) = y1 ( x, t ) + y2 ( x, t )
= ym ⎡⎣sin ( kx − ωt − φ1 ) + sin ( kx − ωt − φ2 ) ⎤⎦
1
Now use the trigonometric formula, sin B + sin C = 2sin ( B + C ) × cos ( B − C ) to get,
2
y ( x, t ) = ym ⎡⎣sin ( kx − ωt − φ1 ) + sin ( kx − ωt − φ2 ) ⎤⎦
⎡ ⎛ ∆φ ⎞ ⎤
= ⎢ 2 ym cos ⎜ ⎟ ⎥ × sin ( kx − ωt − φ ′ ) .
⎣ ⎝ 2 ⎠⎦
(φ + φ )
Here ∆φ = φ2 − φ1 is the difference of phases, and φ ′ = 1 2 is the sum. So what do
2
we learn from this? That if φ2 = φ1 , then the two waves are in phase and the resultant
amplitude is maximum (because cos 0 = 1). And that if φ2 = φ + π , then the two waves are
out of phase and the resultant amplitude is minimum (because cos π / 2 = 0). The two
waves have interfered with each other and have increased/decreased their amplitude in
⎛ ∆φ ⎞
these two extreme cases. In general cos ⎜ ⎟ will be some number that lies between
⎝ 2 ⎠
-1 and +1.
8. The speed of a pulse. A pulse is a burst of energy (sound, electromagnetic, heat,...) and
could have any shape. Mathematically any pulse has the form y ( x, t ) = f ( x − vt ) . Here
f is any function (e.g. sin, cos, exp,....). Note that at time t = 0, y ( x,0 ) = f ( x ) and the
shape would look as on the left in the diagram below. At a late time t, it will look just the
same, but shifted to the right. In other words at time t , y ( x, t ) = f ( x′ ) where x′ = x − vt.
Fix your attention on any one point of the curve and follow it as the pulse moves to the
dx dx
right. From x − vt = constant it follows that − v = 0 , or v = . This is called the
dt dt
phase velocity because we derived it using the constancy of phase.
y y′
P
f ( x′ )
O O′ x′
vt
Q.1 Give a qualitative explanation (no formulas) for why sound travels faster through iron
than through air. Can you think of an experiment that would demonstrate this fact?
Q.2 A stone is dropped into a lake and the waves spread out uniformly. Using the same
reasoning as in point 4, find:
a) The rate at which the wave intensity decreases with distance r from the centre.
b) The rate of decrease of amplitude.
1
− ( x − 2t )2
Q.3 Make a plot of the function y ( x, t ) = e 2
as a function of x for two different values
of t , t = 0 and t = 1.
1. Newton's law of universal gravitation states that the force of attraction between two
mm
masses m1 and m2 is F ∝ 1 2 2 and is directed along the line joining the two bodies.
r
mm
Putting in a constant of proportionality, F = G 1 2 2 . Now let's be a bit careful of the
r
direction of the force. Looking at the diagram below, F21 = Force on m2 by m1 ,
F12 = Force on m1 by m2 , F12 = F21 = F . By Newton's Third Law, F12 = − F21.
m1 F12 F21 m2
I 4π 2 I
observing the period of free oscillations, T = 2π ⇒κ = 2
κ T
2
mL
with I = . The modern value is G = 6.67259 × 10−11 N .m 2 / kg 2
2
3. The magnitude of the force with which the Earth attracts a body of mass m towards its
GmM E
centre is F = , where RE = 6400 km is the radius of the Earth and M E is the mass.
RE 2
The material does not matter - iron, wood, leather, etc. all feel the force in proportion to
GmM E
their masses. If the body can fall freely, then it will accelerate. So, F = mg = .
RE 2
4. The gravitational potential is an important quantity. It is the work done in moving a unit
GM
mass from infinity to a given point R, and equals V (r ) = − .
R
0 ∞
Proof: Conservation of energy says, dV = − Fdr ⇒ ∫
V ( R)
dV = − ∫ drF (r )
R
∞ ∞
dr ⎡1⎤ GM
Integrate both sides: 0 − V ( R) = GM ∫ 2
= −GM ⎢ ⎥ , ∴V ( R ) = −
R
r ⎣ r ⎦R R
5. Using the above formula, let us calculate the change in potential energy ∆U when we raise
a body of mass m to a height h above the Earth's surface.
∆U = GMm ⎜
⎛ 1
−
1 ⎞
⎝ RE RE + h ⎠
⎛
⎟ = GMm ⎜1 −
1
⎝ 1 + h / RE ⎠
⎞
⎟ = GMm 1 − (1 + h / RE )
−1
( )
Now suppose that the distance h is much smaller than the Earth's radius. So, for h RE ,
⎛ GM ⎞
(1 + h / RE ) = 1 − h / RE . So we find ∆U = GMm (1 − (1 − h / RE ) ) = m ⎜
−1
⎟ h = mgh.
⎝ RE ⎠
6. We can use the expression for potential energy and the law of conservation of energy to
find the minimum velocity needed for a body to escape the Earth' gravity. Far away from
the Earth, the potential energy is zero, and the smallest value for the kinetic energy is
1 GMm
zero. Requiring that ( KE + PE )r = R = ( KE + PE )r =∞ gives mv e2 − = 0 + 0. From
2 RE
2GM
this, v e = = 2 gRE . Putting in some numbers we find that for the Earth ve = 11.2 km/s
RE
and for the Sun v e = 618km/s. For a Black Hole, the escape velocity is so high that nothing
can escape, even if it could move with the speed of light! (Nevertheless, Black Holes can be
observed because when matter falls into them, a certain kind of radiation is emitted.)
GM
8. What is the total energy of a satellite moving in a circular orbit around the earth? Clearly,
it has two parts, kinetic and potential. Remeber that the potential energy is negative. So,
1 GM E m GM
E = KE + PE = mvo2 − . But, vo2 = as we saw earlier and therefore,
2 r r
1 GMm GMm 1 GMm
E= − =− . Note that the magnitude of the potential energy is
2 r r 2 r
larger than the kinetic energy. If it wasn't, the satellite would not be bound to the Earth!
9. A famous discovery of the astronomer Johann Kepler some 300 years ago says that the
line joining a planet to the Sun sweeps out equal areas in equal intervals of time. We can
easily see this from the conservation of angular momentum. Call ∆ A the area swept out
1
in time ∆ t. Then from the diagram below you can see that ∆ A = r ( r ∆θ ) . Divide this by
2
∆ t and then take the limit where it becomes very small,
dA ∆A 1 2 ⎛ ∆θ ⎞ 1 2 L
= lim = r ⎜ lim ⎟= r ω= .
dt ∆t → 0 ∆t 2 ⎝ ∆t → 0 ∆t ⎠ 2 2m
Since L is a constant, we have proved one of Kepler's laws (with so little effort)!
∆θ
r ∆θ
M r ∆A
θ
1. The acceleration due to gravity g is, as you know, about 9.8 metres/second 2 on the
surface of the earth. However, this is not a constant!
a) Why does g vary a little bit from place to place on the surface of the Earth even
if the height at which it is being measured remains constant?
b) What happens to g as you go higher and higher?
c) What happens to g as you approach the centre of the Earth?
d) Where do you expect g will have its maximum value, and why?
3. A planet moves in an elliptical orbit (see below) and has a speed v1 at a distance d1 from
the Sun.
v2
M
d1 d2
v1
v1d1
(a) What will be its speed at distance d 2 ? [Ans: v 2 = ]
d2
2GMd1d 2
(b) Show that the value of its (constant) angular momentum is, m .
( d1 + d 2 )
1. Like charges repel, unlike charges attract. But by how much? Coulomb's Law says that
this depends both upon the strength of the two charges and the distance between them.
qq qq
In mathematical terms, F ∝ 1 2 2 which can be converted into an equality, F = k 1 2 2 .
r r
The constant of proportionality will take different values depending upon the units we
1
choose. In the MKS system, charge is measured in Coulombs (C) and k = with
4πε 0
ε 0 = 8.85 × 10−12 C 2 / Nm 2 and hence k = 8.99 × 109 Nm 2 / C 2 .
2. The situation is quite similar to that of gravity, except that electric charges and not masses
r 1 q1q2
are the source of force. In vector form, F12 = rˆ12 is the force exerted by 2 on 1,
4πε 0 r122
r r
r12 1 q1q2
where the unit vector is rˆ12 = . On the other hand, F21 = rˆ21 is the force exerted
r12 4πε 0 r122
r r
by 1 on 2. By Newton's Third Law, F12 = − F21. For many charges, the force on charge 1 is
r r r r
given by, F1 = F12 + F13 + F14 + ⋅ ⋅ ⋅ ⋅
3. Charge is quantized. This means that charge comes in certain units only. So the size of a
charge can only be 0, ± e, ± 2e, ± 3e, ⋅ ⋅ ⋅ ⋅ where e = 1.602 × 10−19 C is the value of the
charge present on a proton. By definition we call the charge on a proton positive. This
makes the charge on an electron negative.
4. Charge is conserved. This means that charge is never created or destroyed. Equivalently,
in any possible situation, the total charge at an earlier time is equal to the charge at a
later time. For example, in any of the reactions below the initial charge = final charge:
e− + e+ → γ + γ (electron and positron annihilate into neutral photons)
π0 →γ +γ (neutral pion annihilates into neutral photons)
H 2 + H 2 → H 3 + p (two deuterons turn into tritium and proton)
5. Field : this a quantity that has a definite value at any point in space and at any time. The
simplest example is that of a scalar field, which is a single number for any value of x, y , z , t.
Examples: temperature inside a room T ( x, y, z, t ) , density in a blowing wind ρ ( x, y, z , t ), ⋅ ⋅ ⋅
There are also vector fields, which comprise of three numbers at each value of x, y , z , t.
Examples: the velocity of wind, the pressure inside a fluid, or even a sugarcane field. In
6. The electric field is also an example of a vector field, and will be the most important for
our purpose. It is defined as the force on a unit charge. Or, since we don't want the charge
to disturb the field it is placed in, we should properly define it as the force on a "test"
F
charge q0 , E ≡ . Here q0 is very very small. The electric field due to a point charge can
q0
1 qq0
be calculated by considering two charges. The force between them is F = and so
4πε 0 r 2
F 1 q
E= = . A way to visualize E fields is to think of lines starting on positive charges
q0 4πε 0 r 2
and ending on negative charges. The number of lines leaving/entering gives the amount of
charge.
8. Measuring charge. One way to do this is to balance the gravitational force pulling a
charged particle with mass m with the force exerted on it by a known electric field
(see below). For equilibrium, the two forces must be equal and so mg = qE. The
mg
unknown charge q can then be found from q = .
E
r y
eE
r
E
r
mg
9. Given several charges, one can find the total electric field at any point as the sum of the
r r r r
fields produced by the charges at that point individually, E = E1 + E2 + E3 + ⋅ ⋅ ⋅ ⋅ ⋅ or
r r q
E = ∑ Ei = k ∑ 2i rˆi ( i = 1, 2,3, ⋅ ⋅ ⋅) . Here rˆi is the unit vector pointing from the charge to
i i ri
1 q d /2 1 qd
find: E = 2 = .
4πε 0 x + ( d / 2 )
2 2
x + ( d / 2)
2 2 4πε 0 ⎡ x 2 + ( d / 2 )2 ⎤ 3 / 2
⎣ ⎦
+
r
θ
d
x r r
θ E− θ θ E+
r
− r
E
11. The result above is so important that we need to discuss it further. In particular, what
happens if we are very far away from the dipole, meaning x >> d ? Let us first define
the dipole moment as the product of the charge × the separation between them p = qd .
−3 / 2
1 p⎡ ⎛ d ⎞ ⎤
2
p 1 1 1 p
Then, E = = ⎢1+ ⎜ ⎟ ⎥ = . In the above,
4πε 0 x ⎡1 + ( d / 2 x )2 ⎤
3 3 / 2
4πε 0 x ⎣⎢ ⎝ 2 x ⎠ ⎦⎥
3
4πε 0 x 3
⎣ ⎦
d / 2 x has been neglected in comparison to 1. So finally, we have found that for x >> d ,
1 p
E= .
4πε 0 x 3
12. It is easy to find the torque experienced by an electric dipole that is placed in a uniform
d d
electric field: The magnitude is τ = F sin θ + F sin θ = Fd sin θ , and the direction is
2 2
perpendicular and into the plane. Here θ is the angle between the dipole and the electric
field. So τ = ( qE ) d sin θ = pE sin θ .
1. What will be the direction of the electric field in each of the following circumstances :
(a) In the middle of a square with four equal positive charges at the corners?
(b) In the middle of a square with 3 equally positive and one negative charge at the corners?
(c) At the centre of a hollow sphere with charges distributed uniformly on the surface?
(d) At the centre of a hollow sphere with the charge on one hemisphere is positive, and the
charge on the other hemisphere is negative.
2. How could you experimentally investigate (as a matter of principle) where the lines of
electric force are? Why can two lines of electric force never cross each other?
1
3. (a) The electric field of a dipole does not fall off as . Why?
r2
(b) Instead of calculating the electric field of the dipole, use the figure on the previous
page to calculate the electric potential on the x -axis.
4. Work is done by an electric field acting upon a dipole because the dipole is turned
θ θ
r r
through a certain angle. So, W = ∫ dW = ∫ τ .dθ = ∫ −τ dθ .
θ0 θ0
1. In the last lecture we learned how to calculate the electric field if there are any number
of point charges. But how to calculate this when charges are continuously distributed over
some region of space? For this, we need to break up the region into little pieces so that
r r r r
each piece is small enough to be like a point charge. So, E = ∆E1 + ∆E2 + ∆E3 + ⋅ ⋅ ⋅⋅, or
r r r
E = ∑ ∆Ei is the total electric field. Remember that E is a vector that can be resolved
r
into components, E = Exiˆ + E y ˆj + Ez kˆ. In the limit where the pieces are small enough,
r r
we can write it as an integral, E = ∫ dE (or Ex = ∫ dEx , E y = ∫ dE y , Ez = ∫ dEz )
2. Charge Density: when the charges are continuously distributed over a region - a line, the
surface of a material, or inside a sphere - we must specify the charge density. Depending
upon how many dimensions the region has, we define:
(a) For linear charge distribution: dq = λ ds
(b) For surface charge distribution: dq = σ dA
(c) For volume charge distribution: dq = ρ dV
The dimensions of λ ,σ , ρ are determined from the above definitions.
3. As an example of how we work out the electric field coming from a continuous charge
distribution, let us work out the electric field from a uniform ring of charge at the point P.
r
dE
θ
dE cos θ
P
θ z
y
R
λ ds x
The small amount of charge λ ds gives rise to an electric field whose magnitude is
1 λ ds λ ds
dE = =
4πε 0 r 2
(
4πε 0 z 2 + R 2 )
z z
The component in the z direction is dEz = dE cosθ with cos θ = = .
( )
1/ 2
r z + R2
2
4. As another example, consider a continuous distribution of charges along a wire that lies
along the z -axis, as shown below. We want to know the electric field at a distance x from
the wire. By symmetry, the only non-cancelling component lies along the y -axis.
r
z dE z r
dE
y θ y
θ r
z P dE y
r
dz dq
x
Applying Coulomb's law to the small amount of charge λ dz along the z axis gives,
1 dq 1 λ dz
dE = =
4πε 0 r 2
4πε 0 y + z 2
2
The rest is just technical: to solve the integral, put z = y tan θ ⇒ dz = y sec 2 θ dθ . And so,
λ θ =π / 2 λ
E= ∫
2πε 0 y θ = 0
cosθ dθ =
2πε 0 y
. Now, we could have equally well taken the x axis. The
only thing that matters is the distance from the wire, and so the answer is better written as:
λ
E= .
2πε 0 r
5. The flux of any vector field is a particularly important concept. It is the measure of the
"flow" or penetration of the field vectors through an imaginary fixed surface. So, if there
is a uniform electric field that is normal to a surface of area A, the flux is Φ = EA. More
generally, for any surface, we divide the surface up into little pieces and take the
5. Let us apply the above concept of flux to calculate the flux leaving a sphere which has
a charge at its centre. The electric field at any point on the sphere has magnitude equal
1 q
to and it is directed radially outwards. Let us now divide up the surface of the
4πε 0 r 2
1 q
sphere into small areas. Then Φ = ∑ E ∆A = E ∑ ∆A = (
4πε 0 r 2
)
4π r 2 . So we end up with
q
the important result that the flux leaving this closed surface is Φ = .
ε0
6. Gauss's Law: the total electric flux leaving a closed surface is equal to the charge
enclosed by the surface divided by ε 0 . We can express this directly in terms of the
r r q
mathematics we have learned, Φ ≡ ∫ E ⋅ dA = enclosed . Actually, we have already seen
ε0
why this law is equivalent to Coulomb's Law in point 5 above, but let's see it again. So,
r r
applying Gauss's Law to a sphere containing charge, ε 0 ∫ E ⋅ dA = ε 0 ∫ EdA = qenclosed . If
the surface is a sphere, then E is constant on the surface and ε 0 E ∫ dA = q and from this
1 q
ε 0 E ( 4π r 2 ) = q ⇒ E = . This is Coulomb's Law again, but the power of Gauss's
4πε 0 r 2
law is that it holds for any shape of the (closed) surface and for any distribution of charge.
7. Let us apply Gauss's Law to a hollow sphere that has charges only on the surface. At any
( )
distance r from the centre, Gauss's Law is ε 0 E 4π r 2 = qenclosed . Now, if we are inside the
sphere then qenclosed = 0 and there is no electric field. But if we are outside, then the total
1 q
charge is qenclosed = q and E = , which is as if all the charge was concentrated at the
4πε 0 r 2
centre.
8. Unfortunately, it will not be possible for me to prove Gauss's Law in the short amount of
time and space available but the general method can be outlined as follows: take any
volume and divide it up into little cubes. Each little cube may contain some small amount
of charge. Then show that for each little cube, Gauss's Law follows from Coulomb's Law.
Finally, add up the results. For details, consult any good book on electromagnetism.
Q.1 As we all know, everything is made up of atoms and the only charges present inside
matter reside on atoms or electrons. So everything is ultimately "bumpy". When then
is the concept of a continuous distribution of charge used in this lecture? Under what
conditions would this become inadequate, or even wrong?
Q.2 Give the dimensions of λ ,σ , ρ using their definitions. In 1,2,3 dimensions write the
expression for the total charge.
Q.3 In the diagram below you see two parallel plates with surface charge density + σ and
-σ respectively. We would like to calculate the electric field everywhere using Gauss's
theorem, and the assumption that the plates are infinitely long in the vertical direction.
L R
+ −
Gaussian surface
+ −
+ −
+ −
+ −
+ −
+ −
a) Show that the electric field vanishes everywhere except in between the two plates. Use
Gauss's Law to show this. [Hint: draw a suitable gaussian surface that extends both to
the left and right, and use the fact that the total enclosed charge is zero.]
b) Look at the gaussian surface that has been drawn in the diagram and evaluate each of
r r r r r r r r
terms defining the flux, Φ = ∫ E ⋅ dA = ∫ E ⋅ dA + ∫ E ⋅ dA + ∫ E ⋅ dA
outer inner side
cap cap walls
σ
c) Show that the field between the plates is given by E = .
ε0
d) Suppose that one plate is totally removed and that the charge density is +σ on the
remaining plate. What is the electric field now?
1. You are already familiar with the concept of gravitational potential energy. When you
lift a weight, you have to do work against the downwards pull of the Earth. That work
r r
is stored as potential energy. Suppose a force F acts on something and displaces it by ds .
r r
Then the work done is F ⋅ ds . The work done in going from point a to point b (call it Wab )
b r r
is then got by adding together the little bits of work, Wab = ∫ F ⋅ ds . The change in
a
2. The electrostatic force is conservative and can be represented by a potential. Let us see
how to calculate the potential. So consider two charges separated by a distance as below.
+q q =1
r
Let us take the point a very far from the fixed charge q, and the unit charge at the point b
to be at a distance R from q. Then the work you did in bringing the unit charge from infinity
R R
1 q⎛1 1⎞ 1
q 1
to R is, Wab = ∫ ( −qE ) dr = − ∫ dr r =−
q⎜ − ⎟ = − . Since the charges
∞
4πε 0 ∞ 4πε 0 ⎝ R ∞ ⎠
2
4πε 0 R
repel each other, it is clear that you had to do work in pushing the two charges closer
together. So where did the negative sign come from? Answer: the force you exert on the
unit charge is directed towards the charge q, i.e. is in the negative direction. This is why
r r q 1
F ⋅ ds = (− qE )dr. Now ∆U = U ( R ) - U (∞) = . If we take the potential at ∞ to be
4πε 0 R
q 1
zero, then the electric potential due to a charge q at the point r is U (r ) = .
4πε 0 r
dU
Remember that we know how to calculate the force given the potential: F = − . Apply
dr
q 1
this here and you see that F = , (which also has the correct repulsive sign).
4πε 0 r 2
3. From the above, it is quite obvious that the potential energy of two charges q1 , q2 is,
1 q1q2 mm
U (r) = . Compare this with the formula for gravitational energy, U ( r ) = −G 1 2 .
4πε 0 r r
What is the difference? From here, you can see that the gravitational force is always
negative (which means attractive), whereas the electrostatic force can be both attractive
3. The electric potential (or simply potential) is the energy of a unit charge in an electric field.
Joule
So, in our MKS units, the unit of potential is 1 = 1 Volt. Another useful unit is
Coulomb
"electron volt" or eV. The definition is:
One electron - volt = energy gained by moving one electron charge through one Volt
( )
= 1.6 × 10−19 C × 1V = 1.6 × 10−19 J
It is useful to note that 1 Kev = 103eV (kilo-electron-volt)
1 Mev = 106eV (million-electron-volt)
1 Gev = 109eV (giga-electron-volt)
1 Tev = 1012eV (tera-electron-volt)
4. Every system seeks to minimize its potential energy (that is why a stone falls down!).
So, positive charges accelerate toward regions of lower potential, but negative charges
accelerate toward regions of higher potential. Note that only the potential difference
matters - even if a charge is placed in a region where there is a high potential, it will not
want to move unless there is some other place where the potential is higher/lower.
5. Given a system of charges, we can always compute the force - and hence the potential -
that arises from them. Here are some important general statements:
a)Potentials are more positive in regions which have more positive charge.
b)The electric potential is a scalar quantity (a scalar field, actually).
dU
c)The electric potential determines the force through F = − , and hence the electric
dr
field because F = qE.
d)The electric potential exists only because the electrostatic force is conservative.
the distance of the i'th charge from the point where r12
q2
the potential is being calculated or measured. As an r13
example, the potential from the three charges is:
r23
1 q1q2 1 q1q3 1 q2 q3
V(r ) = + + .
4πε 0 r12 4πε 0 r13 4πε 0 r23 q3
r2
−q r2 − r1 = d cos θ
Q.1 How much work (in joules) is done by a force that moves a charge of 2 coulombs
through a distance of 10cm in a constant electric field of 7 V/cm?
Q.2 It is common to call the potential of the earth as zero. Is this necessary? Is it okay to
do so, and why?
Q.3 There are two hollow metal spheres each with one coulomb of charge upon it. One has
radius 4cm and the other 14cm. Find the ratio of the electric fields on the two surfaces.
Repeat if instead they have the same charge density σ on the surfaces.
1. Two conductors isolated from one another and from their surroundings, form a capacitor.
These conductors may be of any shape and size, and at any distance from each other. If
a potential difference is created between the conductors (say, by connecting the terminals
of a battery to them), then there is an electric field in the space between them. The electric
field comes from the charges that have been pushed to the plates by the battery. The
amount of charge pushed on to the conductors is proportional to the potential difference
between the battery terminals (which is the same as between the capacitor plates). Hence,
Q ∝ V . To convert this into an equality, we write Q = CV . This provides the definition of
Q
capacitance, C = .
V
2. Using the above definition, let us calculate the capacitance of two parallel plates separated
by a distance d as in the figure below.
+q
+ + + + + +
+ + + + + + + + +
d Gaussian surface
− − − − − − − − −
−q
r r q
Recall Gauss's Law: Φ ≡ ∫ E ⋅ dA = enclosed . Draw any Gaussian surface. Since the electric
ε0
Q
field is zero above the top plate, the flux through the area A of the plate is Φ = EA = ,
ε0
Q
where Q is the total charge on the plate. Thus, E = is the electric field in the gap
ε0 A
E Q ε A
between the plates. The potential difference is V = , and so C = = 0 . You can see
d V d
that the capacitance will be large if the plates are close to each other, and if the plates
have a large area. We have simplified the calculation here by assuming that the electric
field is strictly directed downwards. This is only true if the plates are infinitely long. But
we can usually neglect the side effects. Note that any arrangement with two plates forms
a capacitor: plane, cylindrical, spherical, etc. The capacitance depends upon the geometry,
the size of plates and the gap between them.
C1
Ceq
V
C2
4. We can repeat the analysis above when the capacitors are put in series. Here the difference
is that now we must start with V = V1 + V2 , where V1 and V2 are the voltages across the two.
Clearly the same charge had to cross both the capacitors. Hence,
Q Q 1 1
V = V1 + V2 = + = Q( + ).
C1 C2 C1 C2
Q 1 1 1
From our definition, Ceq = , it follows that = + . The total capacitance is now
V Ceq C1 C2
1 1
less than if they were in parallel. In general, =∑ ( for n capacitors ) .
Ceq Cn
V1 V2
Ceq
C1 C2
V
V
5. When a battery is connected to a capacitor, positive and negative charges appear on the
opposite plates. Some energy has been transferred from the battery to the capacitor, and
now been stored in it. When the capacitor is discharged, the energy is recovered. Now
let us calculate the energy required to charge a capacitor from zero to V volts.
Begin: the amount of energy required to transfer a small charge dq to the plates is
dU = vdq, where v is the voltage at a time when the charge is q = Cv. As time goes on,
the total charge increases until it reaches the final charge Q (at which point the voltage
becomes V ). So,
Q
q q Q2 1
dU = vdq = dq ⇒ U = ∫ dU = ∫ dq = = CV 2 .
C 0
C 2C 2
1 Q
6. Dielectrics. Consider a free charge + Q. Around it is an electric field, E = .
4πε 0ε r r 2
Now suppose this charge is placed among water molecules. These molecules will polarise,
i.e. the centre of positive charge and centre of negative charge will be slightly displaced.
The negative part of the water molecule will be attracted toward the positive charge + Q.
1 1 Q
So, in effect, the electric field is weakened by and becomes, . Here I have
εr 4πε 0ε r r 2
introduced a new quantity ε r called "dielectric constant". This is a number that is usually
bigger than one and measures the strength of the polarization induced in the material. For
air, ε r = 1.0003 while ε r ≈ 80 for pure water. The effect of a dielectric is to increase the
capacitance of a capacitor: if the air between the plates of a capacitor is replaced by a
ε0 A ε0 A
dielectric, C = → εr .
d d
Q.1 Suppose a dielectric material is inserted between the plates of a parallel plate capacitor.
What will happen to the electric field in the gap between plates? To the total stored
energy in the capacitor?
Q.2 Two capacitors of 10 µ F and 15 µ F are joined together in parallel. A third capacitor of
15 µ F is placed in series with these two. What is the total capacitance?
1. Electric current is the flow of electrical charge. If a small amount of charge dq flows in
dq
time dq, then the current is i = . If the current is constant in time, then in time t , the
dt
current that flows is q = i × t. The unit of charge is ampere, which is define as:
1 coulomb
1 ampere =
second
A car's battery supplies upto 50 amperes when starting the car, but often we need to deal
with smaller values:
1 milliampere = 1 ma = 10−3 A
1 microampere = 1 µ A = 10−6 A
1 nanoampere = 1 nA = 10−9 A
1 picoampere = 1 pA = 10−12 A
2. The direction of current flow is the direction in which positive charges move. However, in
a typical wire, the positive charges are fixed to the atoms and it is really the negative
charges (electrons) that move. In that case the direction of current flow is reversed.
electron flow
conventional
current flow
device
+ −
3. Current flows because something forces it around a circuit. That "something" is EMF,
electromotive force. But remember that we are using bad terminology and that EMF is not
a force - it is actually the difference in electric potentials between two parts of a circuit. So,
in the figure below, V = Va − Vb is the EMF which causes current to flow in the resistor.
How much current? Generally, the larger V is , the more current will flow and we expect
I ∝ V . In general this relation will not be completely accurate but when it holds, we say
V V
Ohm's Law applies: I = . Here, R = is called the resistance.
R I
R
I I
I ( Amps ) R=
∆i
∆V does not obey
Ohm's Law
obeys Ohm's Law
V ( volts )
5. Charge is always conserved, and therefore current is conserved as well. This means that
when a current splits into two currents the sum remains constant, i1 = i2 + i3 .
i2
i1
i3
6. When resistors are put in series with each other, the same current flows through both. So,
V1 = iR1 and V2 = iR2 . The total potential drop across the pair is V = V1 + V2 = i ( R1 + R2 ).
⇒ Req = R1 + R2 . So resistors in series add up.
R1 R2
V1 V2
7. Resistors can also be put in parallel. This means that the same
V V
voltage V is across both. So the currents are i1 = , i2 = .
R1 R2
V V V
Since i = i1 + i2 it follows that i = = + a R1 b
Req R1 R2 R2
1 1 1 RR
⇒ = + or Req = 1 2 . a Req b
Req R1 R2 R1 + R2
This makes sense: with two possible paths the current will find
less resistance than if only one was present.
8. When current flows in a circuit work is done. Suppose a small amount of charge dq is
moved through a potential difference V. Then the work done is dW = Vdq = V idt. Hence
9. Kirchoff's Law: The sum of the potential differences encountered in moving around a
closed circuit is zero. This law is easy to prove: since the electric field is conservative,
therefore no work is done in taking a charge all around a circuit and putting it back
where it was. However, it is very useful in solving problems. As a trivial example,
consider the circuit below. The statement that, starting from any point a we get back
to the same potential after going around is: Va − iR + ε = Va . This says − iR + ε = 0.
a b
+ i
ε i R
− i
d c
Make sure that you understand each of these, and then check that the solution is:
ε1 ( R2 + R3 ) − ε 2 R3 ε1R3 − ε 2 ( R1 + R3 ) −ε1R2 − ε 2 R1
i1 = i2 = i3 =
R1R2 + R2 R3 + R1R3 R1R2 + R2 R3 + R1R3 R1R2 + R2 R3 + R1R3
12. A charge inside a wire moves under the influence of the applied electric field and suffers
many collisions that cause it to move on a highly irregular, jagged path as shown below.
Nevertheless, it moves on the average to the right at the "drift velocity" (or speed).
13. Consider a wire through which charge is flowing. Suppose that the number of charges
per unit volume is n. If we multiply n by the crossectional area of the wire A and the
length L, then the charge in this section of the wire is q = ( nAL ) e. If the drift velocity
of the charges is v d , then the time taken for the charge to move through the wire is
L q nALe
t= . hence the current is i = = = nAevd . From this we can calculate the drift
vd t L / vd
i
velocity of the charges in terms of the measured current, vd = . The current density,
nAe
i
which is the current per unit crossectional area is defined as j = = nevd . If j varies
A
r r
inside a volume, then we can easily generalize and write, i = ∫ j ⋅ dA.
L
I A I
V
Q.1 a) Sometimes people replace thin fuse wire with thick fuse wire. Why is this a wrong
and dangerous practice?
b) Give two examples where currents flow outside of wires.
Q.2 In the lecture, you learned about the internal resistance of a battery. Suppose that the
terminals of a battery of 1.5 V and internal resistance 3Ω are connected by an ammeter.
The ammeter shows that 300 ma flows. What is the internal resistance of the ammeter?
Q.3 A certain electronic device has a current voltage behaviour which is very non-linear,
I = 3(eV / V0 − 1), where V0 = 2.
a) Plot the current versus voltage upto a maximum voltage of 3.
b) What is the incremental resistance at V=2 ?
c) Expand the exponential for very small values of V, and hence find the incremental
resistance as V → 0.
d) Explain in your own words why Ohm's law does not apply for this device.
Q.4 Apply Kirchoff's Law to the circuit below and find the currents. Take the following
parameters: R1 = 5Ω, R2 = 5Ω, R3 = 15Ω, ε1 = 4V , ε 2 = 6V , ε 3 = 25V .
ε3 ε2 ε1
i2
i2 i1 R1 i1
R2
Q.5 A metal wire of diameter 0.5mm has resistivity 6 ohms per metre. A different kind of
metal wire with diameter 0.25mm has resistivity 12 ohms per metre. Calculate the ratio
of the drift velocities in the two metals.
1. The magnetic field exerts a force upon any charge that moves
in the field. The greater the size of the charge, and the faster r r r
F = qv × B
it moves, the larger the force. The direction of the force is
perpendicular to both the direction of motion and the magnetic r
r r B
field. If θ is the angle between v and B, then F = qvB sin θ is the + θ
r r q
magnitude of the force. This vanishes when v and B are r
parallel (θ = 0), and is maximum when they are perpendicular.
v
2. The unit of magnetic field that is used most commonly is the tesla. A charge of one
coulomb moving at 1 metre per second perpendicularly to a field of one tesla experiences
a force of 1 newton. Equivalently,
newton newton
1 tesla = 1 =1 = 104 gauss (CGS unit)
coulomb ⋅ meter/second ampere ⋅ meter
In order to have an appreciation for how much a tesla is, here are some typical values of
the magnetic field in these units:
Earth's surface 10-4 T
Bar magnet 10-2 T
Powerful electromagnet 1T
Superconducting magnet 5T
3. When both magnetic and electric fields are present at a point, the total force acting upon
r r r r
a charge is the vector sum of the electric and magnetic forces, F = qE + qv × B. This is
known as the Lorentz Force. Note that the electric force and magnetic force are very
different. The electric force is non-zero even if the charge is stationary, and it is in the
r
same direction as E.
4. The Lorentz Force can be used to select charged particles of whichever velocity we want.
In the diagram below, particles enter from the left with velocity v. They experience a force
due to the perpendicular magnetic field, as well as force downwards because of an electric
field. Only particles with speed v = E / B are undeflected and keep going straight.
+ + ++ + + FB = q v B
⊗ ⊗ ⊗ ⊗ +
⊗ ⊗ ⊗ ⊗
+⊗ ⊗ ⊗ ⊗
+
⊗ ⊗ ⊗ ⊗
−−−−−−
FE = qE
E
qE = qvB ⇒ v = velocity selector !!
B
6. A magnetic field bends a charged particle into a circular orbit because the particle feels
a force that is directed perpendicular to the magnetic field. As we saw above, the particle
cannot change its speed, but it certainly does change direction! So it keeps bending and
bending until it makes a full circle. The radius of orbit can be easily calculated: the magnetic
v2
and centrifugal forces must balance each other for equilibrium. So, qvB = m and we
r
mv
find that r = . A strong B forces the particle into a tighter orbit, as you can see. We
qB
v qB
can also calculate the angular frequency, ω = = . This shows that a strong B makes
r m
the particle go around many times in unit time. There are a very large number of applications
of these facts.
⊗ ⊗ ⊗ ⊗r
− v
r
⊗ ⊗ F⊗ r ⊗ v2 mv
F qv B = m ⇒ r= .
⊗ ⊗ ⊗ r qB
⊗
r
⊗ ⊗ F⊗ ⊗ r
r
v v
⊗ ⊗ ⊗ ⊗
7. The fact that a magnetic field bends charged particles is responsible for shielding the
earth from harmful effects of the "solar wind". A large number of charged particles are
released from the sun and reach the earth. These can destroy life. Fortunately the earth's
magnetic field deflects these particles, which are then trapped in the "Van Allen" belt
around the earth.
9. A wire carries current, and current is flowing charges. Since each charge experiences a
force when placed in a magnetic field, you might expect the same for the current. Indeed,
that is exactly the case, and we can easily calculate the force on a wire from the force on
individual charges. Suppose N is the total number of charges and they are moving at the
r r r r
average (or drift) velocity v d . Then the total force is F = Nevd × B. Now suppose that the
wire has length L, crossectional area A, and it has n charges per unit volume. Then clearly
r r r
N = nAL, and so F = nALevd × B. Remember that the current is the charge that flows
r r
through the wire per unit time, and so nAev d = I . We get the important result that the
r r r
force per unit length on the wire is F = I × B.
11. Magnetic fields are produced by currents. Every small bit of current produces a small
amount of the B field. Ampere's Law, illustrated below, says that if one goes around a
loop (of any shape or size) then the integral of the B field around the loop is equal to the
enclosed current. In the loop below I = I1 + I 2 . Here I 3 is excluded as it lies outside.
r r
i1 •
∫ B ⋅ ds = µ0 I enclosed
•i3 Amperian loop
r
• i2 r
ds
θ B
13. Assuming that the current flows uniformly over the crossection, we can use Ampere's
Law to calculate the magnetic field at distance r , where r now lies inside the wire.
r r
B ds
R ⎛ π r2 ⎞ µ Ir
B ( 2π r ) = µ 0 I ⎜ 2 ⎟
⇒B= 0 2
⎝πR ⎠ 2π R
Here is a sketch of the B field inside and outside the wire as a function of distance r.
inside outside
r
B r=R
Q.1 In the lecture and notes, I have emphasized that a magnetic field can only be measured
through the magnetic force acting upon a moving charged particle. Yet the commonest
way to measure the field is by means of a compass. How are the statements consistent
with each other?
Q.2 A pair of capacitor plates is charged upto some potential and an electric field exists in
the gap between them. Suppose an electron is released from the negative plate. Discuss
qualitatively its path if a magnetic field is applied perpendicular to the electric field if:
a) the B field is not too strong,
b) the B field is very strong.
F
B
I
Q.4 In the figure below, an infinitely long solenoid has N turns of wire per unit length and the
current in the wire is I . The goal is to find the magnetic field along the axis. The ⊗ and
• denote current going in/out of the paper.
h c
d
• • • • • • • • • • •
a b
⊗ ⊗ ⊗ ⊗ ⊗ ⊗ ⊗ ⊗ ⊗ ⊗ ⊗
2. A fundamental law of magnetism states that the net flux through a closed surface is always
r r
zero, Φ B = ∫ B ⋅ dA = 0. Note that this is very different from what you learned earlier in
electrostatics where the flux is essentially the electric charge. There is no such thing as a
magnetic charge! What we call the magnetic north (or south) pole of a magnet are actually
due to the particular electronic currents, not magnetic charges. In the bar magnet below,
no matter which closed surface you draw, the amount of flux leaving the surface is equal
to that entering it.
N S
Example : A sphere of radius R is placed near a long, straight wire that carries a steady
current I . The magnetic field generated by the current is B. Find the total magnetic flux
passing through the sphere.
3. Faraday's Law for Induced EMF: when the magnetic flux changes in a circuit, an electro-
Solution: Here the loop area changes, hence the flux. So the induced emf is:
⎡ 0 − π (0.12) 2 × 0.15 ⎤
ε = − dΦB ⎡ final flux − initial flux ⎤
≈ −⎢ ⎥⎦ = − ⎢ ⎥ = 0.034 Volts.
dt ⎣ time taken ⎣ 0.2 ⎦
⊗ ⊗ ⊗ ⊗ ⊗ ⊗
⊗ ⊗ ⊗ ⊗ ⊗ ⊗ r
D v
⊗ ⊗ ⊗ ⊗ ⊗ ⊗
⊗ ⊗ r⊗ ⊗ ⊗ ⊗
F3 r
B (into paper)
Solution: As the wire is pulled to the right, the area of the circuit increases and so the
flux increases. Measure x as above so that x = vt , i.e. x keeps increasing as we pull.
The flux at any value of x is Φ B = BDx, and so,
d ( BDx )
ε = − dΦB =− = − BD
dx
= BDv.
dt dt dt
ε BD v
To calculate the current, we simply use Ohm's Law: I = = . Here R is the
R R
resistance of the circuit.
Example : Find the power dissipated in the above circuit using I 2 R, and then by directly
calculating the work you do by pulling the wire.
B 2 D2 v2
Solution: Clearly I 2 R = is the power dissipated, as per usual formula. Now let us
R
calculate the force acting upon the piece of wire (of length D) that you are pulling. From
r r r B2D2v
the formula for the force on a wire, F = I L × B, the magnitude is F = IBD = . So,
R
B 2 D2 v2
the power is P = Fv = . This is exactly the value calculated above!
R
5. Lenz's Law : The direction of any magnetic induction effect is such as to oppose the cause
of the effect. Imagine a coil wound with wire of finite resistance. If the magnetic field
decreases, the induced EMF is positive. This produces a positive current. The magnetic
field produced by the current opposes the decrease in flux. Of course, because of finite
resistance in loop, the induced current cannot completely oppose the change in flux.
Q.2 You have a circular coil and a uniform magnetic field between the poles of a magnet.
How will you spin the coil so that the maximum emf is induced in it?
Q.3 Land mines made of metal can be detected by sending in a pulse of magnetic field but
plastic land mines cannot be detected in this way. Why?
Q.4 In the figure below the switch is closed in the primary coil and a current is induced in
the secondary coil.
1. Alternating current (AC) is current that flows first in one direction along a wire, and then
in the reverse direction. The most common AC is sinuisoidal in which the current (and
voltage) follow a sine function, as in the graph below. The average value is zero because
the current flows for the same time in one direction as in the other.
εm
ε = ε m sin ω t
ε ε average ≡ < ε > = 0
However, the square of any AC wave is always positive. Thus its average is not zero. As
we shall see, upon averaging the square we get half the square of the peak value.
ε rms
ε 02
ε
ε0
The calculation follows: if there is a sine wave of amplitude (height) equal to one and
frequency ω , (ω = 2π / T , T=time period), then the squared amplitude is sin 2 ωt and its
average is:
1T 1 T 1 − cos 2ω t 1T 1 1
T ∫0 T ∫0 T ∫0 2 2
< sin 2 ωt > ≡ dt sin 2
ω t = dt ( ) = dt =
2
Taking the square root gives the root mean square value as 1/ 2 of the maximum value.
Of course, it does not matter whether this is of the voltage or current:
ε m2 εm I m2 I
ε rms = = = 0.707ε m , and I rms =
= m = 0.707 I m .
2 2 2 2
Exactly the same results are obtained for cosine waves. This is what one expects since
the difference between sine and cosine is only that one starts earlier than the other.
∂Φ
Suppose that the flux in the core is Φ and that its rate of change is . Call the number
∂t
of turns in the primary and secondary N p and N s respectively. Then, from Faraday's law,
∂Φ ∂Φ
the primary emf is ε p = − N p and the secondary emf is ε s = − N s . The ratio is,
∂t ∂t
εp N p N N
= . So, the secondary emf is ε s = s ε p . If s is less than one, then it is called
εs Ns Np Np
a step-down transformer because the secondary voltage is less than the input voltage.
Else, it is a step-up transformer. Both types are used.
4. If this is a lossless transformer (and good transformers are 99% lossless), then the input
power must equal the output power, ε p I p = ε s I s . From above, this shows that the ratio
Np
of currents is I s = I p.
Ns
5. Whatever the shape or size of a current carrying loop, the magnetic flux that passes through
it is proportional to the current, Φ ∝ I . The inductance L (called the self-inductance if
6. Let us calculate the inductance of a long coil wound with n turns per unit length. As
calculated earlier, the B field is B = µ0 nI , and the flux passing through N turns of the
coil of length l and area A is N Φ B = ( nl )( BA ) = µ0 n 2 IAl. From the definition, we find:
N Φ B µ0 n 2lIA
L= = = µ0 n 2lA (inductance of long solenoid).
I I
Example: Find the inductance of a coil with 3500 turns, length 10 cm, and radius 5cm.
π ( 0.05m )
2
T ⋅m T ⋅ m2
Solution: L = 4π ⋅ 10 −7
⋅ 35002 ⋅ = 1.21 = 1.21H .
A 0.10m A
value. Units: [τ ] =
[ L ] = henry = volt.second / ampere = ⎛ volt ⎞ second = second.
⎜ ⎟
[ R ] ohm ohm ⎝ ampere.ohm ⎠
R R
9. When we pass current through an inductor a changing magnetic field is produced. This,
by Faraday's Law, induces an emf across the coil. So work has to be done to force the
current through. How much work? The power, or rate of doing work, is emf × current.
dU B ⎛ dI ⎞ dI
Let U B be the work done in passing current I . Then, = ⎜ L ⎟ I = LI . Let us
dt ⎝ dt ⎠ dt
UB I
1 2
integrate dU B = LI dI . Then, ∫0
dU B = ∫ LIdI ⇒ U B =
0
2
LI . This is an important
1 2
result. It tells us that an inductor L carrying current I requires work LI . By conservation
2
of energy, this is also the energy stored in the inductor. Compare this result with the result
1
U E = CV 2 for a capacitor. Notice that C ↔ L and V ↔ I .
2
10. Let us use the result derived earlier for the inductance of a solenoid and the magnetic field
1 1
in it, L = µ0 n 2lA and B = µ0 nI . Putting this into U B = LI 2 gives U B = ( µ0 n 2lA)( B / µ0 n) 2 .
2 2
2
UB B
Divide the energy by the volume of the solenoid, = . This directly gives the
volume 2 µ0
energy density (energy per unit volume) contained in a magnetic field.
Q.1 a) What is the average value of the half-wave sine currents (maximum value=1) below?
b) Repeat the above for the square of the currents below.
t t
Q.2 Suppose you have a wire that you have to wind into a small space. How should you
wind it so that a)the inductance is maximum, b)the inductance is minimum?
Q.3 a) Two coils with inductance L1 and L2 are connected in series. Calculate the total L.
b) The two coils are now connected in parallel. Calculate the total L.
c) In both the above parts, what assumption did you have to make? Would your answers
be correct if the two coils were close to each other?
dI
Q.4 In the circuit below, first show that Kirchoff's Law gives L = V0 sin ωt. Then show that
dt
V0 V
the solution is: I = − cos ωt = 0 sin(ωt − π2 ).
ωL ωL
V0 sin ω t
Q.5 In point no. 11 above, the charge on the capacitor was calculated to be q = qm cos ω t.
a) What is meaning of qm ? Relate this to the maximum current that flows in the circuit.
b) What is the maximum value of the energy stored in the capacitor? In the inductor?
c) At what value of the time will there be equal energy in both parts?
1. Before the investigations of James Clerk Maxwell around 1865, the known laws of
electromagnetism were:
r r q
a) Gauss' law of electricity: ∫ E ⋅ dA = (integral is over any closed surface)
ε0
r r
b) Gauss' law of magnetism: ∫ B ⋅ dA = 0 (integral is over any closed surface)
r r dΦB
c) Faraday's law of induction: ∫ E ⋅ ds = − (integral is over any closed loop)
dt
r r
d) Ampere's law: ∫ B ⋅ ds = µ0 I (integral is over any closed loop)
2. But Maxwell realized that the above 4 laws were not consistent with the conservation of
charge, which is a fundamental principle. He argued that if you take the space between
two capacitors (see below) and take different surfaces 1,2,3,4 then applying Ampere's
r r r r
Law gives an inconsistency: ⎡ ∫ B ⋅ ds ⎤ ≠ ⎡ ∫ B ⋅ ds ⎤ because obviously charge cannot
⎣ ⎦ (1,2,4) ⎣ ⎦3
flow in the gap between plates. So Ampere's Law gives different results depending upon
which surface is bounded by the loop shown!
circuit
r r
Maxwell modified Ampere's law as follows: ∫ B ⋅ ds = µ ( I + I ) where the "displacement
0 d
dΦE
current" is I d = ε 0 . Let's look at the reasoning that led to Maxwell's discovery of the
dt
dQ
displacement current. The current that flows in the circuit is I = . But the charge on the
dt
d d ( EA) dΦE
capacitor plate is Q = ε 0 EA. Hence, I = ( ε 0 EA ) = ε 0 = ε0 ≡ I D . In words, the
dt dt dt
changing electric field in the gap acts as source of the magnetic field in just the same way as
the current in the outside wires. This is really the most important point - a magnetic field
may have two separate reasons for existence - flowing charges or changing electric fields.
wavelength
amplitude
wavelength
node
Example: Red light has λ = 700 nm. The frequency ν is calculated as follows:
3.0 × 108 m / sec
ν= −7
= 4.29 × 1014 Hertz
7 × 10 m
By comparison, the electromagnetic waves inside a microwave oven have wavelength of
6 cm, radio waves are a few metres long. For visible light, see below. On the other hand,
X-rays and gamma-rays have wavelengths of the size of atoms and even much smaller.
In the diagram above, an electromagnetic wave is moving in the z direction. The electric
field is in the x direction, Ex = E0 sin(kz − ω t ), and the magnetic field is perpendicular to
it, By = B0 sin(k z − ω t ). Here ω = kc. From Maxwell's equations the amplitudes of the two
fields are related by E0 = cB0 . Note that the two fields are in phase with each other.
sin 2 θ
The power, which is the square of the amplitude, falls off as I (θ ) ∝ . The sin 2 θ
r2
dependence shows that the power is radiated unequally as a function of direction. The
maximum power is at θ = π / 2 and the least at θ = 0.
8. As we have seen, the electric field of a wave is perpendicular to the direction of its motion.
If this is a fixed direction (say, xˆ ), then we say that wave is polarized in the x direction.
Most sources - a candle, the sun, any light bulb - produce light that is unpolarized. In
this case, there is no definite direction of the electric field, no definite phase between the
orthogonal components, and the atomic or molecular dipoles that emit the light are randomly
oriented in the source. But for a typical linearly polarized plane electromagnetic wave
E0
polarized along xˆ , Ex = E0 sin( kz − ω t ), By =sin(kz − ω t ) with all other components zero.
c
Of course, it may be that the wave is polarized at an angle θ relative to xˆ , in which case
Ex = E0 cos θ ⋅ sin(kz − ω t ), E y = E0 sin θ ⋅ sin(k z − ω t ), Ez = 0.
microwave
oven
polarization
metal plate
Q.1 A star that suddenly explodes is called a supernova. Looking at supernova events in
different parts of the universe, do you think we can learn about what the laws of physics
were at different times in the past?
Q.2 Near a wire carrying current I (t ), use Ampere's Law to show that the magnetic field is
µ0 I (t )
B(t ) = . What is the direction of the field? Now suppose that the current varies
2π r
rapidly with time. Do you expect your answer to be still correct? Why?
Q.4 Starting from Maxwell's equations, it is possible to show that the power radiated by a
2
single charge q with acceleration a is P = 3 q 2 a 2 . Accepting this, answer the following:
3c
a) Check if the expression for P is dimensionally correct.
b) Why would an answer that is proportional to q (rather than q 2 ) be physically wrong?
c) Why would an answer that is proportional to a (rather than a 2 ) be physically wrong?
d) Suppose that electrons and protons with equal velocities are injected into a constant
magnetic field. Both bend around, and so both experience acceleration. Given that a
proton is 2000 heavier than an electron, find the ratio of the energy radiated by a
proton to that of an electron.
e) Repeat the above if they are injected with the same energy instead of same velocity.
1. Light travels very fast but its speed is not infinite. Early attempts to measure the speed
using earth based experiments failed. Then in 1675 the astronomer Roemer studied timing
of the eclipse of one of Jupiter's moon called Io. In the diagram below Io is observed with
Earth at A and then at C. The eclipse is 16.6 minutes late, which is the time taken for light
to travel AC. Roemer estimated that c = 3 × 108 metres/sec, a value that is remarkably
close to the best modern measurement, c = 299792458.6 metres/sec.
3. If light contained all frequencies with equal strength, it would appear as white to us.
course, most things around us appear coloured. That is because they radiate more strongly
in one range of frequency than in others. If there is more intensity in the yellow range
than the green range, we will see mostly yellow. The sky appears blue to us on a clear
day because tiny dust particles high above in the atmosphere reflect a lot of the blue light
coming from the sun. In the figure below you can see the hump at smaller wavelengths.
4. What path does light travel upon? If there is no obstruction, it obviously likes to travel on
straight line which is the shortest path between any two points, say A and B. Fermat's
Principle states that in all situations, light will always take that path for which it takes the
least time. As an example, let us apply Fermat's Principle to the case of light reflected from
a mirror, as below.
B
θ1′
A
θ1
a ′ b
θ1 θ1
x d−x
is t = L / c. To find the smallest time, we must differentiate and then set the derivative
dt 1 dL
to zero, 0= =
dx c dx
1 1
= ( a 2 + x 2 ) ( 2 x ) + ⎡b 2 + ( d − x ) ⎤ ( 2 )( d − x )( −1)
−1/ 2 2 1/ 2
2c 2c ⎣ ⎦
x d−x
From here we immediately see that = From the
a +x b2 + ( d − x )
2 2 2
above diagram, sin θ1 = sin θ1′. Of course, it is no surprise that the angle of reflection
equals the angle of incidence. You knew this from before, and this seems like a very
complicated derivation of a simple fact. But it is still nice to see that there is a deeper
principle behind it.
d
Fermat's Principle says that the time t must be C
dt 1 dL n1 2 n
= ( a + x 2 ) ( 2 x ) + 2 ⎡b 2 + ( d − x ) ⎤ ( 2 )( d − x )( −1)
−1/ 2 2 1/ 2
minimized: 0 = =
dx c dx 2c 2c ⎣ ⎦
x d−x
And so we get "Snell's Law", n1 = n2 or n1 sin θ1 = n2 sin θ 2 . This
a2 + x2 b + (d − x)
2 2
required a little bit of mathematics, but you can see how powerful Fermat's Principle is!
7. Light coming from air into water bends toward the normal. Conversely, light from a
source in the water will bend away from the normal. What if you keep increasing the
angle with respect to the normal so that the light bends and begins to just follow the
surface? This phenomenon is called total internal reflection and θ c is called the critical
n1
n2 θc
8. Fibre optic cables, which are now common everywhere, make use of the total internal
reflection principle to carry light. Here is what a fibre optic cable looks like from inside:
Even if the cable is bent, the light will continue to travel along it. The glass inside the
cable must have exceedingly good consistency - if it thicker or thinner in any part, the
refractive index will become non-uniform and a lot of light will get lost. Optical fibres
now carry thousands of telephone calls in a cable whose diameter is only a little bigger
than a human hair!
Q.1 If you look at the reflection of street light from across a body of water, it appears long
in one direction but not the other. Explain.
Q.2 You are given a glass cube and a laser. How will you find the speed of light inside the glass?
Q.3 With reference to the fibre optic cable above, which refractive index must be bigger - that
of the core glass or the cladding glass? If the core glass has n = 1.5, what must be n for the
cladding if θc = 10o.
Q.4 If light passes through a transparent medium, can we say that there is interaction between
that medium and the light?
1. In this lecture I shall deal with the 4 basic ways in which light interacts with matter:
a) Emission - matter releases energy as light.
b) Absorption - matter takes energy from light.
c) Transmission - matter allows light to pass through it.
d) Reflection - matter repels light in another direction.
2. When an object (for example, an iron rod or the filament of a tungsten bulb) is heated, it
emits light. When the temperature is around 800o C, it is red hot. Around 2500o C it is
yellowish-white. At temperatures lower than 800o C, infrared (IR) light is emitted but our
eyes cannot see this. This kind of emission is called blackbody radiation. Blackbody
radiation is continuous - all wavelengths are emitted. However most of the energy is
radiated close to the peak. As you can see in the graph, the position of the peak goes to
smaller wavelengths (or higher frequencies) as the object becomes hotter. The scale of
temperature is shown in degrees Kelvin (o K ). To convert from o C to o K , simply add 273.
We shall have more to say about the Kelvin scale later.
Where exactly does the peak occur? Wien's Law states that λmaxT = 2.90 × 10−3 m K. We
can derive this in an advanced physics course, but for now you must take this as given.
3. In the lecture on electromagnetic waves you had learnt that these waves are emitted when
charges accelerate. Blackbody radiation occurs for exactly this reason as well. When a
body is heated up, the electrons, atoms, and molecules which it contains undergo violent
random motion. Light may emitted by electrons in one atom and absorbed in another. Even
an empty box will be filled with blackbody radiation because the sides of the box are
4. When can you use Wien's Law? More generally, when can you expect a body to emit
blackbody radiation? Answer: only for objects that emit light, not for those that merely
reflect light (e.g. flowers). The Sun and other stars obey Wien's Law since the gases they
are composed of emit radiation that is in equilibrium with the other materials. Wien's law
allows astronomers to determine the temperature of a star because the wavelength at
which a star is brightest is related to its temperature.
5. All heated matter radiates energy, and hotter objects radiate more energy. The famous
Stefan-Boltzman Law, which we unfortunately cannot derive in this introductory course,
states that the power radiated per unit area of a hot body is P = σ T 4 , where the Stefan-
Boltzman constant is σ = 5.67 × 10-8 W m -2 K -4 .
6. Let us apply P = σ T 4 for finding the temperature of a planet that is at distance R from the
sun. The sun has temperature Tsun and radius Rsun . In equilibrium, the energy received from
the sun is exactly equal to the energy radiated by the planet. Now, the total energy radiated
1
by the sun is σ T04 × 4π Rsun
2
. But on a unit area of the planet, only of this is received.
4π R 2
1
So the energy received per unit area on the planet is σ T04 × 4π Rsun2
× . This must be
4π R 2
R
equal to σ T 4 ⇒ T = T0 sun .
R
7. The above was for blackbody radiation where the emitted light has a continuous spectrum.
But if a gas of identical atoms is excited by some mechanism, then only a few discrete
wavelengths are emitted. Each chemical element produces a very distinct pattern of colors
called an emission spectrum. So, for example, laboratory hydrogen gas lamps emit 3 lines in
the visible region, as you can see below. Whenever we see 3 lines spaced apart in this way,
we immediately know that hydrogen gas is present. It is as good as the thumbprint of a man!
9. Everything that I have said about the emission of light applies exactly to the absorption
of light as well. So, for example, when white light (which has all different frequencies
within it) passes through hydrogen gas, you will see that all wavelengths survive except
the three on the previous page. So the absorption spectrum looks exactly the same as the
emission spectrum - the same lines are emitted and absorbed. This how we know that there
are huge clouds of hydrogen floating in outer space. See the diagram below.
10. The atmosphere contains various gases which absorb light at many different wavelengths.
Molecules of oxygen, nitrogen, ozone, and water have their own absorption spectra, just
as atoms have their own.
11. All the beauty of colours we see is due to the selective absorption by molecules of certain
frequencies. So, for example, carotene is a long, complicated molecule that makes carrots
orange, tomatoes red, sarson yellow, and which absorbs blue light. Similarly chlorophyll
makes leaves green and which absorbs red and blue light.
12. From unpolarized light we can make polarized light by passing it through a polarizer as
shown below.
Each wave is reduced in amplitude by cosθ , and in intensity by cos 2 θ . The wave that
emerges is now polarized in the θ direction.
13. We can design materials (crystals or stressed plastics) so that they have different optical
properties in the two transverse directions. These are called birefringent materials. They
are used to make commonly used liquid crystal displays (LCD) in watches and mobile
phones. Birefringence can occur in any material that possesses some asymmetry in its
structure where the material is more springy in one direction than another.
Q.1 a) Taking the earth's average temperature to be 300o K, find the frequency and wavelength
at which maximum emission occurs.
b) Repeat the above for the sun, for T ≈ 4700o K.
Q.2 Estimate the total amount of radiation emitted in 24 hours by a human being. Make a
reasonable estimate of the surface area of the body. For the purposes of this calculation,
does it make any difference that he/she is wearing clothes?
Q.3 In calculating the temperature of a planet in the solar system, I had made the assumption
that all the energy absorbed was re-radiated into space.
a) Why is this assumption correct and under what circumstances?
b) Suppose that only 60% of the sun's radiation is absorbed, and that the remainder is
reflected back into space. Redo the calculation under this assumption.
c) Carbon dioxide gas in the atmosphere absorbs certain frequencies in the infrared
region. What effect will this have on the temperature of the earth?
Q.4 A wave that is linearly polarized in the x direction has the following electric and magnetic
E0
fields: Ex = E0 sin(k z − ω t ), E y = 0, Ez = 0, Bx = 0, By = sin(k z − ω t ), Bz = 0
c
a)Write the fields for a wave that is polarized in the y direction.
b)Write the fields for a wave that is equally polarized in the x and y directions.
Q.5 A birefringent material has refractive indices 1.51 and 1.53 in the x and y directions
respectively. A ray of red light enters the material, which has thickness 2cm.
a) Suppose the light is polarized in the x direction. How long will the light take to cross
the crystal? What if it is polarized in the y direction?
b) Suppose the light is equally polarized in the x and y directions? Compute the phase
difference between the two waves.
1. Two waves (of any kind) add up together, with the net result being the simple sum of the
two waves. Consider two waves, both of the same frequency, shown below. If they start
together (i.e. are in phase with each other) then the net amplitude is increased. This is
called constructive interference. But if they start at different times (i.e. are out of phase
with each other) then the net amplitude is decreased. This is called destructive interference.
destructive
constructive
In the example above, both waves have the same frequency and amplitude, and so the
resulting amplitude is doubled (constructive) or zero (destructive). But interference
occurs for any two waves even when their amplitudes and frequencies are different.
3. Wherever there is a bright fringe, constructive interference has occured, and wherever
there is a dark fringe, destructive interference has occured. We shall now calculate
where on the third screen the interference is constructive. Take any point on the third
screen. Light reaches this point from both S1 and S2 , but it will take different amounts
of time to get there. Hence there will be a phase difference that we can calculate. Look
at the diagram below. You can see that light from one of the slits has to travel an extra
⎝L⎠ ⎝ 2.3m ⎠
d ⎛ 8.5 × 10−5 ⎞
λ= sin θ = ⎜ −7
⎟ sin(3.0 ) = 4.4 × 10 m = 440nm (nanometres).
o
n ⎝ 10 ⎠
0o phase change
n >1 d
To simplify matters, suppose that you are looking at the thin film almost directly from
above. Here n is the index of refraction for the medium. Then,
The condition for destructive interference is: 2nd = mλ (m = 0,1,2,...)
1
The condition for constructive interference is : 2nd = (m + )λ (m = 0,1,2,...).
2
Prove it!
9. Light with wavelength of 511 nm forms a diffraction pattern after passing through a
single slit of width 2.2×10-6 m. Find the angle associated with (a) the first and (b) the
second bright fringe above the central bright fringe.
⎛ mλ ⎞ −1 ⎛ (1)(511 × 10 m ⎞
−9
SOLUTION: For m = 1, θ = sin −1 ⎜ ⎟ = sin ⎜ −6 ⎟ = 13.4
o
⎝ W ⎠ ⎝ 2.20 × 10 m ⎠
⎛ mλ ⎞ −1 ⎛ (2)(511 × 10 m ⎞
−9
For m = 2, θ = sin −1 ⎜ ⎟ = sin ⎜ −6 ⎟ = 27.7
o
⎝W ⎠ ⎝ 2.20 × 10 m ⎠
Q.1 In the lecture on sound, I demonstrated interference of waves coming from two
separate loudspeakers. Why is not possible to demonstrate interference of waves
coming from two separate light sources?
Q.2 In the two-slit experiment (see the diagram next to point no. 2 above) what would
happen to the pattern on the third screen if one of the slits is covered up?
Q.3 Simplify f (t ) = sin ω t + cos ω t using a trigonometric identity that you certainly know.
Then make a plot of f 2 (t ) from t = 0 to 2π / T .
Q.4 In a double slit experiment, green light of wavelength 550 nm illuminates slits that
are 1.5mm apart. The screen is 2 m away. What will be the separation between the
dark fringes?
Q.5 A compact disc player uses a laser to reflect light from the metal layer deposited on a
protective coating, as shown below. When the light reflected from both segments is
combined at the receiver, interference from the two waves results. We can therefore
whether there is "one" or "zero" at that point on a CD.
1. In the previous lecture I gave you some very strong reasons to believe that light is waves.
Else, it is impossible to explain the interference and diffraction phenomena that we see in
innumerable situations. Interference from two slits produces the characteristic pattern.
2. Light is waves, but waves in what? of what? The thought that there is some invisible
medium (given the name aether ) turned out to be wrong. Light is actually electric and
magnetic waves that can travel through empty space. The electric and magnetic waves
are perpendicular to each other and to the direction of travel (here the z direction).
3. Electromagnetic waves transport linear momentum and energy. If the energy per unit
volume in a wave is U then it is carrying momentum p, where p = U / c. Waves with large
amplitude carry more energy and momentum. For the sun's light on earth the momentum is
rather small (although it is very large close to or inside the sun). Nevertheless, it is easily
measureable as, for example, in the apparatus below. Light strikes a mirror and rebounds.
Thus the momentum of the light changes and this creates a force that rotates the mirror.
The force is quite small - just 5 × 10-6 Newtons per unit area (in metre 2 ) of the mirror.
and knocks out electrons that travel to the anode. A current flows only as long as the light
is shining. Above the threshold frequency, the number of electrons ejected depends on the
intensity of the light. This was called the photoelectric effect. The following was observed:
a) The photoelectric effect does not occur for all frequencies ν ; it does not occur at all
when ν is below a certain value. But classically (meaning according to the Maxwell
nature of light as an electromagnetic wave) electrons should be ejected at any ν. If
an electron is shaken violently enough by the wave, it should surely be ejected!
b) It is observed that the first photoelectrons are emitted instantaneously. But classically
the first photoelectrons should be emitted some time after the light first strikes the
surface and excites its atoms enough to cause ionization of their electrons.
5. Explanation of the photoelectric puzzle came from Einstein, for which he got the Nobel
Prize in 1905. Einstein proposed that the light striking the surface was actually made of
little packets (called quanta in plural, quantum in singular). Each quantum has an energy
energy ε = hν (or ε = hω where h = h 2π and
ω = 2πν ) where h is the famous Planck's constant, packet of light with
energy hν (or hω )
6.626 × 10-34 Joule-seconds. An electron is kicked
out of the metal only when a quantum has energy
(and frequency) big enough to do the job. It doesn't
matter how many quanta of light - called photons -
are fired at the metal. No photoelectrons will be
released unless ν is large enough. Furthermore the
light is made of photons
photoelectrons are released immediately when the
photon hits an electron.
7. How many photons do we see? Here is a table that gives us some interesting numbers:
a) Sunny day (outdoors): 1015 photons per second enter eye (2 mm pupil).
b) Moonlit night (outdoors): 5 × 1010 photons/sec (6 mm pupil).
c) Moonless night (clear, starry sky): 108 photons/sec (6 mm pupil).
d) Light from dimmest naked eye star (mag 6.5): 1000 photons/sec entering eye.
8. Where do photons come from? For this it is necessary to first understand that electrons
inside an atom can only be in certain definite energy states. When an electron drops
from a state with higher energy to one with lower energy, a photon is released whose
energy is exactly equal to the difference of energies. Similarly a photon is absorbed
when a photon of just the right energy hits an electron in the lower state and knocks it
into a higher state.
Electrons can be here
or here
The upper and lower levels can be represented differently with the vertical direction
representing energy. The emission and absorption of photons is shown below.
Visible
Visible
Photon
Photon
Absorption Emission
9. Fluorescence and phosphorence are two phenomena observed in some materials. When
they are exposed to a source of light of a particular colour, they continue to emit light of
a different colour even after the source has been turned off. So these materials can be
10. One of the most important inventions of the 20th century is the laser which is short for:
LASER ≡ Light Amplification by the Stimulated Emission of Radiation
Lasers are important because they emit a very large number of photons all with one single
frequency. How is this done? By some means - called optical pumping - atoms are excited
to a high energy level. When one atom starts decaying to the lower state, it encourages all
the others to decay as well. This is called spontaneous emission of radiation.
Q.1 With reference to the diagram in point no. 3, suppose that light illuminates both the
mirror and the darkened plate. On which will the force be greater? To answer this, ask
on which plate is the change of momentum greater and then relate this to Newton's
Law which states that force is rate of change of momentum.
Q.2 In order to remove an electron from a certain metal, it is necessary to give at least 2.2
eV energy to the electron. What should be the minimum frequency of a beam of light
to ensure that photoelectrons are emitted? What if the frequency is larger than this?
Where will the energy go?
Q.3 A laser pulse of green light at 550 nm puts out 2 joules of energy in 1 millisecond. How
many photons does the pulse have? What if it is blue light?
Now just to make the point even more forcefully, in all three situations below, the virtual
image is in the same position although the actual object is in 3 different places.
5. Imagine that you have a sphere of radius R and that you can cut
out any piece you want. The outside or inside surface can be
silvered, as you want. You can make spherical mirrors in this
way. These can be of two kinds. In the first case, the silvering
can be on the inside surface of the sphere, in which case this is
is called a convex spherical mirror. The normal directed from
the shiny surface to the centre of the sphere (from which it
was cut out from) is called the principal axis, and the radius
of curvature is R. The other situation is that in which the
outside surface is shiny. Again, the principal axis the same,
but now the radius of curvature (by definition) is - R. What
does a negative curvature mean? It means precisely what
has been illustrated - a convex surface has a positive and a
concave surface has a negative curvature.
12. Here is how a concave (or divergent) lens forms an image. An observer on the side
opposite to the object will see the image upright and smaller in size than the object.
Divergent lens
Object
1 ⎛ 1 1 ⎞
lens is , = ( n − 1) ⎜ − ⎟ .
f ⎝ R1 R2 ⎠
Bi-convex Bi-concave
R1 > 0 R1 < 0
R2 < 0 R2 > 0
Note that a flat surface has infinite radius of curvature. The focal legth of each can be
calculated using the previous formula.
15. The "strength" of a lens is measured in diopters. If the focal length of a lens is expressed
in metres, the diopter of the lens is defined as D = 1/ f . If the refractive index of the
glass in a lens is n, then the diopters due to the first interface D1 and the second interface
D2 are, D1 = (n − 1) / R 1 and D2 = (1 − n) / R 2 . The total diopter of the lens is D = D1 + D2 .
16. For any optical system - meaning a collection of lenses and mirrors - we can define a
magnification factor as a ratio of sizes -- see the diagram below.
Optical Element
Object
Image
h h′
17. The perfect lens will focus a parallel beam of rays all to exactly the same focus. But no
lens is perfect, and every lens suffers from aberration although this can be made quite
small by following one lens with another. Below you see an example of "spherical
aberration". Rays crossing different parts of the lens do not reach exactly the same focus.
This distorts the image. Computers can design lens surfaces to minimize this aberration.
Spherical Aberration
Principal
Q.1 Is it possible to project a real image on to a screen? What about a virtual image?
Q.2 A mirror has only one focus but a lens has two. Why?
Q.3 Consider a concave (converging) lens and an object placed in front of it. By drawing
rays of light emanating from the object, answer the following:
a)Can the image ever be inverted?
b)Can the image ever be bigger than the object?
Q.4 If the refractive index of the glass in a lens is different for different lenses, what will
happen to a beam of parallel rays on a convex lens? On a concave lens?
Q.5 In the three cases below, draw the position of the images. State if they are real or virtual,
and whether bigger or smaller than the object.
c f f c
c f f c
c f f c
Q.6 Do the following experiment: hold a 5-rupee coin and gradually bring it closer to your
eye. Measure the distance (it will be approximately 25 cm) after which your eye starts
to lose focus. Now calculate the angle subtended by the coin by measuring the diameter
of the coin.
1. The ancient view was that heat is a colourless, weightless, fluid which occupies no volume
has no smell, etc. This imagined substance - called phlogiston - was supposedly stored in
objects and transferred between objects. It took a long time to reject this notion. Why is it
wrong? Because (as we will see) heat can be created and destroyed, whereas liquids keep
their volume and cannot be created or destroyed.
2. We are all familiar with an intuitive notion of temperature. We know that hotter things
have higher temperature. But let us try to define temperature more rigorously. In the
diagram below, the three bodies A,B,C are in contact with each other. After sufficient
time passes, one thing will be common to all three - a quantity that we call temperature.
A B
3. Now let us understand heat. Heat is energy, but it is a very special kind of energy: it is
that energy which flows from a system at high temprature to a system at low temperature.
high T heat low T
Stated in a slightly different way: heat is the flow of internal energy due to a temperature
difference. (Note that we do not have to know about atoms, molecules, and the internal
composition of a body to be able to define heat - all that will come later).
10. When you rub your hands, they get hot. Mechanical work has been converted into heat.
The first person to investigate this scientifically was Joule. In the experiment below, he
allowed a weight to drop. This turned a paddle that stirred up the water and caused the
temperature to rise. The water got hotter if the weight was released from a greater height.
Joule established the units for the mechanical equivalent of heat. The units we use today
are: 1 calorie (1 cal) raises the temperature of 1 g of water by 1 °C
1 cal = 4.186 Joule, 1 kilocalorie (1 kcal) = 1000 cal
Remember also that joule is the unit of work: when a force of one newton acts through a
distance of one metre, the work done is one joule.
at T0
∆L
at T
L
Look at the above diagram. Call the length of the rod at T0 as L0 . When the rod is heated
to T , then the length increases to L = L0 ⎡⎣1 + α (T − T0 ) ⎤⎦ . The difference of the lengths is,
L − L0 = α L0 (T − T0 ) , or ∆L = α L0 ∆T . Here α is just a dimensionless number that tells you
how a particular material expands. It is called the coefficient of linear expansion. If α was
∆L / L0
zero, then the material would not expand at all. You can also write it as α = .
∆T
∆V / V0
Similarly, define a coefficient of volume expansion − call it β − as β ≡ .
∆T
12. There is a relation between α and β , the linear and volume coefficients. Let's look at the
change in volume due to expansion: V ′ = ( L + ∆L)3 = ( L + α L∆T )3
= L3 + 3α L3∆T + 3α 2 L3∆T 2 + α 3 L3∆T 3
≈ L3 + 3α L3∆T = V + 3αV ∆T
We are only looking for small changes, so the higher terms in ∆T can be safely dropped.
Hence ∆V = 3αV ∆T . From the definition, this we immediately see that β = 3α . Just to
get an idea, here is what β looks like for various different materials:
Steel
Quartz Glass Al Hg Air
13. We can use the fact that different metals expand at different rates to make thermostats.
For example, you need a thermostat to prevent an electric iron from getting too hot or a
Q
15. We define the "heat capacity" of a body as C = , where ∆T is the increase in
∆T
temperature when an amount of heat Q is added to the body. Heat capacity is always
positive; Q and ∆T have the same sign. The larger the heat capacity, the smaller is the
change in the body's temperature when a fixed amount of heat is added. In general,
Q = mc∆T , where Q = heat added , m = mass , c = specific heat , and ∆T = change in
temperature. Water has a very large specific heat c,
c = 1.0 cal /(º C g ); this means it takes one calorie
to raise the temperature of 1 gm of water by 1 degree
Celsius. In joules per kilogram this is the same as 4186.
In the table are the specific heats of various common
materials. You can see that metals have small c, which
means that it is relatively easy to raise or lower their
temperatures. The opposite is true of water. Note also
that steam and ice have smaller c ' s than water. This
shows that knowing the chemical composition is not
enough.
17. When lifting a "daigchee" from a stove, you would be wise to use a cloth. Why? Because
metals transfer, or conduct, heat easily whereas cloth does not. Scientifically we define
conductivity using experiments and apparatus similar to the following:
Heat flows from the hotter to the colder plate. Let us use the following symbols:
k = thermal conductivity Q = heat transferred
A = cross sectional area t = duration of heat transfer
L = length ∆T = temperature difference
⎛ ∆T ⎞
Then the heat transferred in time t is, Q = kA ⎜ ⎟ t. This formula allows us to measure
⎝ L ⎠
k if all the other quantities in it are measured.
18. Conduction is one possible way by which heat is transferred from one portion of a
system to another. It does not involve physical transport of particles. However, there is
another way by which heat can be transferred - convection. In convection, heat is carried
by a moving fluid. So when you heat a pot of water, molecules at the bottom move up,
and the ones at the top come down - the water has currents inside it that transfer heat.
Another mechanism for transferring heat is through radiation. We have already talked
about this while discussing blackbody radiation and the Stefan-Boltzman Law.
Q.1 The brakes of a car get hot as the car descends a mountain. Explain where the energy
is coming from.
Q.2 A thermometer calibrated in degrees Fahrenheit has the same reading as a thermometer
calibrated in degrees Celsius at one particular temperature. What is that temperature?
Q.3 A lathe machine without a load draws one kilowatt, but when it is turning its tool against
a piece of iron of 3 kilograms, it requires 2.4 kw. Assume that the heat created in the
iron has no way out. Using the specific heat given in the table shown earlier, find the
rate at which the temperature of the iron increases.
Q.4 A thin sheet of metal has length L0 and width W0 . It is heated through an amount ∆T .
If the coefficient of linear expansion is α , and ∆T is small, find the increase in area
of the plate. Now, following the way in which we defined the coefficient of volume
expansion, define the coefficient of area exapnsion.
Q.5 a) A 0.5-kg block of nearly melted ice with an initial temperature of 0 oC is dropped
into a container holding 1.5 kg of water at 20.0 oC. Find the final temperature of the
ice-water system.
b) A 0.5-kg block of very cold ice with an initial temperature of -30 oC is dropped into
a container holding 1.5 kg of water at 20.0 oC. Find the final temperature of the ice-
water system now.
Q.6 Suppose that the rate at which a hot body cools is proportional to its temperature. In
other words, suppose that in time dt the body cools by amount dT where dT ∝ −T
(the minus sign shows that the temperature is decreasing rather than increasing). Use
this to show that the temperature of a body decreases exponentially with time.
Q.7 When a lake freezes, the freezing begins at the top. Why? Are conduction and
radiation involved, or just convection? Discuss.
1. Let us agree to call whatever we are studying the "system" (a mixture of ice and water, a
hot gas, etc). The state of this system is specified by giving its pressure, volume,
temperature, etc. These are called "thermodynamic variables". The relation between these
variables is called the "equation of state". This equation relates P,V,T. So, for example:
P = f (V , T ) (knowing V and T gives you P )
V = g ( P, T ) (knowing P and T gives you V )
T = h( P,V ) (knowing P and V gives you T )
For an ideal gas, the equation of state is PV = Nk BT . Here N is the number of molecules
in the gas, T is the temperature in degrees Kelvin, and k B is called the Boltzman cosntant.
This EOS is easy to derive, although I shall not do it here. In principle it is possible to
mathematically derive the EOS from the underlying properties of the atoms and molecules
in the system. In practice, however, this is a very difficult task (except for an ideal gas).
However the EOS can be discovered experimentally.
2. Thermodynamics is the study of heat and how it flows. The First Law of Thermodynamics
is actually just an acknowledgment that heat is a form of energy (and, of course, energy
is always conserved!). Mathematically, the First Law states that:
∆E = ∆q + ∆w
where: E = internal energy of the system
q = heat transferred to the system from the surroundings
w = work done on the system by the surroundings.
In words, what the above formula says is this: if you do an amount of work ∆w and also
transfer an amount of heat ∆q, then the sum of these two quantities will be the additional
amount of energy ∆E that is stored in the system. There could be nothing simpler! Note:
∆q and ∆w are positive if heat is added to or work is done on a system, and negative if
heat is removed from the system, or if the system does work on the environment.
3. Work and heat are called path variables - their values depend on the steps leading from
one state to another. To give an obvious example: suppose you drag a heavy box on the
floor in a straight line from point A to point B, and then in a very roundabout way from
A to B again. Obviously the work done by you will be very different. On the other hand,
the internal energy does not depend upon the path. Example: a gas has internal energy
proportional to its temperature. It makes no difference whether the gas had been slowly
heated or rapidly; it will have the same internal energy. Since the internal energy does not
State 2
depend on the path we can write ∫State1
dE = E2 − E1 = ∆E .
Closed system
Compressed
ideal gas
Pint Pext
Gas Vacuum
b)Suppose the pressure outside has some constant value. Then, w = Pext × (V2 − V1 ) or,
w = Pext ∆V . You can see that this is just the area of the rectangle below.
P ( P0 ,V0 )
W = P ∆V P
∆V V
c)We can also let the gas expand so that the internal and
external pressures are almost the same (this is called
reversible expansion). Then the work done on the P ( P1 ,V1 )
system is: dw = − Pext dV . Using PV = Nk BT gives,
V2 Nk BT V
w = −∫ dV = − Nk BT ( ln V2 − ln V1 ) = − Nk BT ln 2
V1 V V1
( P2 ,V2 )
Of course, the work done by the gas as it expands is
positive since V2 is larger than V1. Remember that the
log function is positive if its argument is bigger than 1.
6. Imagine that a substance is heated while keeping its volume constant. Obviously, you have
to supply more heat as you raise the temperature higher, ∆QV = CV ∆T , where CV is given
the name "specific heat at constant volume". No work is done since there is no expansion,
∆W = 0. Hence, using the First Law, ∆QV = ∆E + ∆W = ∆E , and so ∆E = CV ∆T .
7. Now take the same system as above, but allow it to expand at constant pressure as it is
heated. Then you will have to supply an amount of heat, ∆QP = CP ∆T . Here, CP is called
the specific heat at constant pressure, ∆QP = ∆E + ∆w = ∆E + P∆V . Hence,
CP ∆T = ∆U + P∆V . If we supply only a small amount of heat then, CP dT = dU + PdV .
This gives, CP dT = CV dT + PdV . If we consider the special case of an ideal gas, then
PV = Nk BT . Hence, PdV = Nk B dT and CP dT = CV dT + Nk B dT . Hence, CP = CV + Nk B .
3 dU 3
Since the internal energy is E = Nk BT , it follows that CV = = Nk B . From this,
2 dT 2
3 5
the specific heat at constant pressure is: CP = Nk B + Nk B = Nk B . From the equation
2 2
Nk B CP − CV C
CP = CV + Nk B , it follows that = . Let us define a new symbol, P ≡ γ .
CV CV CV
8. We can find the EOS of an ideal gas when it expands without losing any heat (this is
called adiabiatic expansion). In this case, dQ = dE + dW = 0. Hence, CV dT + PdV = 0,
dV dT Nk B dV dT dV
or CV dT + Nk BT = 0. Dividing by dT gives, + = 0 or + ( γ − 1) = 0.
V T CV V T V
Integrating this gives, ln T + ( γ − 1) ln V = Constant. Equivalently, ln (TV γ −1 ) = Constant.
A more convenient form is: TV γ −1 = Constant. This holds for the entire adiabatic expansion.
9. Almost nothing beats the importance of the Second Law Of Thermodynamics. There are
many equivalent ways of stating it. The one I like is: "There can be no process whose only
final result is to transfer thermal energy from a cooler object to a hotter object". This seems
extremely simple (and almost useless), but in fact it tells you, among other things, that no
perpetual motion machine (such as that which generates electricity without any fuel input)
can ever be built. If someone could build such a machine, then it could also be used to
transfer heat from a cold object to a hot object - a contradiction with the Second Law.
the hot reservoir and rejects heat Qout , c into the cold
reservoir. Since the internal energy of the engine does
Qh
not change, ∆U = 0 and the First Law gives simply,
Heat engine
Q = ∆U + W = W . The work done by the engine is W
Q.1 a) Give an example where no heat is transferred to or from a system but the temperature
of the system changes.
b) Why is it wrong to think of heat as a form of stored energy?
Q.2 An ideal gas expands from a given initial state to a given final state along the two paths
shown below, A and B.
a) Calculate the work done along path A and path B.
b) Is the internal energy of the gas changed during the expansions, and if so by how
much?
P P ( P1 ,V1 )
( P1 ,V1 ) Path A
Path B ( P2 ,V2 )
( P2 ,V2 )
V V
Q5. An airconditioner takes air from a room at 200C and transfers it to outdoors, where the
temperature is 400C . For each joule of electricity that is delivered to the refrigerator,
how many joules of energy are transferred to the outside?
1. In the two previous lectures, I concentrated exclusively on heat as a form of energy that
flows from a hot to a cold body. I made no mention of the fact that all matter is made of
atoms and molecules; the notion of heat would exist even if we did not know this. But,
the modern understanding of heat is that it is the random kinetic energy of atoms. So, for
example, the difference between a cold and hot gas is illustrated below.
2. The study of heat, considered as arising from the random motion of the basic constituents
of matter, is an area of physics called statistical mechanics. Its goal is the understanding,
and prediction of macroscopic phenomena, and the calculation of macroscopic properties
from the properties of individual molecules. Temperature is the average energy related to
the speed of atoms in an object, and heat is the amount of energy transferred from one
object to another.
3. Imagine a gas so dilute that atoms rarely collide with each other (this is also called an ideal
gas). We can readily understand why the pressure is directly proportional to the temperature
for a gas confined to a box: increasing the temperature T makes gas molecules move faster,
striking the walls of the container harder and more often, thus giving an increase in pressure
P, i.e. P ∝ T . On the other hand, for an ideal gas at constant pressure, the volume is directly
proportional to the temperature : V ∝ T . So, as the air in a balloon is heated up, its volume
will increase in direct proportion because of the impact of the atoms on the balloon walls.
Note that this is where the "absolute" or Kelvin scale comes from: at 0° K an ideal gas
would have zero volume because the atoms would not be moving.
5. Let us return to our intuitive understanding of heat as the random energy of small particles.
⎛1 ⎞
Now, ⎜ mv 2x ⎟ is the average kinetic energy of a particle on account of its motion in the
⎝2 ⎠av
⎛1 ⎞
x direction. This will be greater for higher temperatures, so ⎜ mv 2x ⎟ ∝ T , where T is the
⎝2 ⎠av
⎛1 ⎞
absolute temperature. So we write ⎜ mv 2x ⎟ = k BT , which you can think of as the
⎝2 ⎠av
definition of the Boltzmann constant, k B . So we immediately get PV = Nk BT , the equation
obeyed by an ideal gas (dilute, no collisions). Now, there is nothing special about any
( ) = ( v ) = ( v ) and therefore the average total velocity is
particular direction, so v 2x
av
2
y av
2
z av
⎛1 ⎞ 3 ⎛1 ⎞ 3
K av = ⎜ mv 2 ⎟ = k BT , and the energy of the entire gas is K = N ⎜ mv 2 ⎟ = Nk BT .
⎝2 ⎠ av 2 ⎝2 ⎠ av 2
3k T
From this, the mean squared speed of atoms in a gas at temperature is v 2 = B .
av m
( )
6. Heat, which is random kinetic energy, causes changes of
phase in matter:
a) Water molecules attract each other, but if water is
heated then they can escape and water becomes steam.
b) If steam is heated, the water molecules break up and
water becomes separated hydrogen and oxygen atoms.
c) If a gas of atoms gets hot enough, then the atoms collide
so violently that they lose electrons (i.e. get ionized).
d) More heat (such as inside the core of a star) will break up
atomic nuclei into protons and neutrons.
no increase in no increase in
temperature temperature
added heat added heat
melts ice vaporizes water
Now let's work out how much the entropy changes when we add heat dQ to an ideal gas.
dV
From the First Law, dQ = dU + dW = dU + PdV = CV dT + Nk BT . The entropy change
V
dQ dT dV
is dS = = CV + Nk B , and for a finite change we simply integrate,
T T V
dQ T V 3 T V
∆S = ∫ = CV ln 2 + Nk B ln 2 = Nk B ln 2 + Nk B ln 2 . Note that the following:
T T1 V1 2 T1 V1
a)The entropy increases if we heat a gas (T2 > T1 ).
b)The entropy increases if the volume increases (V2 > V1 ).
10. In statistical mechanics, we interpret entropy as the degree of disorder. A gas with all
atoms at rest is considered ordered, while a hot gas having atoms buzzing around in all
directions is more disordered and has greater entropy. Similarly, as in the above example,
when a gas expands and occupies greater volume, it becomes even more disordered and
the entropy increases.
∑ v n(v ) i i
the average speed is defined as v = . i
∑n(v ) i
i
Q.2 From the distribution of velocities that you see in the graph above, some molecules are
moving very fast. Some may be moving so fast in the atmosphere that their speed can
exceed the escape velocity! So why does the earth not lose its atmosphere?
Q.4 A gas has N atoms in volume V0 at temperature T0 . The gas is heated at constant volume
upto temperature 3T0 , then allowed to expand at constant temperature upto volume 3V0 .
Finally, it is allowed to cool at constant pressure to T0 . Find the net change in entropy.
Q.5 A piece of metal of mass M with specific heat CV and temperature T1 is put into a
thermos flask containing m grams of alcohol with specific heat cV .
a) Find the temperature after equibrium is achieved.
b) By how much does the entropy of the system change upon placing the metal inside
the thermos?
Q.6 A molecule of diameter d moves randomly in a gas. Call the average distance between
collisions λ. Suppose that average speed of a molecule is v, and that per unit volume
the number of molecules is nv . Look at the diagram below and justify the formula,
1
λ (nvπ d 2 vt ) ≈ vt. From this it follows that λ ≈ , a useful formula.
nvπ d 2
vt
πd2
2. Relativity deals with time and space. So let us first get some understanding of how we
measure these two fundamental quantities:
a) Time : we measure time by looking at some phenomenon that repeats itself. There are
endless examples: your heart beat, a pendulum, a vibrating quartz crystal, rotation of
the earth around its axis, the revolution of the earth about the sun,... These can all be
used as clocks. Of course, an atomic clock is far more accurate than using your heart
beat and is accurate to one part in a trillion. Although different systems of measure-
ment have different units it is fortunate that time is always measured in seconds.
b) Distance : intuitively we know the difference between short and long. But to do a
measurement, we first have to agree on what should be the unit of length. If you use
a metre as the unit, then you can use a metre rod and measure any length you want.
Of course, sometimes we may use more sophisticated means (such as finding how
high a satellite is) but the basic idea is the same: the distance between point A and
point B is the number of metre rods (or fractions thereof) that can be made to fit in
between the two points.
3. Newton had believed that there was one single time for the entire universe. In other words,
time was absolute and could be measured by one clock held somewhere in the centre of
the universe. Similarly, he believed that space was absolute and that the true laws of
physics could be seen in that particular frame which was fixed to the centre of the universe.
(As we shall see, Einstein shocked the world by showing that time and space are not
absolute quantities, but depend on the speed of your reference frame. Even more shocking
was his proof that time and space are not entirely separate quantities!)
y
vt x′
y′
If you were Newton, then you would look at the above figure and say that obviously it is
the following: x′ = x − vt , y′ = y , z′ = z , t ′ = t . These are called Galilean coordinate
transformations. Note that it is assumed here that the time is the same in both frames
because of the Newtonian belief that there is only one true time in the world.
5. There are certain obvious consequences of using the Galilean transformations. So, for
example a rod is at rest in S-frame. The length in S-frame = xB − x A , while the length in
the S′-frame = x′B − x′A = xB − x A − v ( t B − t A ) . Since t B = t A , the length is the same in
both frames: x′B − x′A = xB − x A . (As we shall see later this will not be true in Einstein's
Special Relativity).
A B
6. Let us now see what the Galilean transformation of coordinates implies for transformations
dx′ dx
of velocities. Start with x′ = x − vt and differentiate both sides. Then = − v. Since
dt dt
dx′ dx′ dx′ dx dy′ dy dz′ dz
t = t ′, it follows that = and so = − v. Similarly, = and = .
dt dt ′ dt ′ dt dt ′ dt dt ′ dt
dx′
Now, = u′x is the x-component of the velocity measured in S′-frame, and similarly
dt ′
dx
= u x is the x-component of the velocity measured in S-frame. So we have found that:
dt
u′x = u x − v, u′y = u y , u′z = u z (or, in vector form, u′ = u − v)
du′x d du
Taking one further derivative, = ( u x − v ) = x (remember that v = constant),
dt ′ dt dt
we find that the components of acceleration are the same in S and S′:
du′x du x du′y du y du′z du z
= , = , = .
dt ′ dt dt ′ dt dt ′ dt
9. Let's first get one thing clear: in any one inertial frame, we can imagine that there are
rulers and clocks to measure distances and times. We can synchonize all the clocks to
read one time, which will be called the time in that frame S. But how do we do this? If
are two clocks, then we can set them to read the same time (i.e. synchronize them) by
taking account of the time light takes to travel between them.
Example: The observer with clock A sees the time on clock B as 2:55pm. But he knows
that light took 5 minutes to travel from B to A, and therefore A and B are actually reading
exactly the same time.
A c = 3 × 108 m / sec B
d = 9 × 10 1 0 m
d 9 × 1010 m
(Time taken by light in going from B to A is equal to = = 300sec = 5min.)
c 3 × 108 m / sec
10. Now I shall derive the famous formula which shows that a moving clock runs slow. This
will be something completely different from the older Newtonian conception of time.
Einstein derived this formula using a "gedanken" experiment, meaning an experiment
which can imagine but not necessarily do. So imagine the following: a rail carriage has a
Now suppose that the same flash is observed by an observer S standing on the ground.
According to S, the train is moving with speed v. Let's consider the same light ray. Clearly,
the train has moved forward between the time when the light left the ceiling and when it
h 2 + ( v∆t )
2
d
reached the floor. According to S, the time taken is ∆t = = . Now
c c
d
h
v∆t
v
1 h
square both sides: (c∆t ) 2 = h 2 + ( v∆t ) , which gives ∆t =
2
= γ∆t ′. Here γ is
v c
2
1− 2
c
1
the relativistic factor, γ = and is a number that is always bigger than one. As v
2
v
1− 2
c
gets closer and closer to c, the value of γ gets larger and larger. For v=4c/5, γ =5/3. So, if
1 sec elapses between the ticks of a clock in S′ (i.e. ∆t ′ = 1), the observer in S will see
5/3 seconds between the ticks. In other words, he will think that the moving clock is slow!
11. The muon is an unstable particle. If at rest, it decays in just 10-18 seconds. But if traveling
1 5
at 3/5 the speed of the light, it will last 25% longer because γ = = . If it is
1 − ( 3/ 5 ) 4
2
traveling at v=0.999999c, it will last 707 times longer. We can observe these shifts due
to time dilation quite easily, and they are an important confirmation of Relativity.
12. Another amazing prediction of Relativity is that objects are shortened (or contracted) along
the direction of their motion. Einstein reached this astonishing conclusion on the basis of
yet another gedanken experiment. Again, consider a moving railway carriage with a bulb
at one end that suddenly flashes. Let ∆x be the length of the carriage according to ground
observer S, and ∆x′ be the length according to the observer S′ inside the carriage. So,
2∆x′
∆t ′ = is the time taken for the light to go from one end to the other, and then return
c
after being reflected by a mirror. Now let's look at this from the point of view of S, who is
fixed to the ground. Let ∆t1 be the time for the signal to reach the front end. Then,
∆x + v∆t1
because the mirror is moving forward, ∆t1 = . Call ∆t2 the return time. Then,
c
v∆t1 v∆t2
∆x − v∆t2 ∆x ∆x
∆t2 = . Solving for ∆t1 and ∆t2 : ∆t1 = and ∆t2 = . The total time
c c−v c+v
2∆x / c
is therefore ∆t = ∆t1 + ∆t2 = . Now, from the time dilation result derived
(
1 − v2 / c 2 )
⎛ 2∆x / c ⎞ 2 ∆x / c
earlier, ∆t ′ = 1 − v 2 / c 2 ∆t , and so ∆t ′ = 1 − v 2 / c 2 ⎜ ⎟= . This
⎝ (
⎜ 1− v /c ⎟
2 2
⎠ 1)− v 2
/ c 2
∆x
gives, ∆x′ = , or ∆x = ∆x′ / γ . This an astonishing result! Suppose that there
1− v /c
2 2
is a metre rod. Then the observer riding with it has ∆x′ = 1. But according to someone
who sees the metre rod moving towards/away from him, the length is 1/γ . This is less
than 1 metre!
13. Although an object shrinks in the direction of motion (both while approaching and
receding), the dimensions perpendicular to the velocity are not contracted. It is easy
to conceive of a gedanken experiment that will demonstrate this. One of the exercises
will guide you in this direction.
y y′
v
vt d x′
x
Here d ′ = . This gives x = γ ( x′ + vt ′ ) = γ ( γ ( x − vt ) + vt ′ ) . We find that the time in S′ is
γ
⎛ v ⎞
related to the time in S by t ′ = γ ⎜ t − 2 x ⎟ . Note that if we make c very large, then t ′ = t.
⎝ c ⎠
To summarize:
LORENTZ TRANSFORMATION
x′ = γ ( x − vt )
y′ = y
z′ = z
⎛ v ⎞
t′ = γ ⎜ t − 2 x ⎟.
⎝ c ⎠
(Note: in various books you will find slightly different derivations of the above Lorentz
transformation. You should look at one of your choice and understand that as well.)
1. Instead of using a ruler to measure distances, can we use a laser together with an accurate
clock instead? Discuss.
2. What must be the value of v/c if a)γ = 1.01, b)γ = 10, c)γ = 1000.
3. Two identical spaceships carrying identical clocks pass by each other at a speed of 0.99c.
a) Each sees the other's clock as running slow. By how much?
b) Each sees the other's spaceship shortened. By how much?
c) How is it possible for the observers on both spaceships to be correct in saying that the
other's clock is slow, and the other's spaceship is shortened?
4. Devise a logical argument that distances perpendicular to the direction of motion do not
shrink. As a help, consider the following gedanken experiment: a train passes through a
tunnel where, at a certain height above the ground a blue stripe has been painted. A man
in the train has a paintbrush with red paint on it which leaves a red stripe as the train moves
at high speed. Make your case using the equivalence of rest frames.
wall wall
w.r.t car
w.r.t ground
5. a) Two events happen at the same point in the same inertial frame S. To be specific, a
man stays still as blows his nose (event 1) and then wipes it (event 2). In any other
frame S′ that is moving in the x direction, show that the sequence of these two events
will remain unchanged.
b) Will this still be true if the man, instead of standing still, is walking very fast at half
the speed of light?
⎛ v ⎞
6. The Lorentz transformation is x′ = γ ( x − vt ) , y′ = y, z′ = z , t ′ = γ ⎜ t − 2 x ⎟ . Solve for
⎝ c ⎠
the inverse transformation, i.e express x, y, z, t in terms of x′, y′, z′, t ′.
⎛ v ⎞
1. Recall the Lorentz Transformation: x′ = γ ( x − vt ) and t ′ = γ ⎜ t − 2 x ⎟ . Suppose we take
⎝ c ⎠
the space interval between two events ∆x = x1 − x2 , and the time interval ∆t = t1 − t2 .
⎛ v ⎞
Then, these intervals will be seen in S′ as ∆t ′ = γ ⎜ ∆t − 2 ∆x ⎟ and ∆x′ = γ ( ∆x − v∆t ) .
⎝ c ⎠
Now consider two particular cases:
a) Suppose the two events occur at the same place (so ∆x = 0) but at different times
(so ∆t ≠ 0). Note that in S′ they do not occur at the same point: ∆x′ = γ ( 0 − v∆t )!
b) Suppose the two events occur at the same time (so ∆t = 0) but at different places
⎛ v ⎞
(so ∆x ≠ 0). Note that in S′ they are not simultaneous: ∆t ′ = γ ⎜ 0 − 2 ∆x ⎟ .
⎝ c ⎠
2. As seen in the frame S, suppose a particle moves a distance dx in time dt. Its velocity u is
dx
then u = (in S-frame). As seen in the S′-frame, meanwhile, it has moved a distance dx′
dt
⎛ v ⎞
where dx′ = γ ( dx − vdt ) and the time that has elapsed is dt ′ = γ ⎜ dt − 2 dx ⎟ . The velocity
⎝ c ⎠
dx′ γ ( dx − vdt ) dx / dt − v u−v
in S′-frame is therefore u′ = = = = . This is the
dt ′ ⎛ v ⎞ v uv
γ ⎜ dt − 2 dx ⎟ 1 − 2 dx / dt 1 − 2
⎝ c ⎠ c c
Einstein velocity addition rule. It is an easy exercise to solve this for u in terms of u′,
u′ + v
u= .
u′v
1+ 2
c
3. Note one very interesting result of the above: suppose that a car is moving at speed v and
it turns on its headlight. What will the speed of the light be according to the observer on
the ground? If we use the Galilean transformation result, the answer is v+c (wrong!). But
c+v c+v c+v
using the relativistic result we have u′ = c and u = = =c = c. In other
cv v c + v
1+ 2 1+
c c
words, the speed of the source makes no difference to the speed of light in your frame.
Note that if either u or v is much less than c, then u′ reduces to the familiar result:
u−v
u′ = → u − v, which is the Galilean velocity addition rule.
uv
1− 2
c
2
(
I ′ = ( c∆t ′ ) − ( ∆x′ ) = γ 2 ( c∆t ) + ( ∆x ) v 2 / c 2 − 2v∆t ∆x − ( ∆x ) − ( v∆t ) + 2v∆t ∆x
2 2 2 2 2
)
)(
=
1
(c 2
) (
− v 2 ( ∆t ) − ( ∆ x ) 1 − v 2 / c 2
2 2
))
(
1 − v2 / c2
= ( c∆t ) − ( ∆ x ) = I
2 2
This guarantees that all inertial observers measure the same speed of light !!
5. If the time separation is large, then I > 0 and we call the interval timelike.
If the space separation is large, then I < 0 and we call the interval spacelike.
If I = 0 and we call the interval lightlike.
Note : a) If an interval is timelike in one frame, it is timelike in all other frames as well.
b) If interval between two events is timelike, their time ordering is absolute.
c) If the interval is spacelike the ordering of two events depends on the frame from
which they are observed.
1
Let ν 0 = be the frequency measured in the source's rest frame S, where T0 is the time for
T0
one complete cycle in S. We want to calculate ν , the frequency as seen by the observer
in S′ moving to the right at speed v. Call λ the distance between two successive wave
crests (i.e. the wavelength according to the observer). In time T the crests ahead of the
source move a distance cT , even as the source moves a shorter distance vT in the same
c c
direction. Hence λ = ( c − v ) T and so ν = = . Now, as discussed earlier, the
λ ( c − v )T
time measured by observer will not be T0 because of time dilation. Instead,
⎛ c ⎞ c −v c+v
2 2
T0 cT0 c
T= = . Hence, ν = =⎜ ⎟ ν0 = ν 0.
1− v /c
2 2
c −v
2 2
( c − v )T ⎝ c − v ⎠ c c−v
c−v
( If source moves away from the observer just change the sign of v: ν = ν 0 .)
c+v
This is the famous Doppler effect formula. In the lecture I discussed some applications
such as finding the speed at which stars move away from the earth, or finding the speed
of cars or aircraft.
9. We now must decide how to generalize the concept of momentum in Relativity theory.
The Newtonian definition of momentum is p = mu . The problem with this definition is
that we are used to having momentum conserved when particles collide with each other,
and this old definition will simply not work when particles move very fast. Consider the
collision of two particles as in the diagrams below:
During Collision
After Collision
uC uD
10. Can we save the situation and make the conservation of momentum hold by finding some
suitable new definition of momentum? The new definition must have two properties:
1) At low speeds it must reduce to the old one.
2) At all speeds momentum must be conserved.
r
r mu r
Let's see if the definition p = = γ mu will do the job. Obviously if u<<c we
1 − u 2 / c2
r r
get p = mu , so requirement 1 is clearly satisfied. . Let's now see if the conservation of
momentum equation will hold if the new definition of momentum is used:
mAγ Au A + mBγ Bu B = mC γ C uC + mDγ Du D .
After doing some algebra you find,
⎛1 ⎞ ⎛1 ⎞ ⎛1 ⎞ ⎛1 ⎞
mA ⎜ γ ′Au′A + γ A v ⎟ + mB ⎜ γ B′ u′B + γ B v ⎟ = mC ⎜ γ C′ uC′ + γ C v ⎟ + mD ⎜ γ D′ u′D + γ D v ⎟
⎝γ ⎠ ⎝γ ⎠ ⎝γ ⎠ ⎝γ ⎠
This gives mAγ ′Au′A + mBγ B′ u′B = mC γ C′ uC′ + mDγ D′ u′D , which is just what we want.
=
mudu ( )
⎟
( ) ( )
3 / 2 3/ 2
⎝ 1− u /c ⎠ 1− u /c
2 2 2 2
1 − u 2 / c2 1 − u 2 / c2
u u
udu mc 2
This give K = ∫ udp = m ∫ = − mc 2
(1 − u )
3/ 2
0 0
2
/c 2
1− u /c 2 2
mc 2
Note that K is zero if there is no motion, or K = E − E0 where E = and
1 − u 2 / c2
1 u2
( )
−1/ 2
E0 = mc . Now expand
2
= 1− u /c 2 2
= 1+ 2 + ⋅⋅⋅
1 − u 2 / c2 2c
⎛ u2 ⎞ 1
Hence, K = mc 2 ⎜ 1 + 2 + ⋅ ⋅ ⋅ ⎟ − mc 2 → mu 2 as u / c → 0.
⎝ 2c ⎠ 2
12. Now that we have done all the real work, let us derive some alternative expressions
mu mc 2
using our two main formulae: p = = γ mu and E = = γ mc 2 .
1− u /c 2 2
1− u /c
2 2
2
p pc ⎛u⎞
a) u = = or pc = E ⎜ ⎟
γm E ⎝c⎠
2
⎛u⎞
2
⎝c⎠
2
( )
b) Clearly ( pc ) = E ⎜ ⎟ = γ 2 m 2c 4 1 − 1/ γ 2 = γ 2 m 2c 4 − m 2c 4 = E 2 − m 2c 4 . Hence,
E 2 = p 2c 2 + m 2c 4 .
c) For a massless particle (m = 0), E = pc.
13. A particle with mass m has energy mc 2 even though it is at rest. This is called its rest
energy. Since c is a very large quantity, even a small m corresponds to a very large
energy. We interpret this as follows: suppose all the mass could somehow be converted
into energy. Then an amount of energy equal to mc 2 would be released.
1. a) In measuring the length of a metre stick, how does the requirement of simultaneity enter?
b) What is the equivalent requirement in the meassurement of time?
2. A spaceship moving away from earth at 0.5c fires two missiles at speed 0.6c as measured
in its rest frame. One missile is aimed away from earth and the other is towards it. What
will be the speed of the missiles as measured on earth.
3. How much work must be done to increase the speed of a particle of mass from 0.2c to
0.3c? From 0.8c to 0.9c? In each case the increase in speed is the same amount, so why
is the work done different?
4. A particle of mass m moving with velocity v collides with another particle moving with
velocity -v. The two particles stick together. What will be the mass of the single particle
that remains?
5. A particle of mass M decays while at rest into two identical particles that move off in
opposite directions. What is the mass of those particles?
6. Two identical clocks are set to the same time. One remains on earth while the other is
in an aircraft that travels at 1000km/h around the world. Will the the two clocks read
the same or a different time? [Actually this experiment has been done with extremely
accurate atomic clocks, and the time difference has been measured.]
7. How fast must a car approach a red light so that it is Doppler shifted and appears green?
8. Clocks placed at equal distances in frame S are synchronized to read the same value of
time by sending a light pulse from the origin of S. The clocks in frame S′ are similarly
synchronized. To an observer in S, will the clocks in S′ be synchronized? What about
the observer in S?
Clocks in S ′
x′ = 0
v
x=0
Clocks in S
1. We think of particles as matter highly concentrated in some volume of space, and of waves
as being highly spread out. Think of a cricket ball, and of waves in the ocean. The two are
completely different! And yet today we are convinced that matter takes the form of waves
in some situations and behaves as particles in other situations. This is called wave-particle
duality. But do not be afraid - there is no logical contradiction here! In this lecture we shall
first consider the evidence that shows the particle nature of light.
Vacuum Light, frequency ν
2. The photoelectric effect, noted nearly 100 years ago, was chamber
Collecting
crucial for understanding the nature of light. In the diagram Metal
plate
plate
shown, when light falls upon a metal plate connected to
e−
the cathode of a battery, electrons are knocked out of the
plate. They reach a collecting plate that is connected to
I
the battery's anode, and a current is observed. A vacuum
Ammeter
is created in the apparatus so that the electrons can travel
Potentiostat
without hindrance. According to classical (meaning old!)
physics we expect the following:
a)As intensity of light increases, the kinetic energy of the ejected electrons should increase.
b)Electrons should be emitted for any frequency of light ν , so long as the intensity of the
light is sufficiently large.
But the actual observation was completely different and showed the following:
a)The maximum kinetic energy of the emitted electrons was completely independent of
the light intensity, but it did depend on ν .
b)For ν <ν 0 (i.e. for frequencies below a cut-off frequency) no electrons are emitted no
matter how large the light intensity was.
3. In 1905, Einstein realized that the photoelectric effect could be explained if light actually
comes in little packets (or quanta) of energy with the energy of each quantum E = hν .
Here h is a universal constant of nature with value h = 6.63 × 10−34 Joule-seconds, and is
known as the Planck Constant. If an electron absorbs a single photon, it would be able to
leave the material if the energy of that photon is larger than a certain amount W . W is
called the work function and differs from material to material, with a value varying from
2-5 electron volts. The maximum KE of an emitted electron is then K max = hν − W . We
visualize the photon as a particle for the purposes of this experiment. Note that this is
completely different from our earlier understanding that light is a wave!
( )
1/ 2
Specifically, from conservation of energy hν + mec 2 = hν ′ + pe2c 2 + me2c 4 . The last
term is the energy of the scattered electron with mass me . Next, use the conservation
Before After pν ′
scattered
Incoming photon
θ
pν
Electron
pe
scattered
of momentum. The initial momentum of the photon is entirely along the zˆ direction,
h
pν = zˆ = pν ′ + p e . By resolving the components and doing a bit of algebra, you can get
λ
h
the change in wavelength λ ′ − λ = (1 − cosθ ) = λc (1 − cosθ ) , where the Compton
me c
h
wavelength λc = = 2.4 × 10−12 m. Note that λ ′ − λ is always positive because cos θ
me c
has magnitude less than 1. In other words, the frequency of the scattered photon is
always less than the frequency of the incoming one. We can understand this result
because the incoming photon gives a kick to the (stationary) electron and so it loses
energy. Since E = hν , it follows that the outgoing frequency is decreased. As remarked
earlier, it is impossible to understand this from a classical point of view. We shall now
see how the Compton effect is actually observed experimentally.
7. So light behaves as if made of particles. But do all particles of matter behave as if they
are waves? In 1923 a Frenchman, Louis de Broglie, postulated that ordinary matter can
have wave-like properties with the wavelength λ related to the particle's momentum p
h
in the same way as for light, λ = . We shall call λ the de Broglie (pronounced as
p
Deebrolee!) wavelength.
9. If De Broglie's hypothesis is correct, then we can expect that electron waves will undergo
interference just like light waves. Indeed, the Davisson-Germer experiment (1927) showed
that this was true. At fixed angle, one find sharp peaks in intensity as a function of electron
energy. The electron waves hitting the atoms are re-emitted and reflected, and waves from
different atoms interfere with each other. One therefore sees the peaks and valleys that
are typical of interference (or diffraction) patterns in optical experiments.
θi
θi
10. Let us look at the interference in some detail. When electrons fall on a crystalline
surface, the electron scattering is dominated by surface layers. Note that the identical
scattering planes are oriented perpendicular to the surface. Looking at the diagram,
we can see that constructive interference happens when a(cosθ r − cosθi ) = nλ . When
this condition is satified, there is a maximum intensity spot. This is actually how we
find a and determine the structure of crystals.
θi
a cosθ i
θr
d
θ
Incoming coherent d sin θ
beam of electrons
Detecting
screen
D
12. The double-slit experiment is so important that we need to discuss it further. Note the
following:
a) It doesn't matter whether we use light, electrons, or atoms - they all behave as waves.
in this experiment. The wavelength of a matter wave is unconnected to any internal
h
size of particle. Instead it is determined by the momentum, λ = .
p
b) If one slit is closed, the interference disappears. So, in fact, each particle goes through
both slits at once.
c) The flux of particles arriving at the slits can be reduced so that only one particle arrives
at a time. Interference fringes are still observed !Wave-behaviour can be shown by a
single atom. In other words, a matter wave can interfere with itself.
d) If we try to find out which slit the particle goes through the interference pattern
vanishes! We cannot see the wave/particle nature at the same time.
All this is so mysterious and against all our expectations. But that's how Nature is!
14. By using the De Broglie hypothesis in a simple gedanken experiment, we can see how
the uncertainty principle emerges. Electrons are incident upon a single slit and strike a
screen far away. The first dark fringe will be when W sin θ = λ . Since θ is small, we can
λ ∆p y ∆p y
use sin θ ≈ θ , and so θ ≈ . But, on the other hand, tan θ = and θ ≈ . So,
W px px
∆py λ
= . But W is really the uncertainty in the y position and we should call it ∆y.
h/ λ W
Momentum uncertainty in
the y component
15. In a proper course in quantum mechanics, one can give a definite mathematical meaning
to ∆x and ∆px etc, and derive the uncertainty relations:
∆x∆px ≥ h / 2, ∆y∆p y ≥ h / 2, ∆z∆pz ≥ h / 2
Note the following:
a) There no uncertainty principle for the product ∆x∆p y . In other words, we can know
in principle the position in one direction precisely together with the momentum in
another direction.
b) The thought experiment I discussed seems to imply that, while prior to experiment
we have well defined values, it is the act of measurement which introduces the
uncertainty by disturbing the particle's position and momentum. Nowadays it is more
widely accepted that quantum uncertainty (lack of determinism) is intrinsic to the
theory and does not come about just because of the act of measurement.
17. One consequence of ∆E ∆t ≥ h / 2 is that the level of an atom does not have an exact
value. So, transitions between energy levels of atoms are never perfectly sharp in
frequency. So, for example, as shown below an electron in the n = 3 state will decay
to a lower level after a lifetime of order t ≈ 10−8 s. There is a corresponding "spread"
in the emitted frequency.
I
n=3 n
E = hν 32 t
n=2 e
n
s
it ∆ν 32
y
n=1
ν 32
Frequency
Q.1 The highest wavelength for which photoelectrons are emitted from tungsten is 2300 A o .
Determine the energy of an electron ejected from the metal's surface by ultraviolet light
of wavelength 1900 A o .
Q.2 Photoelectrons emitted from a cesium plate illuminated with ultraviolet light of wave-
length 2000 A o are stopped by a potential of 4.21V. Find the work function of cesium.
Q.3 a) Show that the De Broglie wavelength of an electron with kinetic energy E (eV) is
12.3 × 10−8
λe = cm.
E1/ 2
b)Show that the De Broglie wavelength of a proton with kinetic energy E (eV) is
0.29 × 10−8
λp = cm.
E1/ 2
h
Q.4 Complete the steps in point 4 (Compton Effect) to show that λ ′ − λ = (1 − cosθ ) .
mec
Q.5 The Heisenberg microscope is an imaginary device to measure the position (y) and
momentum (p) of a particle.
Particle
θ/2
∆y
Light source,
wavelength λ Resolving power of lens:
λ
Lens, with angular ∆y ≈
θ
diameter θ
Photons transfer momentum to the particle when they scatter.
a) Show that the magnitude of p is the same before and after the collision.
b) Uncertainty in photon y-momentum= Uncertainty in particle y-momentum. Why?
c) Show that p y lies in the range − p sin (θ / 2 ) ≤ p y ≤ p sin (θ / 2 ) . Thus, the uncertainty
∆p y = 2 p sin (θ / 2 ) ≈ pθ .
hθ λ
d) Use the de Broglie relation p = h / λ to show that ∆p y ≈ . Then use ∆y ≈ to get
λ θ
back ∆p y ∆y ≈ h.
1. The word "quantum" means packet or bundle. We have already encountered the quantum
of light - called photon - in an earlier lecture. Quanta (plural of quantum) are discrete
steps. Walking up a flight of stairs, you can increase your height one step at a time and not,
for example, by 0.371 steps. In other words your height above ground (and potential
energy can take discrete values only).
3. It is impossible to cover quantum mechanics in a few lectures, much less in this single
lecture. But here are some main ideas:
a) Classical (Newtonian) Mechanics is extremely good for dealing with large objects (a
grain of salt is to be considered large). But on the atomic level, it fails.The reason for
failure is the uncertainty principle - the position and momenta of a particle cannot be
determined simultaneously (this is just one example; the uncertainty principle is actually
more general). Quantum Mechanics properly describes the microscopic - as well as
macroscopic - world and has always been found to hold if applied correctly.
b) Atoms or molecules can only exist in certain energy states. These are also called
"allowed levels" or quantum states. Each state is described by certain "quantum
numbers" that give information about that state's energy, momentum, etc.
c) Atoms or molecules emit or absorb energy when they change their energy state. The
amount of energy released or absorbed equals the difference of energies between the
two quantum states.
d) Quantum Mechanics always deals with probabilities. So, for example, in considering
the outcome of two particles colliding with each other, we calculate probabilities to
scatter in a certain direction, etc.
5. One consequence of quantum mechanics is that it explains through the uncertainty principle
the stability of the atom. But before talking of that, let us consider a particle moving
between two walls. Each time it hits a wall, a force pushes it in the opposite direction.
There is no friction, so classically the particle just keeps
moving forever between the two potential walls. In QM
the uncertainty of the particle's position is ∆x = a and so,
h
from ∆x∆p ≥ h / 2, we have ∆p a ≈ . From this we learn
2
(∆p) 2
h2
that the kinetic energy ≈ . This is telling us
2m 8ma 2
that as we squeeze the particle into a tighter and tighter a
space, the kinetic energy goes up and up!
6. The story repeats for a harmonic oscillator. So imagine that a mass moves in a potential
of the type shown below, and that its frequency of oscillation is ω. Classically, the lowest
energy would be that in which the mass is at rest (no kinetic energy) and it is at the
position where the potential is minimum. But this means that the body has both a
know from our experience that it will just keep oscillating - moving first towards the hill
and then down again, etc. But quantum mechanically, it can "steal" energy ∆E for a time
∆t and this may be enough to surmount the hill. Of course, the particle must respect
∆E ∆t ≥ h / 2 so the time is small if it needs a large amount of energy to cross over. Again,
I have given only a rough argument here, but in quantum mechanics we can do proper
calculations to find tunneling probabilities.
9. Without tunneling our sun would go cold. As you may know, it is powered by hydrogen
fusion. The protons must somehow overcome electrostatic repulsion to get close enough
so that they can feel the nuclear force and fuse with each other. But the thermal energy
at the core of the sun is not high enough. It is only because the tunneling effect allows
protons to sometimes get close enough that fusion happens.
13. If a1 and a2 are the amplitudes of the two possibilities for a particular event to occur,
then the amplitude for the total event is A = a1 + a2 . Here a1 and a2 are complex numbers
2 2
in general. But the probability for the event to occur is given by P = A = a1 + a2 . In
daily experience we add probabilities, P = P1 + P2 but in quantum mechanics we add
2
amplitudes: P = a1 + a2 = a1*a1 + a2*a2 + a1*a2 + a2*a1 = P1 + P2 + a1*a2 + a2*a1 . The cross
terms a1*a2 + a2*a1 are called interference terms. They are familiar to us from the lecture
on light where we add amplitudes first, and then square the sum to find the intensity. Of
course, if we add all possible outcomes then we will get 1. So, for example, in the electron
2 2
case P (↑) + P (↓) = c1 + c2 = 1. Note that amplitudes can be complex but probabilities
are always real.
15. The above brings us to the concept of a "wave function". In 1926 Schrödinger proposed
a quantity that would describe electron waves (or, more generally, matter waves).
• The wavefunction Ψ ( x, t ) of a particle is the amplitude to be at position x at time t.
• The probability of finding the particle at position x between x and x + dx (at time t ) is
2
Ψ ( x, t ) dx . Since the particle has to be somewhere, if we add up all possibilities then
+∞
∫
2
we must get one, i.e. Ψ ( x, t ) dx = 1.
-∞
Q.1 Compare the size of the kinetic energy of a particle confined to a distance which is
of the size of:
a) A grain of salt (1mm).
b) An atom
c) A nucleus
Give your answer only in terms of ratios of b/a and c/a.
Q.2 The probability of death in an accident is 1.5% and the probability of injury is 35%.
How many people are killed in 1000 accidents? Injured? Not injured or killed?
Q.3 In the double slit experiment, what would you see if one slit is closed? Why does the
interference require both slits to be open at the same time?
Q.4 Suppose a particle that can move only between x = 0 and x = a has wavefunction
πx
Ψ ( x) = N sin . What is the value of N? [Hint, the particle has to be somewhere!]
a
Q.5 Classically we are used to a picture of the atom as
a small solar system in which the electron orbits the
nucleus. But quantum mechanically this is not true.
Instead, the electron is distributed as a cloud around
the centre. Where the cloud is most dense, chances
of finding the electron are greatest. Now look at the
graph and picture at the right.
a) Does the atom have a definite size? If not, can
you say anything at all about it?
b) This particular state of the electron is such that it
is symmetrically distributed, independent of angle.
In this case, can the electron have non-zero angular
momentum?
c) If the answer above is "no", then why does the
electron not fall into the positively charged nucleus?
1. • About 2500 years ago, the ancient Greek philosopher Democritus asked the question:
what is the world made of? He conjectured that it is mostly empty, and that the
remainder is made of tiny "atoms". By definition these atoms are indivisible.
• Then 300 years ago, it was noted by the French chemist Lavoisier that in all chemical
reactions the total mass of reactants before and after a chemical reaction is the same.
He demonstrated that burning wood caused no change in mass. This is the Law of
Conservation of Matter.
• A major increase in understanding came with Dalton (1803) who showed that:
1) Atoms are building blocks of the elements.
2) All atoms of the same element have the same mass.
3) Atoms of different elements are different.
4) Two or more different atoms bond in simple ratios to form compounds.
2. Avogadro made the following hypothesis : "Equal volumes of all gases, under the same
conditions of temperature and pressure, contain equal numbers of molecules". Why?
Because we know that pressure is caused by molecules hitting the sides of the containing
vessel. If the temperature of two gases is the same, then their molecules move with
the same speeds, and so Avogadro's hypothesis follows for ideal gases. The famous number
N 0 = 6.023 × 1026 per kilogram-mole is called Avogadro's Number.
3. Let's get an idea of the size of atoms. Amazingly, we do not need high-powered particle
accelerators to do so. Consider a cube of 1m × 1m × 1m. If the radius of an atom is r , then
we have (1/ 2r )3 atoms in the cube. Now in 1kg.atom we have N 0 = 6 × 1026 atoms and
each atom occupies a volume ( A / ρ ) m3 , where A = atomic weight and ρ = density.
1m
1m
1/ 3
1⎛ A ⎞
Hence N 0 = (1/ 2r ) × A / ρ . This give r = ⎜
3
⎟ . Putting in some typical densities,
2 ⎝ ρ N0 ⎠
we find that rAg ≈ rBe ≈ 10−10 m. This shows that atoms are mostly of the same size. This
is quite amazing because one expects a Be atom to be much smaller than an Ag atom.
5. But the plum-pudding model was wrong. In 1911, Rutherford carried out his famous
experiment that showed the existence of a small but very heavy core of the atom. He
arranged for a beam of α particles to strike gold atoms in a thin foil of gold.
If the positive and negative charges in the atom were randomly distributed, all α ' s would
go through without any deflection. But a lot of backscattering was seen, and some α 's
were even deflected back in the direction of the incident beam. This was possible only if
they were colliding with a very heavy object inside the atom. Rutherford had discovered
the atomic nucleus.
7. This sounds fine, but there is a serious problem: we know that a charge that accelerates
radiates energy. In fact the power radiated is P ∝ e 2 a 2 , where e is the charge and a is the
acceleration. Now, a particle moving in a circular orbit has an acceleration even if it is
moving at constant speed because it is changing its direction all the time. So this means
8. This is not the only thing wrong with the solar system model. If you look at the light emitted
by any atom, you do not see a continuous distribution of colours (frequencies). Instead, a
spectroscope will easily show that light is emitted at only certain discrete frequencies.
9. Similarly, if white light is passed through a gas of atoms of a certain type, only certain
colours are absorbed, and the others pass through without a hindrance.
The above shows the absorption spectrum of a certain atom. The wavelengths for both
emission and absorption lines are exactly equal. Classical physics and the Rutherford
model have no explanation for the spectrum.
11. Let us pursue this idea further. The electron has a wavelength and forms standing waves
in its orbit around the nucleus. An integral number of electron wavelengths must fit into
the circumference of the circular orbit. Hence nλ = 2π r with n = 1, 2,3 ⋅ ⋅ ⋅ The momentum
h h nh
is p = mv = = = . The angular momentum L = rmv = nh is therefore
λ (2π r / n) r
quantized in units of h.
14. In the Bohr model, electrons can jump between different orbits due to the absorption or
emission of photons. Dark lines in the absorption spectra are due to photons being
absorbed, and bright lines in the emission spectra are due to photons being emitted. The
energy of the emitted or absorbed photon is equal to the difference of the initial and
final energy levels, hv = E f − Ei . The picture below shows the electron in the n = 7 and
⎛1 1 ⎞
n = 8 levels. The photon emitted has hv = E8 − E7 = −13.6 ⎜ 2 − 2 ⎟ ev.
⎝8 7 ⎠
15. The Bohr model gave wonderful results when compared against the hydrogen spectrum.
It was the among the first indications that some "new physics" was needed at the atomic
level. But this model is not to be taken too seriously - it fails to explain many atomic
properties, and fails to explain why the H atom can exist even when the electron has
no orbital angular momentum (and hence no centrifugal force to balance against the
Coulomb attraction). It cannot predict all the lines observed for H, much less for multi-
electron atoms such as Oxygen. The real value of this model was that it showed the way
forward towards developing quantum mechanics, which is the true physics of the world,
both microscopic and macroscopic. I have discussed some elements of QM in the last
r
lecture, in particular the wavefunction ψ (r , t ) of the electron.
17. Electrons can be thought of as little spinning balls of charge. All electrons spin at the
same speed (more accurately, their spin angular momentum is the same and equals h/2).
An electron can spin in only one of two possible directions.
When charges move around in a circle, that constitutes a
current. As you know, currents give rise to magnetic fields.
This is why electrons are also little magnets that interact
with other magnets. Now go back to the Stern-Gerlach
experiment described in the previous lecture, and you will
understand better why the electrons were deflected by the
applied magnetic field. Now that we have learned that the
electron has spin, we can describe the two spin states by giving the "magnetic" quantum
number ms where ms = − 12 , 12 . These two states have the same energy except when there
is some magnetic field present.
18. States (or orbitals) having l = 0,1,2, ⋅ ⋅ ⋅ are called s,p,d, ⋅ ⋅ ⋅ . The s-states are spherical. As
n increases, the s-orbitals get larger and the wavefunction is larger away from the nucleus.
20. Let us apply the Pauli Exclusion Principle to the multi-electron atom where each electron
has the quantum numbers {n, l , ml , ms }. Only one electron at a time may have a particular
set of quantum numbers. Now for some definitions:
• Shell - electrons with the same value of n
• Subshell - electrons with the same values of n and l
• Orbital - electrons with the same values of n, l , and ml
Once a particular state is occupied, other electrons are excluded from that state. The
electron configuration is how the electrons are distributed among the various atomic
orbitals in an atom. A common notation is of the type here: Number of electrons
in subshell = 6
3p6
Principal quantum Angular momentum
number n = 3 quantum number l = 1 (p)
21. Building the shell structure of multi-electron atoms through n = 4 using the Pauli Principle.
Q.2 What would the value of the principal quantum number n have to be for a hydrogen
atom to be 1 cm in radius? Why do we never see atoms this big, even though the formula
for radius allows it?
13.6
Q.3 The energy levels of the H atom are given by − eV . Why the minus sign? This is
n2
actually an important point and you should go back to the lecture on electric potential
to see why this is so.
Q.4 Using the table provided in point 21 above, find the allowed electron configurations
for the following atoms: Li, Be, C, N, O.
2 2
Q.5 Plot the function P(r ) = r 2e − r / a0 against ρ , where ρ is the distance measured in units
of a0 , ρ = r / a0 . Answer the following questions:
a) At what value of r does P (r ) have a maximum?
∞
Q.1 In the previous lecture you learned how it was discovered that the atom is mostly empty
space with a cloud of electrons. At the centre is a small but very heavy nucleus that has
protons and neutrons. The word "nucleon" refers to both of these. So you can think of
the neutron or proton as being two different varieties of the nucleon. The masses of the
two are very similar, and they are roughly 2000 times heavier than the electron.
proton mass = M p = 1.672 × 10-27 kg
neutron mass = M n = 1.675 × 10-27 kg
electron mass = M e = 9.109 × 10-31 kg
The neutron is neutral, of course, but the charge on the proton is 1.6 × 10-19 C while the
charge on the electron is the negative of this, − 1.6 × 10-19 C.
2. Using kilograms is very awkward if you are dealing with such small particles. Instead we
we use E = mc 2 to write the mass of a particle in terms of its rest energy, m = E / c 2 . So
mass is measured in units of MeV / c 2 .
proton mass = M p = 938 MeV / c 2
neutron mass = M n = 940 MeV / c 2
electron mass = M e = 0.5 MeV / c 2
4. Hydrogen, deuterium, and tritium are called isotopes. If a nucleus with Z protons has N
neutrons then its isotopes will have fewer, or more, neutrons. Since the number of electrons
is also Z, the chemical properties of all isotopes are exactly the same. But for any given
element, at most there is only one stable isotope.
7. What energy should electrons have in order to see a nucleus? We know that electrons are
waves with λ = h / p (the De Broglie relation). To see something as small as 1 fm requires
a wave with wavelength at least λ ≈ 1 fm. A wave with longer wavelength would simply
pass over the nucleus without being disturbed. So the minimum electron energy is,
p2 h2
E= = . Evaluation gives this to be a few MeV, requiring an electron accelerator
2m 2mλ 2
of more than this minimum energy.
9. Let us consider the implication of the approximate formula for the nuclear radius,
4 4
r ≈ r0 A1/ 3 where r0 = 1.2 fm. The volume of the nucleus is: V = π r 3 = π r03 A. From
3 3
A 3
this = ≈ 0.14 nucleons/fm3 . This is the number of nucleons per cubic fermi, and
V 4π r0 3
is independent of the nucleus considered. This is the density you would find at the centre
of any nucleus. Of course, this is approximately true only but it is quite remarkable.
10. Protons repel protons through the electrostatic force. So why does the nucleus not blow
apart. Obviously there must be some attractive force that is stronger than this repulsion.
It is, in fact, called the strong force. From what we have learned so far, we can guess
some of its important features:
a) Since neutron and proton distributions are almost the same, the N-P force cannot be
very different from the N-N or P-P force.
b) Since the density of nucleons in large nuclei is the same as in lighter nuclei, this means
that a given nucleon feels only the force due to its immediate neighbours, and does not
interact much with nucleons on the other side of the nucleus. In other words, the range
of the nucleon-nucleon force is very short and of the order of 1-2 fm only.
11. The force between two charges is always of one sign - repulsive if the signs are the same,
and attractive if they are opposite. In the early years of quantum theory, people realized
that this force comes about because of the exchange of photons between charges. The
nucleon-nucleon force is different. It has to be attractive to keep the nucleus together,
and has to short range (as discussed above). But, to prevent nucleons from sticking to each
other, it must be repulsive at short distances. Now here, "short" and long means distances
on the scale of fermis. Typically the distances between nucleons is on this scale as well.
the force is the negative of the slope, you can see that for
large value of r , the force is attractive (negative means V (r ) repulsive
r
directed towards smaller values of r ). At roughly 1.4 fm dV
F =−
dr
the potential reaches its most attractive point. For values 1.5 fm
r
of r smaller than this, the force is repulsive. In fact there
attractive
is a very strong core that almost completely forbids the
nucleons from getting closer than about 0.5 fm.
12. In 1935, the Japanese physicist Hideki Yukawa made an astonishing breakthrough in
understanding the basis for the attractive N-N force. He assumed that, just as between
charges, there must be a particle that is emitted by one nucleon and then captured by the
other. He called this a "pi-meson" or "pion", and made a good guess for what its mass
should be. His argument uses the time-energy uncertainty principle discussed earlier:
• Let ∆t be the time that the pion takes between emission and capture. In this time it
will have travelled a distance approximately equal to c∆t because light particles can
travel no faster than light.
• Creating a pion from "nothing" means that energy
conservation has been violated. The amount of
p
π+ n
violation is ∆E = minimum energy of pion ≈ mc 2
h
and this must obey ∆E ∆t ≈ h / 2. Hence, ∆t ≈
2mc 2
h
and the distance travelled by the pion ≈ c ∆t ≈ .
2mc
h
• Put ≈ 1.2 fm (range of nuclear force). This gives
2mc
the mass of the pion as close to mc 2 ≈ 124 MeV . This
was an amazing prediction - the first time a particle had
been predicted to exist on the basis of a theoretical
argument. When experimentalists searched for it in
1947, they indeed found a particle of mass rather close
to it, with mc 2 ≈ 138 MeV . It was a very dramatic
confirmation for which Yukawa got the Nobel prize.
• Pions can rightfully be called the carriers of the strong nuclear force. They have 3
possible charge states: π + , π 0 , π − . They belong to a larger family of particles called
mesons. Other family members are rho-mesons, omega-mesons, K-mesons,...
Today we can produce mesons in huge amounts by smashing nucleons against each
other in an accelerator.
n0
half the original sample to decay. If n = n0 / 2 then log = −λt , from which the half
2n0
log 2 0.693
life is related to λ by, T1 = = . The larger λ , the more radioactively unstable
2
λ λ
a nucleus is. Some typical half-lives are:
Polonium 214
84 P 1.64 × 10-4 s
89
Krypton 36 K 3.16 minutes
90
Strontium 38 Sr 28.5 years
226
Radium 88 Ra 1600 years
14
Carbon 6 C 5730 years
Uranium 238
92 U 4.5 × 109 years
You can see how hugely different the lifetimes of different nuclei are!
16. The decay law can be used to see how old things are. This is called radioactive dating.
Carbon dating is widely used for living things that died a few hundred or few thousand
years ago. How does it work?This uses the decay of the unstable isotope, 146C . Of course,
the stable isotope of carbon is 126C.
• When a living organism dies, CO 2 is no longer absorbed. Thus the ratio of carbon 14:12
decreases by half every 5730 years. We can measure the rate of decrease through
N = N o e- λ t or the "activity" A = Ao e − λ t with Ao = 0.23 Bq / g. (The becquerel Bq
is the unit of radioactivity, defined as the activity of a quantity of radioactive material
in which one nucleus decays per second. )
• The amount of isotopes in the atmosphere is approximately constant, despite a half-life
of 5730 y because there is a constant replenishment of 146C through the reaction,
14
7 N + n → 146C + p
17. Let us use the above idea to find the time when this
man died. His body was found a few years ago buried
under deep snow in a mountain pass, so it it did not
decay as usual. By looking at the radioactivity in his
body, it was found that that the activity of 146C was
0.121 Bq / g of body tissue. This is less than the normal
activity 0.23 Bq / g . because 146C has been decaying
0.693 0.693
away. First find λ , λ = = = 1.21 × 10−4 y −1
T1 2 5730
−4
Then use, 0.121 = 0.23 e −1.21×10 ×t
which gives,
0.121
ln = −1.21 × 10−4 × t and so t = 5300 years is
0.23
when this poor man was killed (or died somehow)!
18. The most famous formula of physics, E = mc 2 , is the basis for nuclear energy. In 1935,
it was discovered by two German physicists, Otto Robert Frisch and Lise Meitner, that
a heavy nucleus can fission (or break up) into two or more smaller nuclei. The total
energy is, of course, conserved but the mass is not. This is completely different from the
MA Mb + Mc
The masses of the two nuclei add up to less than the mass of the parent nucleus, and the
energy released is Q = ( M A − M b − M c )c 2 . This goes into kinetic energy and sends the
two daughter nuclei flying apart at a large velocity. There happen to be NO completely
stable nuclei above Z = 82, and no naturally occurring nuclei above Z = 92. Above these
limits the nuclei decay or fall apart in some fashion to get below these limits.
19. A very useful concept is "binding energy". Suppose you want to take a nucleon out of a
nucleus. The binding energy is the amount of energy that you would have to provide to
pull it on the average. Nuclei with the largest BE per nucleon are the most stable. As you
can see from the graph below, the most stable element is iron, 56 Fe with a BE per nucleon
of about 8.6 MeV. This is why iron is the heavy element found in the largest quantity on
earth and inside stars. The curve is not smmoth and you see that a 4 He nucleus (i.e. an α
parrticle, has a relatively high binding energy and so is relatively stable. In contrast,
235
Uranium U or deuterium 2 H are much less bound and they decay.
20. Nuclei can be unstable in different ways. A nucleus can emit α , β , and γ radiations.
Usually a nucleus will emit one of these three, but it is possible to emit two, or even all
three of these. (In addition, as we have discussed above, a nucleus can break up into
22. The simplest beta decay reaction is when a neutron decays, n → p + e − + ν . Other than
a proton and electron, an anti-neutrino is also emitted. As discussed in the lecture, the
(anti) neutrino is a neutral particle with a very tiny mass that interacts very weakly with
matter. A nucleus can undergo beta decay with either an electron being produced, i.e
A
Z X→ A
Z +1 X ′ + e− + ν (called β - decay) or with an anti-electron (positron, or positive
electron), ZA X → A
Z −1 X ′ + e + + ν , (called β + decay). Beta decay involves the weak
nuclear force. This is one of the four fundamental forces in the world, and its small
strength means that the decay happens much more slowly than most other reactions.
Q.2 The method of carbon dating discussed in the text can only be used for time scales of
a few thousand years. For longer time scales we must use the decay of 238
92U , which has
a half-life of 4.5 × 10 years. Discuss how this could be possible, and how we could
9
use this to find the age of rocks that formed when the earth was formed. You should
consult a book or use Google.
Q.4 In the early days of nuclear physics it was discovered that slow neutrons can be very
effective in producing the fission of uranium. Typically a reaction could be like this one:
1
0 n + 235
92U → ( U →
236
92 ) 141
56 Ba + 3692 Kr + 3 01n. The 3 neutrons produced can then produced
more fissions, and this leads to a chain reaction of the kind shown in second diagram:
a)Suppose that a slow neutron enters a block of U . Assuming that all 3 neutrons go on
235
92
to fission another nucleus. This is called a chain reaction. After 2,3,4,5..fissions, what
will be the number of neutrons? Write down a general formula for N fissions.
b)It is possible that some neutrons are too fast and can escape the uranium block without
causing fission. Suppose the probability that a neutron does cause fission is p. How is
your answer above modified. Evaluate for p = 0.7
5. In a collapsed star called a white dwarf, nuclei are almost touching each other. Calculate
the density of this matter in kg/m3 and compare with that of water.
1. In this lecture I shall pull together the different parts of physics that you have learned in
this course and apply them to understanding the source of all life on earth - our sun. We
will learn that the sun operates according to principles that we can understand, and on
the basis of this we can even predict the manner in which it will eventually die.
3. The sun puts out a huge amount of energy. In quantitative terms we measure this in by
its luminosity, 3.83 × 10 27 joules per second. The power output is 3.83 × 1024 kilowatts.
This is equal to 8 × 1016 of the largest power plants on Earth, meaning those which produce
~5000 MW of power. Another way of expressing this: every second the sun puts out as
much energy as 2.5 × 109 (2 billion) such power plants would put out every year.
4. What powers the sun? The earth is very old (billions of years). If there was a chemical
fuel (say, coal or oil) at most that would last a few million years. But the sun is many
thousands of tons older than that. Only after the discovery of E = mc 2 did we know the
real secret. The sun gets its energy from the fusion (the coming together and combining
of atomic nuclei. For this extremely high temperatures, density, and pressure is needed.
6. Fusion takes place only in the core of the sun (see diagrams below) because it is only
there that the hydrogen gas is hot enough. From the core, the heat gets out by the
emission of photons (radiation zone). The hot gas then exchanges heat with the sun's
relatively cold exterior through convective currents. Huge columns of hot gas move
from the inside towards the surface. After giving up most of the heat, the "cold" gas
sinks towards the centre and the cycle goes on.
Look at the second diagram. Let M (r ) be the mass contained within radius r. We will not
assume that the density is constant in r. First find the inward directed gravitational force
GM (r ) × ρ (r )4π r 2 dr
on a shell of matter at radius r and thickness dr , dF = − . There is a
r2
net pressure as shown, and we will call dP the difference of pressures. Then obviously
GM (r ) ρ (r )dr dP GM (r ) ρ (r )
dF = dP × 4π r 2 . Hence, dP = − 2
or, =− . The total mass
r dr r2
r
upto radius r is, M ( r ) = ∫ ρ ( r ′)4π r ′2 dr ′. If we knew ρ (r ) then M (r ) would also be known.
0
By solving the differential equation, we would also then know P(r ). To give us a better
understanding, suppose for simplicity that the sun is approximately uniform. Then the
M
density is, ρ ≈ ≈ 1.4gm cm -3 . Hence the pressure at the centre of the sun can be
(4π / 3) R 3
GM ρ
computed, Pcentre ≈ ≈ 3 × 109 atmospheres.
R
9. Let us calculate the surface temperature of a planet circulating the sun. We shall use the
Stefan Boltzman law that you studied in the lecture of heat: the power radiated by a black
body per unit surface area at temperature T is σ T 4 .
For thermal equilibrium, we must have that all the R Rs
power absorbed from the sun is re-radiated by
the planet. Let PS be the sun's flux at its surface,
Then, PS = σ TS4 . Since radiation decreases by the TE Ts
( )
2
distance squared, PR = flux at earth = σ TS4 ×
Rs
R
( ) ( )
2 1/ 2
PR × π RE2 = PE × 4π RE2 . This gives σ TS4 × = 4σ TE4 , and hence TE =
Rs Rs
R 2R
TS . This is
true for any planet, so let's see what this gives for the earth using RS = 7 × 108 metres,
R = 1.5 × 1011 metres, and TS = 5800o K . This gives TE = 280 K , which is very sensible!
Of course, we have assumed that the earth absorbs and emits as a black body. True?
earth
In the above, you can see the narrow window in which the greenhouse gases absorb the
sun's radiation. But why are only some gases responsible, and not others?
11. The answer lies in quantum mechanics. In a previous lecture you learned that molecules
can exist only in certain states that have very specific values of energy. For example, a
molecule of CO 2 can oscillate and have equally spaced energy levels as shown below:
Core
He
H burning shell
hydrogen and producing helium. At the end of this phase, the core of the sun will have
mostly helium with the little bit of hydrogen left almost entirely outside in a thin layer.
The reaction rate will fall, and the core will no longer be able to balance the pull due to
gravity. This will cause a shrinkage of the core. As a consequence the temperature will
increase. A new phase of the sun is about to start.
The Sun as a Main
Sequence Star
13. At this point a fusion reaction in next core zone diameter = 1/100 AU
Q.1 Calculate the power radiated by a 10-cm-diameter sphere of metal at room temperature
(20 0C). Give your answer in watts.
Q.2 a) Why is the rotation period of the sun different at the equator as compared to that at
the poles?
b) How do we know for sure that the centre of the sun is much hotter than the surface?
c) Since it is impossible to go near the sun, how do we know the chemical composition
of the sun? Its surface temperature?
Q.3 In a previous lecture you learned how to calculate the average speed of atoms in a hot,
ideal gas.
a) Calculate the average speeds at T = 107 0 K and T = 2 × 107 0 K . Compare with the
average estimated from the graph below for hydrogen nuclei (protons) in a star.
b) Protons repel each other. Suppose two protons collide head-on with each other at some
speed v. What will be the closest they will come to each other.
c) For typical speeds in the graph below, estimate this distance. Will this be sufficient for
a nuclear reaction to take place?
dP GM (r ) ρ (r )
4. Refer to the derivation of the solar pressure-distance relation: =− . This
dr r2
was solved assuming a constant density. In this problem, suppose we assume a linearly
decreasing density, ρ (r ) = ρ0 (1 − r / R), where R is the solar radius.
a) Find M (r ) and make a plot.
dP GM (r ) ρ (r )
b) Solve =− . This is a first-order differential equation and so will have
dr r2
one arbitrary constant. How should it be fixed? Is it sensible to have P ( R) = 0?