The Modern
Revolution


j


n


Physics


Benjamin Crowell


Book 6 in the Light and Matter series of free introductory physics textbooks
www.Iightandmatter.com


﻿
﻿


The Modern
Revolution
in Physics


﻿


The Light and Matter series of
introductory physics textbooks:

1 Newtonian Physics
2 Conservation Laws
3 Vibrations and Waves
4 Electricity and Magnetism
5 Optics
6 The Modern Revolution in Physics


﻿

I


The Modern
Revolution
in Physics

Benjamin Crowell


www.lightandmatter.com


﻿


    Light and Matter
Fullerton, California
www. lightandmatter. com


copyright 1998-2003 Benjamin Crowell


edition 3.0
rev. May 9, 2008

B        This book    is licensed  under the   Creative  Com-
         mons   Attribution-ShareAlike  license,  version  1.0,
 @i      http://creativecommons.org/licenses/by-sa/1.0/, except
         for those photographs and drawings of which I am not
         the author, as listed in the photo credits. If you agree
         to the license, it grants you certain privileges that you
         would not otherwise have, such as the right to copy the
         book, or download the digital version free of charge from
         www.lightandmatter.com. At your option, you may also
         copy this book under the GNU Free Documentation
         License version 1.2, http://www.gnu.org/licenses/fdl.txt,
         with no invariant sections, no front-cover texts, and no
         back-cover texts.


ISBN 0-9704670-6-0


﻿


To Gretchen.


﻿
﻿


Brief Contents

1  Relativity 13
2 Rules of Randomness 43
3 Light as a Particle 67
4 Matter as a Wave 85
5 The Atom 111


﻿


                                  Contents


                                              2 Rules of Randomness
                                              2.1 Randomness Isn't Random . . . .    45
                                              2.2 Calculating Randomness . . . . .   46
                                                 Statistical independence, 46. Addition of
                                                 probabilities, 47. Normalization, 48.
                                                 Averages, 48.
                                             2.3 Probability Distributions . . . . . 50
                                                 Average and width of a probability
                                                 distribution, 51.
                                             2.4 Exponential Decay and Half-Life. . 52
                                                 Rate of decay, 55.
                                             2.5 f Applications of Calculus . . . . 57
                                             Summary    . . .  . . . . .  . . . . .  60
                                             Problems   . . .  . . . . .  . . . . .  62
1 Relativity
1.1 The Principle of Relativity. .... . 14
1.2 Distortion of Time and Space . . .  18
   Time, 18.-Space, 20. No simultaneity,
   20. Applications, 22.
1.3 Dynamics   . . . .  . . . . .  . .  27
   Combination   of   velocities,  27.-
   Momentum, 28. Equivalence of mass and
   energy, 31.
Summary . . . . . . . . . . . . . 37
Problems . . . . . . . . . . . . . 39

                                             3   Light as a Particle
                                             3.1 Evidence for Light as a Particle . . 68
                                  -          3.2 How Much Light Is One Photon?. .    70
                                                 The photoelectric effect, 70. An unex-
                                                 pected dependence on frequency, 71.
                                                 Numerical relationship between energy and
                                                 frequency, 72.
                                             3.3 Wave-Particle Duality . . . . . .   75
                                                 A wrong interpretation: photons inter-


10


﻿


   fering with each other, 76. The concept
   of a photon's path is undefined., 76.
   Another wrong interpretation: the pi-
   lot wave hypothesis, 77. The probability
   interpretation, 77.
3.4 Photons in Three Dimensions . . . 80
Summary . . . . . . . . . . . . .      81
Problems  . . . . . . . . . . . . .    82


Problems . . . . . . . . . . . . . 108

5 The Atom
5.1 Classifying States . . . . . . . . 112
5.2 Angular    Momentum     in   Three
Dimensions . . . . . . . . . . . . 113
   Three-dimensional angular momentum in
   classical physics, 113.-Three-dimensional
   angular momentum in quantum physics,
   114.
5.3 The Hydrogen Atom . . . . . . . 115
5.4 * Energies of States in Hydrogen . 117
   History, 117. Approximate treatment,
   118.
5.5 Electron Spin . . . . . . . . . . 120
5.6 Atoms With More Than One Electron 121
   Deriving the periodic table, 123.
Summary . . . . . . . . . . . . . 125
Problems . . . . . . . . . . . . . 127


4 Matter as a Wave
4.1 Electrons as Waves . . . . . . .
   What kind of wave is it?, 89.
4.2 f* Dispersive Waves . . . . . .
4.3 Bound States. . . . . . . . . .
4.4 The Uncertainty Principle. . . . .
   The    uncertainty principle,  96.
   Measurement and Schr6dinger's cat, 100.
4.5 Electrons in Electric Fields . . . .
   Tunneling, 102.
4.6 f* The Schrodinger Equation. . .
   Use of complex numbers, 105.
Summary . . . . . . . . . . . . .


86

91
93
96


101


102   Appendix 1: Exercises 130
      Appendix 2: Photo Credits 132
106 Appendix 3: Hints and Solutions 133


11


﻿
﻿


Chapter 1


Relativity


Complaining about the educational system is a national sport among
professors in the U.S., and I, like my colleagues, am often tempted
to imagine a golden age of education in our country's past, or to
compare our system unfavorably with foreign ones. Reality intrudes,
however, when my immigrant students recount the overemphasis on
rote memorization in their native countries, and the philosophy that
what the teacher says is always right, even when it's wrong.
    Albert Einstein's education in late-nineteenth-century Germany
was neither modern nor liberal. He did well in the early grades,1
but in high school and college he began to get in trouble for what
today's edspeak calls "critical thinking."
    Indeed, there was much that deserved criticism in the state of
physics at that time. There was a subtle contradiction between the
theory of light as a wave and Galileo's principle that all motion
is relative. As a teenager, Einstein began thinking about this on
an intuitive basis, trying to imagine what a light beam would look
like if you could ride along beside it on a motorcycle at the speed
of light. Today we remember him most of all for his radical and
far-reaching solution to this contradiction, his theory of relativity,
but in his student years his insights were greeted with derision from
his professors. One called him a "lazy dog." Einstein's distaste
for authority was typified by his decision as a teenager to renounce
his German citizenship and become a stateless person, based purely
on his opposition to the militarism and repressiveness of German
society. He spent his most productive scientific years in Switzerland
and Berlin, first as a patent clerk but later as a university professor.
He was an outspoken pacifist and a stubborn opponent of World
War I, shielded from retribution by his eventual acquisition of Swiss
citizenship.
    As the epochal nature of his work became evident, some liberal
Germans began to point to him as a model of the "new German,"
but after the Nazi coup d'etat, staged public meetings began, at
which Nazi scientists criticized the work of this ethnically Jewish
(but spiritually nonconformist) giant of science. When Hitler was
appointed chancellor, Einstein was on a stint as a visiting professor
at Caltech, and he never returned to the Nazi state. World War

   'The myth that he failed his elementary-school classes comes from a misun-
derstanding based on a reversal of the German numerical grading scale.


a / Albert Einstein.


13


﻿


b / The  first nuclear explo-
sion on our planet, Alamogordo,
New Mexico, July 16, 1945.


II convinced Einstein to soften his strict pacifist stance, and he
signed a secret letter to President Roosevelt urging research into
the building of a nuclear bomb, a device that could not have been
imagined without his theory of relativity. He later wrote, however,
that when Hiroshima and Nagasaki were bombed, it made him wish
he could burn off his own fingers for having signed the letter.
    Einstein has become a kind of scientific Santa Claus figure in
popular culture, which is presumably why the public is always so tit-
illated by his well-documented career as a skirt-chaser and unfaithful
husband. Many are also surprised by his lifelong commitment to so-
cialism. A favorite target of J. Edgar Hoover's paranoia, Einstein
had his phone tapped, his garbage searched, and his mail illegally
opened. A censored version of his 1800-page FBI file was obtained
in 1983 under the Freedom of Information Act, and a more com-
plete version was disclosed recently.2 It includes comments solicited
from anti-Semitic and pro-Nazi informants, as well as statements,
from sources who turned out to be mental patients, that Einstein
had invented a death ray and a robot that could control the human
mind. Even today, an FBI web page3 accuses him of working for
or belonging to 34 "communist-front" organizations, apparently in-
cluding the American Crusade Against Lynching. At the height of
the McCarthy witch hunt, Einstein bravely denounced McCarthy,
and publicly urged its targets to refuse to testify before the House
Unamerican Activities Committee. Belying his other-worldly and
absent-minded image, his political positions seem in retrospect not
to have been at all clouded by naivete or the more fuzzy-minded
variety of idealism. He worked against racism in the U.S. long be-
fore the civil rights movement got under way. In an era when many
leftists were only too eager to apologize for Stalinism, he opposed it
consistently.
    This chapter is specifically about Einstein's theory of relativ-
ity, but Einstein also began a second, parallel revolution in physics
known as the quantum theory, which stated, among other things,
that certain processes in nature are inescapably random. Ironically,
Einstein was an outspoken doubter of the new quantum ideas that
were built on his foundations, being convinced that "the Old One
[God] does not play dice with the universe," but quantum and rel-
ativistic concepts are now thoroughly intertwined in physics.


1.1 The Principle of Relativity
By the time Einstein was born, it had already been two centuries
since physicists had accepted Galileo's principle of inertia. One way
of stating this principle is that experiments with material objects
don't give different results due the straight-line, constant-speed mo-

   2Fred Jerome, The Einstein File, St. Martin's Press, 2002
   3http://foia.fbi.gov/foiaindex/einstein.htm


14             Chapter 1   Relativity


﻿


tion of the apparatus. For instance, if you toss a ball up in the
air while riding in a jet plane, nothing unusual happens; the ball
just falls back into your hand. Motion is relative. From your point
of view, the jet is standing still while the farms and cities pass by
underneath.
    The teenage Einstein was suspicious because his professors said
light waves obeyed an entirely different set of rules than material
objects, and in particular that light did not obey the principle of
inertia. They believed that light waves were a vibration of a myste-
rious substance called the ether, and that the speed of light should
be interpreted as a speed relative to this ether. Thus although the
cornerstone of the study of matter had for two centuries been the
idea that motion is relative, the science of light seemed to contain
a concept that a certain frame of reference was in an absolute state
of rest with respect to the ether, and was therefore to be preferred
over moving frames.
    Experiments, however, failed to detect this mysterious ether.
Apparently it surrounded everything, and even penetrated inside
physical objects; if light was a wave vibrating through the ether,
then apparently there was ether inside window glass or the human
eye. It was also surprisingly difficult to get a grip on this ether.
Light can also travel through a vacuum (as when sunlight comes to
the earth through outer space), so ether, it seemed, was immune to
vacuum pumps.
    Einstein decided that none of this made sense. If the ether was
impossible to detect or manipulate, one might as well say it didn't
exist at all. If the ether doesn't exist, then what does it mean when
our experiments show that light has a certain speed, 3 x 108 meters
per second? What is this speed relative to? Could we, at least in
theory, get on the motorcycle of Einstein's teenage daydreams, and
travel alongside a beam of light? In this frame of reference, the
beam's speed would be zero, but all experiments seemed to show
that the speed of light always came out the same, 3 x 108 m/s.
Einstein decided that the speed of light was dictated by the laws of
physics (the ones concerning electromagnetic induction), so it must
be the same in all frames of reference. This put both light and
matter on the same footing: both obeyed laws of physics that were
the same in all frames of reference.


    the principle of relativity
Experiments don't come out different due to the straight-line,
constant-speed motion of the apparatus. This includes both light
and matter.


This is almost the same as Galileo's principle of inertia, except that
we explicitly state that it applies to light as well.


Section 1.1   The Principle of Relativity


15


﻿


c / The Michelson-Morley experi-
ment, shown in photographs, and
drawings from the original 1887
paper. 1. A simplified draw-
ing of the apparatus. A beam of
light from the source, s, is par-
tially reflected and partially trans-
mitted by the half-silvered mirror
h1. The two half-intensity parts of
the beam are reflected by the mir-
rors at a and b, reunited, and ob-
served in the telescope, t. If the
earth's surface was supposed to
be moving through the ether, then
the times taken by the two light
waves to pass through the mov-
ing ether would be unequal, and
the resulting time lag would be
detectable by observing the inter-
ference between the waves when
they were reunited. 2. In the real
apparatus, the light beams were
reflected multiple times. The ef-
fective length of each arm was
increased to 11 meters, which
greatly improved its sensitivity to
the small expected difference in
the speed  of light. 3.   In an
earlier version of the experiment,
they had run into problems with
its "extreme sensitiveness to vi-
bration," which was "so great that
it was impossible to see the in-
terference fringes except at brief
intervals ... even  at two  o'clock
in the morning." They therefore
mounted the whole thing on a
massive stone floating in a pool of
mercury, which also made it pos-
sible to rotate it easily. 4. A photo
of the apparatus. Note that it is
underground, in a room with solid
brick walls.


    This is hard to swallow. If a dog is running away from me at 5
m/s relative to the sidewalk, and I run after it at 3 m/s, the dog's
velocity in my frame of reference is 2 m/s. According to everything
we have learned about motion, the dog must have different speeds
in the two frames: 5 m/s in the sidewalk's frame and 2 m/s in mine.
How, then, can a beam of light have the same speed as seen by
someone who is chasing the beam?
    In fact the strange constancy of the speed of light had shown up
in the now-famous Michelson-Morley experiment of 1887. Michel-
son and Morley set up a clever apparatus to measure any difference
in the speed of light beams traveling east-west and north-south.
The motion of the earth around the sun at 110,000 km/hour (about
0.01% of the speed of light) is to our west during the day. Michel-
son and Morley believed in the ether hypothesis, so they expected
that the speed of light would be a fixed value relative to the ether.
As the earth moved through the ether, they thought they would
observe an effect on the velocity of light along an east-west line.
For instance, if they released a beam of light in a westward direc-
tion during the day, they expected that it would move away from
them at less than the normal speed because the earth was chasing
it through the ether. They were surprised when they found that the
expected 0.01% change in the speed of light did not occur.


b


h1i


2


16             Chapter 1    Relativity


﻿


    Although the Michelson-Morley experiment was nearly two dec-
ades in the past by the time Einstein published his first paper on rel-
ativity in 1905, it's unclear how much it influenced Einstein. Michel-
son and Morley themselves were uncertain about whether the result
was to be trusted, or whether systematic and random errors were
masking a real effect from the ether. There were a variety of compet-
ing theories, each of which could claim some support from the shaky
data. Some physicists believed that the ether could be dragged along
by matter moving through it, which inspired variations on the ex-
periment that were conducted on mountaintops in thin-walled build-
ings, d, or with one arm of the appartus out in the open, and the
other surrounded by massive lead walls. In the standard sanitized
textbook version of the history of science, every scientist does his
experiments without any preconceived notions about the truth, and
any disagreement is quickly settled by a definitive experiment. In
reality, this period of confusion about the Michelson-Morley experi-
ment lasted for four decades, and a few reputable skeptics, including
Miller, continued to believe that Einstein was wrong, and kept try-
ing different variations of the experiment as late as the 1920's. Most
of the remaining doubters were convinced by an extremely precise
version of the experiment performed by Joos in 1930, although you
can still find kooks on the internet who insist that Miller was right,
and that there was a vast conspiracy to cover up his results.


d / Dayton Miller thought that the result of the Michelson-Morley ex-
periment could be explained because the ether had been pulled along by
the dirt, and the walls of the laboratory. This motivated him to carry out a
series of experiments at the top of Mount Wilson, in a building with thin
walls.

    Before Einstein, some physicists who did believe the negative re-
sult of the Michelson-Morley experiment came up with explanations
that preserved the ether. In the period from 1889 to 1895, Hendrik
Lorentz and George FitzGerald suggested that the negative result
of the Michelson-Morley experiment could be explained if the earth,
and every physical object on its surface, was contracted slightly by
the strain of the earth's motion through the ether.4

   4See discussion question F on page 26, and homework problem 12


e / Albert Michelson, in 1887,
the year of the Michelson-Morley
experiment.


f / George  FitzGerald,  1851-
1901.


g / Hendrik Lorentz, 1853-1928.


Section 1.1   The Principle of Relativity


17


﻿


1.2 Distortion of Time and Space


                                  Time
                                      Consider the situation shown in figure h. Aboard a rocket ship
                                  we have a tube with mirrors at the ends. If we let off a flash of light at
                                  the bottom of the tube, it will be reflected back and forth between
                                  the top and bottom. It can be used as a clock; by counting the
                                  number of times the light goes back and forth we get an indication
                                  of how much time has passed: up-down up-down, tick-tock tick-
                                  tock. (This may not seem very practical, but a real atomic clock
                                  does work on essentially the same principle.) Now imagine that the
                                  rocket is cruising at a significant fraction of the speed of light relative
                                  to the earth. Motion is relative, so for a person inside the rocket,
                                  h/i, there is no detectable change in the behavior of the clock, just
                                  as a person on a jet plane can toss a ball up and down without
                                  noticing anything unusual. But to an observer in the earth's frame
                                  of reference, the light appears to take a zigzag path through space,
                                  h/2, increasing the distance the light has to travel.


h / A light beam bounces between                                       -
two mirrors in a spaceship.                  1


                                             2


                                      If we didn't believe in the principle of relativity, we could say
                                  that the light just goes faster according to the earthbound observer.
                                  Indeed, this would be correct if the speeds were much less than the
                                  speed of light, and if the thing traveling back and forth was, say,
                                  a ping-pong ball. But according to the principle of relativity, the
                                  speed of light must be the same in both frames of reference. We are
                                  forced to conclude that time is distorted, and the light-clock appears
                                  to run more slowly than normal as seen by the earthbound observer.
                                  In general, a clock appears to run most quickly for observers who
                                  are in the same state of motion as the clock, and runs more slowly
                                  as perceived by observers who are moving relative to the clock.
                                      We can easily calculate the size of this time-distortion effect. In
                                  the frame of reference shown in figure h/i, moving with the space-


18             Chapter 1   Relativity


﻿


ship, let t be the time required for the beam of light to move from
the bottom to the top. An observer on the earth, who sees the sit-
uation shown in figure h/2, disagrees, and says this motion took a
longer time T (a bigger letter for the bigger time). Let v be the
velocity of the spaceship relative to the earth. In frame 2, the light
beam travels along the hypotenuse of a right triangle, figure i, whose
base has length
                          base = vT
Observers in the two frames of reference agree on the vertical dis-
tance traveled by the beam, i.e., the height of the triangle perceived
in frame 2, and an observer in frame 1 says that this height is the
distance covered by a light beam in time t, so the height is
                         height = ct
where c is the speed of light. The hypotenuse of this triangle is the
distance the light travels in frame 2,
                      hypotenuse = cT       .                                      cT       Ct
Using the Pythagorean theorem, we can relate these three quanti-
ties,
                    (cT)2 = (vT)2 + (ct)2
and solving for T, we find

                      T =        t                                                  vT
                             1- (v/c)2                                  i / One observer says the light
                                                                        went a distance cT, while the
    The amount of distortion is given by the factor 1/ 1 - (v/c)2 other says it only had to travel ct.
and this quantity appears so often that we give it a special name, 7
(Greek letter gamma),
                                 1

                             1- (v/c)2

  self-check A
  What is 'y when v = 0? What does this mean?   > Answer, p. 133
    We are used to thinking of time as absolute and universal, so it
is disturbing to find that it can flow at a different rate for observers
in different frames of reference. But consider the behavior of the Y
factor shown in figure j. The graph is extremely flat at low speeds,
and even at 20% of the speed of light, it is difficult to see anything
happening to 'y. In everyday life, we never experience speeds that
are more than a tiny fraction of the speed of light, so this strange
strange relativistic effect involving time is extremely small. This
makes sense: Newton's laws have already been thoroughly tested
by experiments at such speeds, so a new theory like relativity must
agree with the old one in their realm of common applicability. This
requirement of backwards-compatibility is known as the correspon-
dence principle.


Section 1.2   Distortion of Time and Space


19


﻿


7


       6-


       5-

  Y
       4


       3


       2


          0        0.2       0.4       0.6       0.8       1.0

                                   v/c

j / The behavior of the -y factor.


Space
    The speed of light is supposed to be the same in all frames of ref-
erence, and a speed is a distance divided by a time. We can't change
time without changing distance, since then the speed couldn't come
out the same. If time is distorted by a factor of -y, then lengths must
also be distorted according to the same ratio. An object in motion
appears longest to someone who is at rest with respect to it, and is
shortened along the direction of motion as seen by other observers.

No simultaneity
    Part of the concept of absolute time was the assumption that it
was valid to say things like, "I wonder what my uncle in Beijing is
doing right now." In the nonrelativistic world-view, clocks in Los
Angeles and Beijing could be synchronized and stay synchronized,
so we could unambiguously define the concept of things happening
simultaneously in different places. It is easy to find examples, how-
ever, where events that seem to be simultaneous in one frame of
reference are not simultaneous in another frame. In figure k, a flash
of light is set off in the center of the rocket's cargo hold. According
to a passenger on the rocket, the parts of the light traveling for-
ward and backward have equal distances to travel to reach the front
and back walls, so they get there simultaneously. But an outside
observer who sees the rocket cruising by at high speed will see the


20             Chapter 1   Relativity


﻿


flash hit the back wall first, because the wall is rushing up to meet
it, and the forward-going part of the flash hit the front wall later,
because the wall was running away from it.


                                                                      k / Different observers don't agree
                                                                      that the flashes of light hit the
                                                                      front and back of the ship simul-
                                                                      taneously.


    We conclude that simultaneity is not a well-defined concept.
This idea may be easier to accept if we compare time with space.
Even in plain old Galilean relativity, points in space have no iden-
tity of their own: you may think that two events happened at the
same point in space, but anyone else in a differently moving frame
of reference says they happened at different points in space. For
instance, suppose you tap your knuckles on your desk right now,
count to five, and then do it again. In your frame of reference, the
taps happened at the same location in space, but according to an
observer on Mars, your desk was on the surface of a planet hurtling
through space at high speed, and the second tap was hundreds of
kilometers away from the first.
    Relativity says that time is the same way  both simultaneity
and "simulplaceity" are meaningless concepts. Only when the rela-
tive velocity of two frames is small compared to the speed of light
will observers in those frames agree on the simultaneity of events.


                                                                      I / In the garage's frame of refer-
  1   -ence, 1, the bus is moving, and
                                                                      can fit in the garage. In the bus's
                                                                      frame of reference, the garage is
                                                                      moving, and can't hold the bus.


 2

                   ||STRC |I


          "


The garage paradox
    One of the most famous of all the so-called relativity paradoxes
has to do with our incorrect feeling that simultaneity is well defined.
The idea is that one could take a schoolbus and drive it at relativistic
speeds into a garage of ordinary size, in which it normally would not
fit. Because of the length contraction, the bus would supposedly fit


Section 1.2   Distortion of Time and Space


21


﻿


     84
  6 1
  4 E
)
  0
  E 24


in the garage. The paradox arises when we shut the door and then
quickly slam on the brakes of the bus. An observer in the garage's
frame of reference will claim that the bus fit in the garage because of
its contracted length. The driver, however, will perceive the garage
as being contracted and thus even less able to contain the bus. The
paradox is resolved when we recognize that the concept of fitting the
bus in the garage "all at once" contains a hidden assumption, the
assumption that it makes sense to ask whether the front and back of
the bus can simultaneously be in the garage. Observers in different
frames of reference moving at high relative speeds do not necessarily
agree on whether things happen simultaneously. The person in the
garage's frame can shut the door at an instant he perceives to be
simultaneous with the front bumper's arrival at the back wall of the
garage, but the driver would not agree about the simultaneity of
these two events, and would perceive the door as having shut long
after she plowed through the back wall.

Applications
Nothing can go faster than the speed of light.
    What happens if we want to send a rocket ship off at, say, twice
the speed of light, v= 2c? Then - will be 1/ 5-3. But your
math teacher has always cautioned you about the severe penalties
for taking the square root of a negative number. The result would
be physically meaningless, so we conclude that no object can travel
faster than the speed of light. Even travel exactly at the speed of
light appears to be ruled out for material objects, since - would
then be infinite.
    Einstein had therefore found a solution to his original paradox
about riding on a motorcycle alongside a beam of light. The paradox
is resolved because it is impossible for the motorcycle to travel at
the speed of light.
    Most people, when told that nothing can go faster than the speed
of light, immediately begin to imagine methods of violating the rule.
For instance, it would seem that by applying a constant force to an
object for a long time, we could give it a constant acceleration,
which would eventually make it go faster than the speed of light.
We'll take up these issues in section 1.3.

Cosmic-ray muons
    A classic experiment to demonstrate time distortion uses obser-
vations of cosmic rays. Cosmic rays are protons and other atomic nu-
clei from outer space. When a cosmic ray happens to come the way
of our planet, the first earth-matter it encounters is an air molecule
in the upper atmosphere. This collision then creates a shower of
particles that cascade downward and can often be detected at the
earth's surface. One of the more exotic particles created in these cos-
mic ray showers is the muon (named after the Greek letter mu, p).


0    2     4    6
    time since creation
    (microseconds)


8


m / Decay  of muons created
at rest with respect to the
observer.


100
0c80
S 60
S40-
0
E 20-


      0-1|    1    i    i     |
        0    2     4    6    8
            time since creation
            (microseconds)

n / Decay  of  muons    moving
at a speed of 0.995c with respect
to the observer.


22             Chapter 1    Relativity


﻿


The reason muons are not a normal part of our environment is that
a muon is radioactive, lasting only 2.2 microseconds on the average
before changing itself into an electron and two neutrinos. A muon
can therefore be used as a sort of clock, albeit a self-destructing and
somewhat random one! Figures m and n show the average rate at
which a sample of muons decays, first for muons created at rest and
then for high-velocity muons created in cosmic-ray showers. The
second graph is found experimentally to be stretched out by a fac-
tor of about ten, which matches well with the prediction of relativity
theory:

                          7 = 1/1 -(v/c)2
                          = 1//1 - (0.995)2
                          ~J10

Since a muon takes many microseconds to pass through the atmo-
sphere, the result is a marked increase in the number of muons that
reach the surface.

Time dilation for objects larger than the atomic scale
    Our world is (fortunately) not full of human-scale objects mov-
ing at significant speeds compared to the speed of light. For this
reason, it took over 80 years after Einstein's theory was published
before anyone could come up with a conclusive example of drastic
time dilation that wasn't confined to cosmic rays or particle accel-
erators. Recently, however, astronomers have found definitive proof
that entire stars undergo time dilation. The universe is expanding
in the aftermath of the Big Bang, so in general everything in the
universe is getting farther away from everything else. One need only
find an astronomical process that takes a standard amount of time,
and then observe how long it appears to take when it occurs in a
part of the universe that is receding from us rapidly. A type of ex-
ploding star called a type Ia supernova fills the bill, and technology
is now sufficiently advanced to allow them to be detected across vast
distances. Figure o shows convincing evidence for time dilation in
the brightening and dimming of two distant supernovae.


The twin paradox
    A natural source of confusion in understanding the time-dilation
effect is summed up in the so-called twin paradox, which is not really
a paradox. Suppose there are two teenaged twins, and one stays at
home on earth while the other goes on a round trip in a spaceship at


Section 1.2   Distortion of Time and Space


23


﻿


o / Light curves of supernovae,                    .                supernova 1994H, receding from us at
showing a time-dilation effect for      no time dilation:           69% of the speed of light (Goldhaber et al.)
supernovae that are in motion rel-      nearby supernovae  
          atv  ou.not moving rapidly                              o supernova 1997ap, receding from us at
ative to us.                            relative to us              84% of the speed of light (Perlmutter et al.)
                                     a

                                     IA}
                                     V)
                                     -c
                                     .1


                                                0         20         40        60         80        100
                                                                      time (days)


                                     relativistic speeds (i.e., speeds comparable to the speed of light, for
                                     which the effects predicted by the theory of relativity are important).
                                     When the traveling twin gets home, she has aged only a few years,
                                     while her sister is now old and gray. (Robert Heinlein even wrote
                                     a science fiction novel on this topic, although it is not one of his
                                     better stories.)
                                         The "paradox" arises from an incorrect application of the prin-
                                     ciple of relativity to a description of the story from the traveling
                                     twin's point of view. From her point of view, the argument goes,
                                     her homebody sister is the one who travels backward on the receding
                                     earth, and then returns as the earth approaches the spaceship again,
                                     while in the frame of reference fixed to the spaceship, the astronaut
                                     twin is not moving at all. It would then seem that the twin on earth
                                     is the one whose biological clock should tick more slowly, not the
                                     one on the spaceship. The flaw in the reasoning is that the principle
                                     of relativity only applies to frames that are in motion at constant
                                     velocity relative to one another, i.e., inertial frames of reference.
                                     The astronaut twin's frame of reference, however, is noninertial, be-
                                     cause her spaceship must accelerate when it leaves, decelerate when
                                     it reaches its destination, and then repeat the whole process again
                                     on the way home. Their experiences are not equivalent, because
                                     the astronaut twin feels accelerations and decelerations. A correct
                                     treatment requires some mathematical complication to deal with the
                                     changing velocity of the astronaut twin, but the result is indeed that
                                     it's the traveling twin who is younger when they are reunited.5

                                        5Readers frequently wonder why the effects of the decelerations don't cancel
                                     out the effects of the accelerations. There are a couple of subtle issues here. First,
                                     there's no clearcut way to decide whether the time distortion happens during
                                     the accelerations and decelerations, or during the long periods of constant-speed
                                     cruising in between. This is because simultaneity isn't well defined, so there's no
                                     well-defined answer if Earth-bound Emma asks, "Is my sister's time distorted
                                     right now?" During the long period when spacefaring Sarah is cruising away
                                     from Earth at constant speed, Emma may observe that her sister's voice on


24              Chapter 1    Relativity


﻿


    The twin "paradox" really isn't a paradox at all. It may even be
a part of your ordinary life. The effect was first verified experimen-
tally by synchronizing two atomic clocks in the same room, and then
sending one for a round trip on a passenger jet. (They bought the
clock its own ticket and put it in its own seat.) The clocks disagreed
when the traveling one got back, and the discrepancy was exactly
the amount predicted by relativity. The effects are strong enough
to be important for making the global positioning system (GPS)
work correctly. If you've ever taken a GPS receiver with you on a
hiking trip, then you've used a device that has the twin "paradox"
programmed into its calculations. Your handheld GPS box gets sig-
nals from a satellite, and the satellite is moving fast enough that its
time dilation is an important effect. So far no astronauts have gone
fast enough to make time dilation a dramatic effect in terms of the
human lifetime. The effect on the Apollo astronauts, for instance,
was only a fraction of a second, since their speeds were still fairly
small compared to the speed of light. (As far as I know, none of the
astronauts had twin siblings back on earth!)

An example of length contraction
    Figure p shows an artist's rendering of the length contraction for
the collision of two gold nuclei at relativistic speeds in the RHIC ac-
celerator in Long Island, New York, which went on line in 2000. The
gold nuclei would appear nearly spherical (or just slightly lengthened
like an American football) in frames moving along with them, but in
the laboratory's frame, they both appear drastically foreshortened
as they approach the point of collision. The later pictures show the
nuclei merging to form a hot soup, in which experimenters hope to
observe a new form of matter.
the radio sounds abnormally slow, and conclude that the time distortion is in
progress. Sarah, however, says that she herself is normal, and that Emma is
the one who sounds slow. Each twin explains the other's perceptions as being
due to the increasing separation between them, which causes the radio signals
to be delayed more and more. The other thing to understand is that, even if
we do decide to attribute the time distortion to the periods of acceleration and
deceleration, we should expect the time-distorting effects of accelerations and
decelerations to reinforce, not cancel. This is because there is no clear distinction
between acceleration and deceleration that can be agreed upon by observers in
different inertial frames. This is a fact about plain old Galilean relativity, not
Einstein's relativity. Suppose a car is initially driving westward at 100 km/hr
relative to the asphalt, then slams on the brakes and stops completely. In the
asphalt's frame of reference, this is a deceleration. But from the point of view
of an observer who is watching the earth rotate to the east, the asphalt may be
moving eastward at a speed of 1000 km/hr. This observer sees the brakes cause
an acceleration, from 900 km/hr to 1000 km/hr: the asphalt has pulled the car
forward, forcing car to match its velocity.


Section 1.2  Distortion of Time and Space


25


﻿


                                     p / Colliding nuclei show relativistic length contraction.


                                     Discussion Questions
                                     A      A person in a spaceship moving at 99.99999999% of the speed
                                     of light relative to Earth shines a flashlight forward through dusty air, so
                                     the beam is visible. What does she see? What would it look like to an
                                     observer on Earth?
                                     B     A question that students often struggle with is whether time and
                                     space can really be distorted, or whether it just seems that way. Compare
                                     with optical illusions or magic tricks. How could you verify, for instance,
                                     that the lines in the figure are actually parallel? Are relativistic effects the
                                     same or not?
                                     C     On a spaceship moving at relativistic speeds, would a lecture seem
Discussion question B         even longer and more boring than normal?

                                     D     Mechanical clocks can be affected by motion. For example, it was
                                     a significant technological achievement to build a clock that could sail
                                     aboard a ship and still keep accurate time, allowing longitude to be deter-
                                     mined. How is this similar to or different from relativistic time dilation?
                                     E     What would the shapes of the two nuclei in the RHIC experiment
                                     look like to a microscopic observer riding on the left-hand nucleus? To
                                     an observer riding on the right-hand one? Can they agree on what is
                                     happening? If not, why not - after all, shouldn't they see the same thing
                                     if they both compare the two nuclei side-by-side at the same instant in
                                     time?
                                     F If you stick a piece of foam rubber out the window of your car while
                                     driving down the freeway, the wind may compress it a little. Does it make
                                     sense to interpret the relativistic length contraction as a type of strain
                                     that pushes an object's atoms together like this? How does this relate to
                                     discussion question E?


26              Chapter 1    Relativity


﻿


   1  Dynamics
So far we have said nothing about how to predict motion in relativ-
ity. Do Newton's laws still work? Do conservation laws still apply?
The answer is yes, but many of the definitions need to be modified,
and certain entirely new phenomena occur, such as the conversion
of mass to energy and energy to mass, as described by the famous
equation E = mc2.

Combination of velocities
    The impossibility of motion faster than light is a radical differ-
ence between relativistic and nonrelativistic physics, and we can get
at most of the issues in this section by considering the flaws in vari-
ous plans for going faster than light. The simplest argument of this
kind is as follows. Suppose Janet takes a trip in a spaceship, and
accelerates until she is moving at 0.8c (80% of the speed of light)
relative to the earth. She then launches a space probe in the forward
direction at a speed relative to her ship of 0.4c. Isn't the probe then
moving at a velocity of 1.2 times the speed of light relative to the
earth?
    The problem with this line of reasoning is that although Janet
says the probe is moving at 0.4c relative to her, earthbound observers
disagree with her perception of time and space. Velocities therefore
don't add the same way they do in Galilean relativity. Suppose we
express all velocities as fractions of the speed of light. The Galilean
addition of velocities can be summarized in this addition table:


      -1.00 -0.80 -0.60 -0.40 -0.20 -0.00 0.20 0.40 0.60 0.80 1.00
 -1.00 -2.00 -1.80 -1.60 -1.40 -1.20 -1.00 -0.80 -0.60 -0.40 -0.20 0.00
 -0.80 -1.80 -1.60 -1.40 -1.20 -1.00 -0.80 -0.60 -0.40 -0.20 0.00 0.20
 -0.60 -1.60 -1.40 -1.20 -1.00 -0.80 -0.60 -0.40 -0.20 -0.00 0.20 0.40
 -0.40 -1.40 -1.20 -1.00 -0.80 -0.60 -0.40 -0.20 -0.00 0.20 0.40 0.60
 -0.20 -1.20 -1.00 -0.80 -0.60 -0.40 -0.20 -0.00 0.20 0.40 0.60 0.80
 -0.00 -1.00 -0.80 -0.60 -0.40 -0.20 -0.00 0.20 0.40 0.60 0.80 1.00
 0.20 -0.80 -0.60 -0.40 -0.20 -0.00 0.20 0.40 0.60 0.80 1.00 1.20
 0.40 -0.60 -0.40 -0.20 -0.00 0.20 0.40 0.60 0.80 1.00 1.20 1.40
 0.60 -0.40 -0.20 -0.00 0.20 0.40 0.60 0.80 1.00 1.20 1.40 1.60
 0.80 -0.20 0.00 0.20 0.40 0.60 0.80 1.00 1.20 1.40 1.60 1.80
 1.00 0.00 0.20 0.40 0.60 0.80 1.00 1.20 1.40 1.60 1.80 2.00


 The derivation of the correct relativistic result requires some tedious
 algebra, which you can find in my book Simple Nature if you're
 curious. I'll just state the numerical results here:


    Janet's probe, for example, is moving not at 1.2c but at 0.91c,
which is a drastically different result. The difference between the
two tables is most evident around the edges, where all the results
are equal to the speed of light. This is required by the principle of
relativity. For example, if Janet sends out a beam of light instead


q / Galilean addition of velocities.


Section 1.3    Dynamics               27


﻿


r / Relativistic addition of veloci-
ties. The green oval near the cen-
ter of the table describes veloci-
ties that are relatively small com-
pared to the speed of light, and
the results are approximately the
same as the Galilean ones. The
edges of the table, highlighted in
blue, show that everyone agrees
on the speed of light.


     -1.00 -0.80 -0.60 -0.40 -0.20 -0.00 0.20 0.40 0.60 0.80 1.00
-1.00 -1.00 -1.00 -1.00 -1.00 -1.00 -1.00 -1.00 -1.00 -1.00 -1.00
-0.80 -1.00 -0.98 -0.95 -0.91 -0.86 -0.80 -0.71 -0.59 -0.38 0.00 1.00
-0.60 -1.00 -0.95 -0.88 -0.81 -0.71 -0.60 -0.45 -0.26 -0.00  0.38 1.00
-0.40 -1.00 -0.91 -0.81 -0.69 -0.56 -0.40 -0.22 -0.00  0.26 0.59 1.00
-0.20 -1.00 -0.86 -0.71 -0.56 -0.38 -0.20 -0.00  0.22 0.45 0.71 1.00
-0.00 -1.00 -0.80 -0.60 -0.40 -0.20 -0.00 0.20 0.40  0.60  0.80  1.00
0.20 -1.00 -0.71 -0.45 -0.22 -0.00  0.20 0.38 0.56 0.71 0.86 1.00
0.40 -1.00 -0.59 -0.26 -0.00 0.22 0.40 0.56 0.69 0.81 0.91 1.00
0.60 -1.00 -0.38 -0.00 0.26 0.45 0.60 0.71 0.81 0.88 0.95 1.00
0.80 -1.00 0.00  0.38 0.59 0.71 0.80 0.86 0.91 0.95 0.98 1.00
1.00        1.00 1.00  1.00 1.00 1.00  1.00 1.00  1.00 1.00  1.00


of a probe, both she and the earthbound observers must agree that
it moves at 1.00 times the speed of light, not 0.8 + 1 = 1.8. On
the other hand, the correspondence principle requires that the rela-
tivistic result should correspond to ordinary addition for low enough
velocities, and you can see that the tables are nearly identical in the
center.

Momentum
    Here's another flawed scheme for traveling faster than the speed
of light. The basic idea can be demonstrated by dropping a ping-
pong ball and a baseball stacked on top of each other like a snowman.
They separate slightly in mid-air, and the baseball therefore has time
to hit the floor and rebound before it collides with the ping-pong
ball, which is still on the way down. The result is a surprise if you
haven't seen it before: the ping-pong ball flies off at high speed and
hits the ceiling! A similar fact is known to people who investigate
the scenes of accidents involving pedestrians. If a car moving at
90 kilometers per hour hits a pedestrian, the pedestrian flies off at
nearly double that speed, 180 kilometers per hour. Now suppose
the car was moving at 90 percent of the speed of light. Would the
pedestrian fly off at 180% of c?
    To see why not, we have to back up a little and think about
where this speed-doubling result comes from. For any collision, there
is a special frame of reference, the center-of-mass frame, in which
the two colliding objects approach each other, collide, and rebound
with their velocities reversed. In the center-of-mass frame, the total
momentum of the objects is zero both before and after the collision.


    Figure s/1 shows such a frame of reference for objects of very
unequal mass. Before the collision, the large ball is moving relatively
slowly toward the top of the page, but because of its greater mass,
its momentum cancels the momentum of the smaller ball, which is
moving rapidly in the opposite direction. The total momentum is
zero. After the collision, the two balls just reverse their directions of
motion. We know that this is the right result for the outcome of the
collision because it conserves both momentum and kinetic energy,


28             Chapter 1    Relativity


﻿


  (1)


     O            0            OOO


  (2)


                                              O         O


s / An unequal collision, viewed in the center-of-mass frame, 1, and
in the frame where the small ball is initially at rest, 2. The motion is
shown as it would appear on the film of an old-fashioned movie camera,
with an equal amount of time separating each frame from the next. Film
1 was made by a camera that tracked the center of mass, film 2 by one
that was initially tracking the small ball, and kept on moving at the same
speed after the collision.


and everything not forbidden is mandatory, i.e., in any experiment,
there is only one possible outcome, which is the one that obeys all
the conservation laws.
  self-check B
  How do we know that momentum and kinetic energy are conserved in
  figure s/1?                                    > Answer, p. 133
  Let's make up some numbers as an example. Say the small ball
has a mass of 1 kg, the big one 8 kg. In frame 1, let's make the
velocities as follows:
         before the collision     after the collision
     *   -0.8                     0.8
     o   0.1                      -0.1

   Figure s/2 shows the same collision in a frame of reference where
the small ball was initially at rest. To find all the velocities in this
frame, we just add 0.8 to all the ones in the previous table.
         before the collision     after the collision
     *   0                        1.6
     o   0.9                      0.7
In this frame, as expected, the small ball flies off with a velocity,


Section 1.3   Dynamics             29


﻿


t / An 8-kg ball moving at 83% of
the speed of light hits a 1-kg ball.
The balls appear foreshortened
due to the relativistic distortion of
space.


1.6, that is almost twice the initial velocity of the big ball, 0.9.
    If all those velocities were in meters per second, then that's ex-
actly what happened. But what if all these velocities were in units
of the speed of light? Now it's no longer a good approximation
just to add velocities. We need to combine them according to the
relativistic rules. For instance, the table on page 28 tells us that
combining a velocity of 0.8 times the speed of light with another
velocity of 0.8 results in 0.98, not 1.6. The results are very different:
         before the collision      after the collision
     *   0                        0.98
     o   0.83                     0.76


     We can interpret this as follows. Figure s/1 is one in which the
big ball is moving fairly slowly. This is very nearly the way the
scene would be seen by an ant standing on the big ball. According
to an observer in frame t, however, both balls are moving at nearly
the speed of light after the collision. Because of this, the balls
appear foreshortened, but the distance between the two balls is also
shortened. To this observer, it seems that the small ball isn't pulling
away from the big ball very fast.
    Now here's what's interesting about all this. The outcome shown
in figure s/2 was supposed to be the only one possible, the only
one that satisfied both conservation of energy and conservation of
momentum. So how can the different result shown in figure t be
possible? The answer is that relativistically, momentum must not
equal mv. The old, familiar definition is only an approximation
that's valid at low speeds. If we observe the behavior of the small
ball in figure t, it looks as though it somehow had some extra inertia.
It's as though a football player tried to knock another player down
without realizing that the other guy had a three-hundred-pound bag
full of lead shot hidden under his uniform   he just doesn't seem
to react to the collision as much as he should. This extra inertia is
described by redefining momentum as
                          p = m-yv
At very low velocities, -' is close to 1, and the result is very nearly
my, as demanded by the correspondence principle. But at very


30             Chapter 1    Relativity


﻿


high velocities, y gets very big   the small ball in figure t has a
y of 5.0, and therefore has five times more inertia than we would
expect nonrelativistically.
    This also explains the answer to another paradox often posed
by beginners at relativity. Suppose you keep on applying a steady
force to an object that's already moving at 0.9999c. Why doesn't
it just keep on speeding up past c? The answer is that force is the
rate of change of momentum. At 0.9999c, an object already has a y
of 71, and therefore has already sucked up 71 times the momentum
you'd expect at that speed. As its velocity gets closer and closer to
c, its y approaches infinity. To move at c, it would need an infinite
momentum, which could only be caused by an infinite force.

Equivalence of mass and energy
    Now we're ready to see why mass and energy must be equivalent
as claimed in the famous E = mc2. So far we've only considered
collisions in which none of the kinetic energy is converted into any
other form of energy, such as heat or sound. Let's consider what
happens if a blob of putty moving at velocity v hits another blob
that is initially at rest, sticking to it. The nonrelativistic result is
that to obey conservation of momentum the two blobs must fly off
together at v/2. Half of the initial kinetic energy has been converted
to heat.6
    Relativistically, however, an interesting thing happens. A hot
object has more momentum than a cold object! This is because
the relativistically correct expression for momentum is myv, and
the more rapidly moving atoms in the hot object have higher values
of -y. In our collision, the final combined blob must therefore be
moving a little more slowly than the expected v/2, since otherwise
the final momentum would have been a little greater than the initial
momentum. To an observer who believes in conservation of momen-
tum and knows only about the overall motion of the objects and not
about their heat content, the low velocity after the collision would
seem to be the result of a magical change in the mass, as if the mass
of two combined, hot blobs of putty was more than the sum of their
individual masses.
    Now we know that the masses of all the atoms in the blobs must
be the same as they always were. The change is due to the change in
J with heating, not to a change in mass. The heat energy, however,
seems to be acting as if it was equivalent to some extra mass.
    But this whole argument was based on the fact that heat is a
form of kinetic energy at the atomic level. Would £ =imc2 apply to

   6A double-mass object moving at half the speed does not have the same
kinetic energy. Kinetic energy depends on the square of the velocity, so cutting
the velocity in half reduces the energy by a factor of 1/4, which, multiplied by
the doubled mass, makes 1/2 the original energy.


Section 1.3   Dynamics               31


﻿


other forms of energy as well? Suppose a rocket ship contains some
electrical energy stored in a battery. If we believed that E = mc2
applied to forms of kinetic energy but not to electrical energy, then
we would have to believe that the pilot of the rocket could slow
the ship down by using the battery to run a heater! This would
not only be strange, but it would violate the principle of relativity,
because the result of the experiment would be different depending
on whether the ship was at rest or not. The only logical conclusion is
that all forms of energy are equivalent to mass. Running the heater
then has no effect on the motion of the ship, because the total
energy in the ship was unchanged; one form of energy (electrical)
was simply converted to another (heat).
   The equation E = mc2 tells us how much energy is equivalent
to how much mass: the conversion factor is the square of the speed
of light, c. Since c a big number, you get a really really big number
when you multiply it by itself to get c2. This means that even a small
amount of mass is equivalent to a very large amount of energy.


  UGHTALASKEW

     IN TE HEAS

  Men of Science More or Less
  Agog Over Results of Eclipse
         Observations.

  EINSTEIN THEORY TRIUMPHS

  Stars Not Where They Seemed
  or Were Calculated to be,
     but Nobody Need Worry.

  A BOOK FOR 12 WISE MEN

  No More in All the World could
  Comprehend It, Said EinsteIn When
  His Daring Publishers Accepted it.

u /A  New  York Times head-
line from November 10, 1919,
describing  the   observations
discussed in example 1.


v / example 1


' Gravity bending light                             example 1
Gravity is a universal attraction between things that have mass,
and since the energy in a beam of light is equivalent to a some
very small amount of mass, we expect that light will be affected
by gravity, although the effect should be very small. The first im-
portant experimental confirmation of relativity came in 1919 when
stars next to the sun during a solar eclipse were observed to have


32            Chapter 1    Relativity


﻿


  shifted a little from their ordinary position. (If there was no eclipse,
  the glare of the sun would prevent the stars from being observed.)
  Starlight had been deflected by the sun's gravity. Figure v is a
  photographic negative, so the circle that appears bright is actu-
  ally the dark face of the moon, and the dark area is really the
  bright corona of the sun. The stars, marked by lines above and
  below then, appeared at positions slightly different than their nor-
  mal ones.
  Black holes                                          example 2
  A star with sufficiently strong gravity can prevent light from leav-
  ing. Quite a few black holes have been detected via their gravita-
  tional forces on neighboring stars or clouds of gas and dust.
    You've learned about conservation of mass and conservation of
energy, but now we see that they're not even separate conservation
laws. As a consequence of the theory of relativity, mass and en-
ergy are equivalent, and are not separately conserved  one can
be converted into the other. Imagine that a magician waves his
wand, and changes a bowl of dirt into a bowl of lettuce. You'd be
impressed, because you were expecting that both dirt and lettuce
would be conserved quantities. Neither one can be made to vanish,
or to appear out of thin air. However, there are processes that can
change one into the other. A farmer changes dirt into lettuce, and
a compost heap changes lettuce into dirt. At the most fundamen-
tal level, lettuce and dirt aren't really different things at all; they're
just collections of the same kinds of atoms carbon, hydrogen, and
so on. Because mass and energy are like two different sides of the
same coin, we may speak of mass-energy, a single conserved quantity,
found by adding up all the mass and energy, with the appropriate
conversion factor: E + mc2
  A rusting nail                                       example 3
  > An iron nail is left in a cup of water until it turns entirely to rust.
  The energy released is about 0.5 MJ. In theory, would a suffi-
  ciently precise scale register a change in mass? If so, how much?
  > The energy will appear as heat, which will be lost to the envi-
  ronment. The total mass-energy of the cup, water, and iron will
  indeed be lessened by 0.5 MJ. (If it had been perfectly insulated,
  there would have been no change, since the heat energy would
  have been trapped in the cup.) The speed of light is c = 3 x 108
  meters per second, so converting to mass units, we have
                       E
                  m =-
                       _0.5 x 106_J
                       (3 x 108 in/s)2
                     = 6 x 10--2 kilograms.
  The change in mass is too small to measure with any practical


Section 1.3   Dynamics              33


﻿


  technique. This is because the square of the speed of light is
  such a large number.
  Electron-positron annihilation                      example 4
  Natural radioactivity in the earth produces positrons, which are
  like electrons but have the opposite charge. A form of antimat-
  ter, positrons annihilate with electrons to produce gamma rays, a
  form of high-frequency light. Such a process would have been
  considered impossible before Einstein, because conservation of
  mass and energy were believed to be separate principles, and
  this process eliminates 100% of the original mass. The amount
  of energy produced by annihilating 1 kg of matter with 1 kg of
  antimatter is

                  E = mc2

                     = (2kg) (3.0 x 108 m/s)2

                     =2x 1017J       ,

  which is on the same order of magnitude as a day's energy con-
  sumption for the entire world's population!
  Positron annihilation forms the basis for the medical imaging tech-
  nique called a PET (positron emission tomography) scan, in which
  a positron-emitting chemical is injected into the patient and mapped
  by the emission of gamma rays from the parts of the body where
  it accumulates.
  One commonly hears some misinterpretations of E = mc2, one
being that the equation tells us how much kinetic energy an object
would have if it was moving at the speed of light. This wouldn't
make much sense, both because the equation for kinetic energy has
1/2 in it, KE= (1/2)mv2, and because a material object can't be
made to move at the speed of light. However, this naturally leads
to the question of just how much mass-energy a moving object has.
We know that when the object is at rest, it has no kinetic energy, so
its mass-energy is simply equal to the energy-equivalent of its mass,
mc2,
                    iE=mc2 when v= 0
where the symbol E stands for mass-energy. (You can write this
symbol yourself by writing an E, and then adding an extra line to
it. Have fun!) The point of using the new symbol is simply to
remind ourselves that we're talking about relativity, so an object at
rest has E~ = mc2, not £ 0 as we'd assume in classical physics.
    Suppose we start accelerating the object with a constant force.
A constant force means a constant rate of transfer of momentum,
but p =myv approaches infinity as v approaches c, so the object
will only get closer and closer to the speed of light, but never reach
it. Now what about the work being done by the force? The force


34             Chapter 1   Relativity


﻿


keeps doing work and doing work, which means that we keep on
using up energy. Mass-energy is conserved, so the energy being
expended must equal the increase in the object's mass-energy. We
can continue this process for as long as we like, and the amount of
mass-energy will increase without limit. We therefore conclude that
an object's mass-energy approaches infinity as its speed approaches
the speed of light,

                    E - oo when v - c

   Now that we have some idea what to expect, what is the actual
equation for the mass-energy? As proved in my book Simple Nature,
it is
                         IE = m'yc2

  self-check C
  Verify that this equation has the two properties we wanted.
  Answer, p. 133
  KE compared to ic2 at low speeds                    example 5
  > An object is moving at ordinary nonrelativistic speeds. Compare
  its kinetic energy to the energy mc2 it has purely because of its
  mass.
  > The speed of light is a very big number, so mc2 is a huge num-
  ber of joules. The object has a gigantic amount of energy be-
  cause of its mass, and only a relatively small amount of additional
  kinetic energy because of its motion.
  Another way of seeing this is that at low speeds, -y is only a tiny
  bit greater than 1, so E is only a tiny bit greater than mc2.
  The correspondence principle for mass-energy  example 6
  > Show that the equation E = myc2 obeys the correspondence
  principle.
  > As we accelerate an object from rest, its mass-energy becomes
  greater than its resting value. Classically, we interpret this excess
  mass-energy as the object's kinetic energy,

                     KE = E(v) -1E(v = 0)
                         = my c2 - mc2
                         =m(-        - - 1)c2


                              v2


Section 1.3   Dynamics             35


﻿


which is the classical expression. As demanded by the corre-
spondence principle, relativity agrees with classical physics at
speeds that are small compared to the speed of light.


36             Chapter 1   Relativity


﻿


Summary
Notation
7 . . ...       ..an abbreviation for 1/      1 - v2/c2
E . . . . . . . . . . mass-energy
Summary
    The principle of relativity states that experiments don't come
out different due to the straight-line, constant-speed motion of the
apparatus. Unlike his predecessors going back to Galileo and New-
ton, Einstein claimed that this principle applied not just to matter
but to light as well. This implies that the speed of light is the same,
regardless of the motion of the apparatus used to measure it. This
seems impossible, because we expect velocities to add in relative
motion; the strange constancy of the speed of light was, however,
observed experimentally in the 1887 Michelson-Morley experiment.
    Based only on this principle of relativity, Einstein showed that
time and space as seen by one observer would be distorted compared
to another observer's perceptions if they were moving relative to
each other. This distortion is quantified by the factor

                                1


where v is the relative velocity of the two observers. A clock ap-
pears to run fastest to an observer who is not in motion relative to
it, and appears to run too slowly by a factor of - to an observer who
has a velocity v relative to the clock. Similarly, a meter-stick ap-
pears longest to an observer who sees it at rest, and appears shorter
to other observers. Time and space are relative, not absolute. In
particular, there is no well-defined concept of simultaneity.
    All of these strange effects, however, are very small when the rel-
ative velocities are small relative to the speed of light. This makes
sense, because Newton's laws have already been thoroughly tested
by experiments at such speeds, so a new theory like relativity must
agree with the old one in their realm of common applicability. This
requirement of backwards-compatibility is known as the correspon-
dence principle.
    Relativity has implications not just for time and space but also
for the objects that inhabit time and space. The correct relativistic
equation for momentum is


which is similar to the classical p =my at low velocities, where
   7~1, but diverges from it more and more at velocities that ap-
proach the speed of light. Since 7 becomes infinite at v =c, an
infinite force would be required in order to give a material object


Summary               37


﻿


enough momentum to move at the speed of light. In other words,
material objects can only move at speeds lower than the speed of
light. Relativistically, mass and energy are not separately conserved.
Mass and energy are two aspects of the same phenomenon, known
as mass-energy, and they can be converted to one another according
to the equation
                          E=mc2

The mass-energy of a moving object is E = m-yc2. When an object is
at rest, -y= 1, and the mass-energy is simply the energy-equivalent
of its mass, mc2. When an object is in motion, the excess mass-
energy, in addition to the mc2, can be interpreted as its kinetic
energy.
Exploring Further
Relativity Simply Explained, Martin Gardner.     A beatifully
clear, nonmathematical introduction to the subject, with entertain-
ing illustrations. A postscript, written in 1996, follows up on recent
developments in some of the more speculative ideas from the 1967
edition.
Was Einstein Right?        Putting General Relativity to the
Test, Clifford M. Will. This book makes it clear that general
relativity is neither a fantasy nor holy scripture, but a scientific
theory like any other.


38             Chapter 1   Relativity


﻿


Problems


Key
V A computerized answer check is available online.
f A problem that requires calculus.
*    A difficult problem.
1     Astronauts in three different spaceships are communicating
with each other. Those aboard ships A and B agree on the rate at
which time is passing, but they disagree with the ones on ship C.
(a) Describe the motion of the other two ships according to Alice,
who is aboard ship A.
(b) Give the description according to Betty, whose frame of reference
is ship B.
(c) Do the same for Cathy, aboard ship C.
2      (a) Figure i on page 19 is based on a light clock moving at a
certain speed, v. By measuring with a ruler on the figure, determine
v/c.
(b) By similar measurements, find the time contraction factor 7,
which equals T/t.
(c) Locate your numbers from parts a and b as a point on the graph
in figure j on page 20, and check that it actually lies on the curve.
Make a sketch showing where the point is on the curve.  V
3      This problem is a continuation of problem 2. Now imagine
that the spaceship speeds up to twice the velocity. Draw a new
triangle on the same scale, using a ruler to make the lengths of the
sides accurate. Repeat parts b and c for this new diagram. V
4    What happens in the equation for y when you put in a negative
number for v? Explain what this means physically, and why it makes
sense.
5     (a) By measuring with a ruler on the graph in figure o on page
24, estimate the y values of the two supernovae.
(b) Figure o gives the values of v/c. From these, compute y values
and compare with the results from part a.
(c) Locate these two points on the graph in figure j, and make a
sketch showing where they lie.
6     The Voyager 1 space probe, launched in 1977, is moving faster
relative to the earth than any other human-made object, at 17,000
meters per second.
(a) Calculate the probe's y.
(b) Over the course of one year on earth, slightly less than one year
passes on the probe. How much less? (There are 31 million seconds
in a year.)V

7 (a) A free neutron (as opposed to a neutron bound into an
atomic nucleus) is unstable, and undergoes beta decay (which you
may want to review). The masses of the particles involved are as


Problems              39


﻿


follows:
      neutron        1.67495 x 10-27 kg
      proton         1.67265 x 10-27 kg
      electron       0.00091 x 10-27 kg
      antineutrino <i10-35 kg
Find the energy released in the decay of a free neutron.   ~
(b) Neutrons and protons make up essentially all of the mass of the
ordinary matter around us. We observe that the universe around us
has no free neutrons, but lots of free protons (the nuclei of hydrogen,
which is the element that 90% of the universe is made of). We find
neutrons only inside nuclei along with other neutrons and protons,
not on their own.

If there are processes that can convert neutrons into protons, we
might imagine that there could also be proton-to-neutron conver-
sions, and indeed such a process does occur sometimes in nuclei
that contain both neutrons and protons: a proton can decay into a
neutron, a positron, and a neutrino. A positron is a particle with
the same properties as an electron, except that its electrical charge
is positive (see chapter 7). A neutrino, like an antineutrino, has
negligible mass.
Although such a process can occur within a nucleus, explain why
it cannot happen to a free proton. (If it could, hydrogen would be
radioactive, and you wouldn't exist!)
8      (a) Find a relativistic equation for the velocity of an object
in terms of its mass and momentum (eliminating y).         V
(b) Show that your result is approximately the same as the classical
value, p/m, at low velocities.
(c) Show that very large momenta result in speeds close to the speed
of light.                                                        *
9       (a) Show that for v= (3/5)c, y comes out to be a simple
fraction.
(b) Find another value of v for which y is a simple fraction.
10      An object moving at a speed very close to the speed of light
is referred to as ultrarelativistic. Ordinarily (luckily) the only ul-
trarelativistic objects in our universe are subatomic particles, such
as cosmic rays or particles that have been accelerated in a particle
accelerator.
(a) What kind of number is y for an ultrarelativistic particle?
(b) Repeat example 5 on page 35, but instead of very low, nonrela-
tivistic speeds, consider ultrarelativistic speeds.
(c) Find an equation for the ratio IE/p. The speed may be relativis-
tic, but don't assume that it's ultrarelativistic.V
(d) Simplify your answer to part c for the case where the speed is
ultrarelativistic.V
(e) We can think of a beam of light as an ultrarelativistic object-


40             Chapter 1    Relativity


﻿


it certainly moves at a speed that's sufficiently close to the speed
of light! Suppose you turn on a one-watt flashlight, leave it on for
one second, and then turn it off. Compute the momentum of the
recoiling flashlight, in units of kg-m/s.
(f) Discuss how your answer in part e relates to the correspondence
principle.
11     As discussed in book 3 of this series, the speed at which a dis-
turbance travels along a string under tension is given by v = T/P,
where pu is the mass per unit length, and T is the tension.
(a) Suppose a string has a density p, and a cross-sectional area A.
Find an expression for the maximum tension that could possibly
exist in the string without producing v > c, which is impossible
according to relativity. Express your answer in terms of p, A, and
c. The interpretation is that relativity puts a limit on how strong
any material can be.
(b) Every substance has a tensile strength, defined as the force
per unit area required to break it by pulling it apart. The ten-
sile strength is measured in units of N/m2, which is the same as the
pascal (Pa), the mks unit of pressure. Make a numerical estimate
of the maximum tensile strength allowed by relativity in the case
where the rope is made out of ordinary matter, with a density on
the same order of magnitude as that of water. (For comparison,
kevlar has a tensile strength of about 4 x 109 Pa, and there is spec-
ulation that fibers made from carbon nanotubes could have values
as high as 6 x 1010 Pa.)
(c) A black hole is a star that has collapsed and become very dense,
so that its gravity is too strong for anything ever to escape from it.
For instance, the escape velocity from a black hole is greater than
c, so a projectile can't be shot out of it. Many people, when they
hear this description of a black hole in terms of an escape velocity
greater than c, wonder why it still wouldn't be possible to extract
an object from a black hole by other means than launching it out
as a projectile. For example, suppose we lower an astronaut into a
black hole on a rope, and then pull him back out again. Why might
this not work?


12      The earth is orbiting the sun, and therefore is contracted
relativistically in the direction of its motion. Compute the amount
by which its diameter shrinks in this direction.


Problems              41


﻿


42             Chapter 1    Relativity


﻿


The continental U.S. got its first taste of volcanism in recent memory with
the eruption of Mount St. Helens in 1980.


Chapter 2


Rules of Randomness


     Given for one instant an intelligence which could comprehend
     all the forces by which nature is animated and the respective
     positions of the things which compose it...nothing would be
     uncertain, and the future as the past would be laid out before
     its eyes.                 Pierre Simon de Laplace, 1776
     The Quantum Mechanics is very imposing. But an inner voice
     tells me that it is still not the final truth. The theory yields
     much, but it hardly brings us nearer to the secret of the Old


43


﻿


      One. In any case, I am convinced that He does not play dice.
      Albert Einstein
    However radical Newton's clockwork universe seemed to his con-
temporaries, by the early twentieth century it had become a sort of
smugly accepted dogma. Luckily for us, this deterministic picture of
the universe breaks down at the atomic level. The clearest demon-
stration that the laws of physics contain elements of randomness
is in the behavior of radioactive atoms. Pick two identical atoms
of a radioactive isotope, say the naturally occurring uranium 238,
and watch them carefully. They will decay at different times, even
though there was no difference in their initial behavior.
    We would be in big trouble if these atoms' behavior was as pre-
dictable as expected in the Newtonian world-view, because radioac-
tivity is an important source of heat for our planet. In reality, each
atom chooses a random moment at which to release its energy, re-
sulting in a nice steady heating effect. The earth would be a much
colder planet if only sunlight heated it and not radioactivity. Prob-
ably there would be no volcanoes, and the oceans would never have
been liquid. The deep-sea geothermal vents in which life first evolved
would never have existed. But there would be an even worse conse-
quence if radioactivity was deterministic: after a few billion years of
peace, all the uranium 238 atoms in our planet would presumably
pick the same moment to decay. The huge amount of stored nuclear
energy, instead of being spread out over eons, would all be released
at one instant, blowing our whole planet to Kingdom Come.1
    The new version of physics, incorporating certain kinds of ran-
domness, is called quantum physics (for reasons that will become
clear later). It represented such a dramatic break with the pre-
vious, deterministic tradition that everything that came before is
considered "classical," even the theory of relativity. The remainder
of this book is a basic introduction to quantum physics.
Discussion Question
A     I said "Pick two identical atoms of a radioactive isotope." Are two
atoms really identical? If their electrons are orbiting the nucleus, can we
distinguish each atom by the particular arrangement of its electrons at
some instant in time?


   1This is under the assumption that all the radioactive heating comes from
uranium atoms, and that all the atoms were created at the same time. In
reality, both uranium and thorium atoms contribute, and they may not have all
been created at the same time. We have only a general idea of the processes
that created these heavy elements in the gas cloud from which our solar system
condensed. Some portion of them may have come from nuclear reactions in
supernova explosions in that particular nebula, but some may have come from
previous supernova explosions throughout our galaxy, or from exotic events like
collisions of white dwarf stars.


44


Chapter 2  Rules of Randomness


﻿


Randomness Isn't Random


Einstein's distaste for randomness, and his association of determin-
ism with divinity, goes back to the Enlightenment conception of the
universe as a gigantic piece of clockwork that only had to be set
in motion initially by the Builder. Many of the founders of quan-
tum mechanics were interested in possible links between physics and
Eastern and Western religious and philosophical thought, but every
educated person has a different concept of religion and philosophy.
Bertrand Russell remarked, "Sir Arthur Eddington deduces religion
from the fact that atoms do not obey the laws of mathematics. Sir
James Jeans deduces it from the fact that they do."
    Russell's witticism, which implies incorrectly that mathematics
cannot describe randomness, reminds us how important it is not
to oversimplify this question of randomness. You should not sim-
ply surmise, "Well, it's all random, anything can happen." For
one thing, certain things simply cannot happen, either in classical
physics or quantum physics. The conservation laws of mass, energy,
momentum, and angular momentum are still valid, so for instance
processes that create energy out of nothing are not just unlikely
according to quantum physics, they are impossible.
    A useful analogy can be made with the role of randomness in
evolution. Darwin was not the first biologist to suggest that species
changed over long periods of time. His two new fundamental ideas
were that (1) the changes arose through random genetic variation,
and (2) changes that enhanced the organism's ability to survive and
reproduce would be preserved, while maladaptive changes would be
eliminated by natural selection. Doubters of evolution often consider
only the first point, about the randomness of natural variation, but
not the second point, about the systematic action of natural selec-
tion. They make statements such as, "the development of a complex
organism like Homo sapiens via random chance would be like a whirl-
wind blowing through a junkyard and spontaneously assembling a
jumbo jet out of the scrap metal." The flaw in this type of reason-
ing is that it ignores the deterministic constraints on the results of
random processes. For an atom to violate conservation of energy is
no more likely than the conquest of the world by chimpanzees next
year.
Discussion Question
A Economists often behave like wannabe physicists, probably because
it seems prestigious to make numerical calculations instead of talking
about human relationships and organizations like other social scientists.
Their striving to make economics work like Newtonian physics extends
to a parallel use of mechanical metaphors, as in the concept of a mar-
ket's supply and demand acting like a self-adjusting machine, and the
idealization of people as economic automatons who consistently strive to
maximize their own wealth. What evidence is there for randomness rather
than mechanical determinism in economics?


Section 2.1   Randomness Isn't Random


45


﻿


N


N


N11


a / The  probability that one
wheel will give a cherry is 1/10.
The probability that all three
wheels will give cherries is
1/10 x 1/10 x 1/10.


2.2 Calculating Randomness
You should also realize that even if something is random, we can still
understand it, and we can still calculate probabilities numerically.
In other words, physicists are good bookmakers. A good bookie can
calculate the odds that a horse will win a race much more accurately
that an inexperienced one, but nevertheless cannot predict what will
happen in any particular race.

Statistical independence
    As an illustration of a general technique for calculating odds,
suppose you are playing a 25-cent slot machine. Each of the three
wheels has one chance in ten of coming up with a cherry. If all
three wheels come up cherries, you win $100. Even though the
results of any particular trial are random, you can make certain
quantitative predictions. First, you can calculate that your odds
of winning on any given trial are 1/10 x 1/10 x 1/10 = 1/1000
0.001. Here, I am representing the probabilities as numbers from
0 to 1, which is clearer than statements like "The odds are 999 to
1," and makes the calculations easier. A probability of 0 represents
something impossible, and a probability of 1 represents something
that will definitely happen.
    Also, you can say that any given trial is equally likely to result in
a win, and it doesn't matter whether you have won or lost in prior
games. Mathematically, we say that each trial is statistically inde-
pendent, or that separate games are uncorrelated. Most gamblers
are mistakenly convinced that, to the contrary, games of chance are
correlated. If they have been playing a slot machine all day, they
are convinced that it is "getting ready to pay," and they do not
want anyone else playing the machine and "using up" the jackpot
that they "have coming." In other words, they are claiming that
a series of trials at the slot machine is negatively correlated, that
losing now makes you more likely to win later. Craps players claim
that you should go to a table where the person rolling the dice is
"hot," because she is likely to keep on rolling good numbers. Craps
players, then, believe that rolls of the dice are positively correlated,
that winning now makes you more likely to win later.
    My method of calculating the probability of winning on the slot
machine was an example of the following important rule for calcu-
lations based on independent probabilities:
      the law of independent probabilities
      If the probability of one event happening is PA, and the prob-
      ability of a second statistically independent event happening
      is PB, then the probability that they will both occur is the
      product of the probabilities, PAPB. If there are more than
      two events involved, you simply keep on multiplying.


46


Chapter 2   Rules of Randomness


﻿


    Note that this only applies to independent probabilities. For
instance, if you have a nickel and a dime in your pocket, and you
randomly pull one out, there is a probability of 0.5 that it will be
the nickel. If you then replace the coin and again pull one out
randomly, there is again a probability of 0.5 of coming up with the
nickel, because the probabilities are independent. Thus, there is a
probability of 0.25 that you will get the nickel both times.
    Suppose instead that you do not replace the first coin before
pulling out the second one. Then you are bound to pull out the
other coin the second time, and there is no way you could pull the
nickel out twice. In this situation, the two trials are not indepen-
dent, because the result of the first trial has an effect on the second
trial. The law of independent probabilities does not apply, and the
probability of getting the nickel twice is zero, not 0.25.
    Experiments have shown that in the case of radioactive decay,
the probability that any nucleus will decay during a given time in-
terval is unaffected by what is happening to the other nuclei, and
is also unrelated to how long it has gone without decaying. The
first observation makes sense, because nuclei are isolated from each
other at the centers of their respective atoms, and therefore have no
physical way of influencing each other. The second fact is also rea-
sonable, since all atoms are identical. Suppose we wanted to believe
that certain atoms were "extra tough," as demonstrated by their
history of going an unusually long time without decaying. Those
atoms would have to be different in some physical way, but nobody
has ever succeeded in detecting differences among atoms. There is
no way for an atom to be changed by the experiences it has in its
lifetime.

Addition of probabilities
    The law of independent probabilities tells us to use multiplica-
tion to calculate the probability that both A and B will happen,
assuming the probabilities are independent. What about the prob-
ability of an "or" rather than an "and?" If two events A and B
are mutually exclusive, then the probability of one or the other oc-
curring is the sum PA + PB. For instance, a bowler might have a
30% chance of getting a strike (knocking down all ten pins) and a
20% chance of knocking down nine of them. The bowler's chance of
knocking down either nine pins or ten pins is therefore 50%.
    It does not make sense to add probabilities of things that are
not mutually exclusive, i.e., that could both happen. Say I have a
90% chance of eating lunch on any given day, and a 90% chance of
eating dinner. The probability that I will eat either lunch or dinner
is not 180%.


Section 2.2   Calculating Randomness


47


﻿


b / Normalization: the proba-
bility of picking land plus the
probability of picking water adds
up to 1.


Normalization
    If I spin a globe and randomly pick a point on it, I have about a
70% chance of picking a point that's in an ocean and a 30% chance
of picking a point on land. The probability of picking either wa-
ter or land is 70% + 30% = 100%. Water and land are mutually
exclusive, and there are no other possibilities, so the probabilities
had to add up to 100%. It works the same if there are more than
two possibilities  if you can classify all possible outcomes into a
list of mutually exclusive results, then all the probabilities have to
add up to 1, or 100%. This property of probabilities is known as
normalization.

Averages
    Another way of dealing with randomness is to take averages.
The casino knows that in the long run, the number of times you win
will approximately equal the number of times you play multiplied
by the probability of winning. In the game mentioned above, where
the probability of winning is 0.001, if you spend a week playing,
and pay $2500 to play 10,000 times, you are likely to win about 10
times (10, 000 x 0.001 = 10), and collect $1000. On the average, the
casino will make a profit of $1500 from you. This is an example of
the following rule.
      rule for calculating averages
      If you conduct N identical, statistically independent trials,
      and the probability of success in each trial is P, then on the
      average, the total number of successful trials will be NP. If N
      is large enough, the relative error in this estimate will become
      small.
    The statement that the rule for calculating averages gets more
and more accurate for larger and larger N (known popularly as the
"law of averages") often provides a correspondence principle that
connects classical and quantum physics. For instance, the amount
of power produced by a nuclear power plant is not random at any
detectable level, because the number of atoms in the reactor is so
large. In general, random behavior at the atomic level tends to
average out when we consider large numbers of atoms, which is why
physics seemed deterministic before physicists learned techniques for
studying atoms individually.
   We can achieve great precision with averages in quantum physics
because we can use identical atoms to reproduce exactly the same
situation many times. If we were betting on horses or dice, we would
be much more limited in our precision. After a thousand races, the
horse would be ready to retire. After a million rolls, the dice would
be worn out.


48


Chapter 2   Rules of Randomness


﻿


  self-check A
  Which of the following things must be independent, which could be in-
  dependent, and which definitely are not independent?
  (1) the probability of successfully making two free-throws in a row in
  basketball
  (2) the probability that it will rain in London tomorrow and the probability
  that it will rain on the same day in a certain city in a distant galaxy
  (3) your probability of dying today and of dying tomorrow

                                                     > Answer, p. 133

Discussion Questions
A     Newtonian physics is an essentially perfect approximation for de-
scribing the motion of a pair of dice. If Newtonian physics is deterministic,
why do we consider the result of rolling dice to be random?  c / Why  are  dice  random?
B     Why isn't it valid to define randomness by saying that randomness
is when all the outcomes are equally likely?
C   The sequence of digits 121212121212121212 seems clearly nonran-
dom, and 41592653589793 seems random. The latter sequence, how-
ever, is the decimal form of pi, starting with the third digit. There is a story
about the Indian mathematician Ramanujan, a self-taught prodigy, that a
friend came to visit him in a cab, and remarked that the number of the
cab, 1729, seemed relatively uninteresting. Ramanujan replied that on
the contrary, it was very interesting because it was the smallest number
that could be represented in two different ways as the sum of two cubes.
The Argentine author Jorge Luis Borges wrote a short story called "The
Library of Babel," in which he imagined a library containing every book
that could possibly be written using the letters of the alphabet. It would in-
clude a book containing only the repeated letter "a;" all the ancient Greek
tragedies known today, all the lost Greek tragedies, and millions of Greek
tragedies that were never actually written; your own life story, and various
incorrect versions of your own life story; and countless anthologies con-
taining a short story called "The Library of Babel." Of course, if you picked
a book from the shelves of the library, it would almost certainly look like a
nonsensical sequence of letters and punctuation, but it's always possible
that the seemingly meaningless book would be a science-fiction screen-
play written in the language of a Neanderthal tribe, or the lyrics to a set
of incomparably beautiful love songs written in a language that never ex-
isted. In view of these examples, what does it really mean to say that
something is random?


Section 2.2


Calculating Randomness      49


﻿
1/6


.0
0
0Q


0


1   2   3  4   5   6
      result


d / Probability distribution for
the result of rolling a single die.


  0.2


  0.1
20
0
0Q


      2 3


e / Rolling
them up.


4 5 6 7
   result


8 9 101112


two dice and adding


2.3 Probability Distributions
    So far we've discussed random processes having only two possible
outcomes: yes or no, win or lose, on or off. More generally, a random
process could have a result that is a number. Some processes yield
integers, as when you roll a die and get a result from one to six, but
some are not restricted to whole numbers, for example the number
of seconds that a uranium-238 atom will exist before undergoing
radioactive decay.

    Consider a throw of a die. If the die is "honest," then we expect
all six values to be equally likely. Since all six probabilities must
add up to 1, then probability of any particular value coming up
must be 1/6. We can summarize this in a graph, d. Areas under
the curve can be interpreted as total probabilities. For instance,
the area under the curve from 1 to 3 is 1/6 + 1/6 + 1/6 = 1/2, so
the probability of getting a result from 1 to 3 is 1/2. The function
shown on the graph is called the probability distribution.
    Figure e shows the probabilities of various results obtained by
rolling two dice and adding them together, as in the game of craps.
The probabilities are not all the same. There is a small probability
of getting a two, for example, because there is only one way to do
it, by rolling a one and then another one. The probability of rolling
a seven is high because there are six different ways to do it: 1+6,
2+5, etc.

    If the number of possible outcomes is large but finite, for example
the number of hairs on a dog, the graph would start to look like a
smooth curve rather than a ziggurat.

   What about probability distributions for random numbers that
are not integers? We can no longer make a graph with probabil-
ity on the y axis, because the probability of getting a given exact
number is typically zero. For instance, there is zero probability that
a radioactive atom will last for exactly 3 seconds, since there are
infinitely many possible results that are close to 3 but not exactly
three, for example 2.999999999999999996876876587658465436. It
doesn't usually make sense, therefore, to talk about the probability
of a single numerical result, but it does make sense to talk about
the probability of a certain range of results. For instance, the prob-
ability that an atom will last more than 3 and less than 4 seconds is
a perfectly reasonable thing to discuss. We can still summarize the
probability information on a graph, and we can still interpret areas
under the curve as probabilities.
    But the y axis can no longer be a unitless probability scale. In
radioactive decay, for example, we want the x axis to have units of
time, and we want areas under the curve to be unitless probabilities.


E 0.03

  0.02

  S0.01
0)
.oi
6 0.00
2.


120 14


h 160 180 200
height (cm)


f / A  probability  distribution
for height of human adults. (Not
real data.)


50


Chapter 2  Rules of Randomness


﻿


The area of a single square on the graph paper is then

  (unitless area of a square)
      (width of square with time units) x (height of square)

If the units are to cancel out, then the height of the square must
evidently be a quantity with units of inverse time. In other words,
the y axis of the graph is to be interpreted as probability per unit
time, not probability.
   Figure f shows another example, a probability distribution for
people's height. This kind of bell-shaped curve is quite common.
  self-check B
  Compare the number of people with heights in the range of 130-135 cm
  to the number in the range 135-140.        D Answer, p. 133
  'Looking for tall basketball players                example 1
  > A certain country with a large population wants to find very tall
  people to be on its Olympic basketball team and strike a blow
  against western imperialism. Out of a pool of 108 people who are
  the right age and gender, how many are they likely to find who are
  over 225 cm (7 feet 4 inches) in height? Figure g gives a close-up
  of the "tail" of the distribution shown previously in figure f.
  > The shaded area under the curve represents the probability that
  a given person is tall enough. Each rectangle represents a prob-
  ability of 0.2 x 10-7 cm-1 x 1 cm = 2 x 10-8. There are about 35
  rectangles covered by the shaded area, so the probability of hav-
  ing a height greater than 225 cm is 7 x 10-7 , or just under one in
  a million. Using the rule for calculating averages, the average, or
  expected number of people this tall is (108) x (7 x 10-7) = 70.

Average and width of a probability distribution
   If the next Martian you meet asks you, "How tall is an adult hu-
man?," you will probably reply with a statement about the average
human height, such as "Oh, about 5 feet 6 inches." If you wanted
to explain a little more, you could say, "But that's only an average.
Most people are somewhere between 5 feet and 6 feet tall." Without
bothering to draw the relevant bell curve for your new extraterres-
trial acquaintance, you've summarized the relevant information by
giving an average and a typical range of variation.
   The average of a probability distribution can be defined geo-
metrically as the horizontal position at which it could be balanced
if it was constructed out of cardboard, h. A convenient numerical
measure of the amount of variation about the average, or amount of
uncertainty, is the full width at half maximum, or FWHM, defined
in figure i. (The FWHM was introduced in chapter 2 of Vibrations
and Waves.)


I 3x10 -7
E
U
A 2x1 -7
a-
c lx10 -7 -
CL
_ 0x10 -7
CL-    225


   230
height (cm)


235


g / A close-up of the  right-
hand tail of the distribution shown
in the figure f.


h /The average
bility distribution.


of a proba-


full width at half
maximum (FWHM)


i/ The full width at half maxi-
mum (FWHM) of a probability
distribution.


Section 2.3   Probability Distributions


51


﻿


    A great deal more could be said about this topic, and indeed
an introductory statistics course could spend months on ways of
defining the center and width of a distribution. Rather than force-
feeding you on mathematical detail or techniques for calculating
these things, it is perhaps more relevant to point out simply that
there are various ways of defining them, and to inoculate you against
the misuse of certain definitions.
    The average is not the only possible way to say what is a typical
value for a quantity that can vary randomly; another possible defi-
nition is the median, defined as the value that is exceeded with 50%
probability. When discussing incomes of people living in a certain
town, the average could be very misleading, since it can be affected
massively if a single resident of the town is Bill Gates. Nor is the
FWHM the only possible way of stating the amount of random vari-
ation; another possible way of measuring it is the standard deviation
(defined as the square root of the average squared deviation from
the average value).


2.4 Exponential Decay and Half-Life
Most people know that radioactivity "lasts a certain amount of
time," but that simple statement leaves out a lot. As an example,
consider the following medical procedure used to diagnose thyroid
function. A very small quantity of the isotope 131I, produced in a
nuclear reactor, is fed to or injected into the patient. The body's
biochemical systems treat this artificial, radioactive isotope exactly
the same as 127I, which is the only naturally occurring type. (Nu-
tritionally, iodine is a necessary trace element. Iodine taken into
the body is partly excreted, but the rest becomes concentrated in
the thyroid gland. Iodized salt has had iodine added to it to pre-
vent the nutritional deficiency known as goiters, in which the iodine-
starved thyroid becomes swollen.) As the 131I undergoes beta decay,
it emits electrons, neutrinos, and gamma rays. The gamma rays can
be measured by a detector passed over the patient's body. As the
radioactive iodine becomes concentrated in the thyroid, the amount
of gamma radiation coming from the thyroid becomes greater, and
that emitted by the rest of the body is reduced. The rate at which
the iodine concentrates in the thyroid tells the doctor about the
health of the thyroid.

    If you ever undergo this procedure, someone will presumably
explain a little about radioactivity to you, to allay your fears that
you will turn into the Incredible Hulk, or that your next child will
have an unusual number of limbs. Since iodine stays in your thyroid
for a long time once it gets there, one thing you'll want to know is
whether your thyroid is going to become radioactive forever. They
may just tell you that the radioactivity "only lasts a certain amount
of time," but we can now carry out a quantitative derivation of how


52


Chapter 2  Rules of Randomness


﻿


the radioactivity really will die out.
    Let PsurV(t) be the probability that an iodine atom will survive
without decaying for a period of at least t. It has been experimen-
tally measured that half all 131I atoms decay in 8 hours, so we have

                      Psurv(8 hr) = 0.5

    Now using the law of independent probabilities, the probability
of surviving for 16 hours equals the probability of surviving for the
first 8 hours multiplied by the probability of surviving for the second
8 hours,

                    Psurv(16 hr) = 0.50 x 0.50
                                 = 0.25
Similarly we have

                  Psurv(24 hr) = 0.50 x 0.5 x 0.5
                               = 0.125
Generalizing from this pattern, the probability of surviving for any
time t that is a multiple of 8 hours is

                     Psurv(t) = 0.5t/8 hr
We now know how to find the probability of survival at intervals
of 8 hours, but what about the points in time in between? What
would be the probability of surviving for 4 hours? Well, using the
law of independent probabilities again, we have

           Psurv(8 hr) = Psurv(4 hr) x Psurv(4 hr)
which can be rearranged to give

                    Psurv(4 hr) =   Psurv(8 hr)
                               = v0.5
                               = 0.707
This is exactly what we would have found simply by plugging in
Pouro(t) = 0.5t/8 hr and ignoring the restriction to multiples of 8
hours. Since 8 hours is the amount of time required for half of the
atoms to decay, it is known as the half-life, written t1/2. The general
rule is as follows:
      exponential decay equation
                        Psurel(t) = 0.5t/t,2

    Using the rule for calculating averages, we can also find the num-
ber of atoms, N(t), remaining in a sample at time t:

                      N(t) =N(0) x0.I*/
Both of these equations have graphs that look like dying-out expo-
nentials, as in the example below.


Section 2.4   Exponential Decay and Half-Life


53


﻿


           1.5
                              tree rings
           1.4-
           1.3
           1.2
           1.1
           1.0-

           0.9         Pompeii
ratio of01 B40.8
    to12   0.
    (x1012) 0.6            Tayinat          Egyptian artifacts

           0.5
           0.4
           0.3
           0.2
           0.1
           0 .0i                  i         i        i         i
               0       1000      2000     3000     4000      5000
                                   age (years)

j / Calibration of the 14C dating method using tree rings and arti-
facts whose ages were known from other methods. Redrawn from Emilio
Segre, Nuclei and Particles, 1965.


  14 Dating                                            example 2
  Almost all the carbon on Earth is 12C, but not quite. The isotope
  14C, with a half-life of 5600 years, is produced by cosmic rays in
  the atmosphere. It decays naturally, but is replenished at such a
  rate that the fraction of 14C in the atmosphere remains constant,
  at 1.3 x 10-12 . Living plants and animals take in both 12C and
  14C from the atmosphere and incorporate both into their bodies.
  Once the living organism dies, it no longer takes in C atoms from
  the atmosphere, and the proportion of 14C gradually falls off as
  it undergoes radioactive decay. This effect can be used to find
  the age of dead organisms, or human artifacts made from plants
  or animals. Figure j shows the exponential decay curve of 14C
  in various objects. Similar methods, using longer-lived isotopes,
  prove the earth was billions of years old, not a few thousand as
  some had claimed on religious grounds.


54


Chapter 2  Rules of Randomness


﻿


  Radioactive contamination at Chernobyl         example 3
  > One of the most dangerous radioactive isotopes released by the
  Chernobyl disaster in 1986 was 90Sr, whose half-life is 28 years.
  (a) How long will it be before the contamination is reduced to one
  tenth of its original level? (b) If a total of 1027 atoms was released,
  about how long would it be before not a single atom was left?
    (a) We want to know the amount of time that a 90Sr nucleus
  has a probability of 0.1 of surviving. Starting with the exponential
  decay formula,
                      Psurv = 0.5t/t /2
  we want to solve for t. Taking natural logarithms of both sides,

                              t
                      InP=      In0.5
                            t1 /2

  so
                          t=       InP
                             In 0.5
  Plugging in P = 0.1 and t1/2 = 28 years, we get t = 93 years.
  (b) This is just like the first part, but P = 10-27 . The result is
  about 2500 years.

Rate of decay
   If you want to find how many radioactive decays occur within a
time interval lasting from time t to time t + At, the most straight-
forward approach is to calculate it like this:

           (number of decays between t and t + At)
                      = N(t) - N(t + At)
                      = N(0) [Psurv(t) - Psurv(t + At)]
                      = N(0) [05L/il/2 - 0.5(t+t)/t1/2

                      = N(0) 1 - 0.5At/t1/2 0.5/1/2

A problem arises when At is small compared to t1/2. For instance,
suppose you have a hunk of 1022 atoms of 235U, with a half-life of
700 million years, which is 2.2 x 1016 s. You want to know how many
decays will occur in At =1 s. Since we're specifying the current
number of atoms, t =0. As you plug in to the formula above on
your calculator, the quantity 0.5At/t1/2 comes out on your calculator
to equal one, so the final result is zero. That's incorrect, though.
In reality, 0.5At/t1/2 should equal 0.999999999999999968, but your
calculator only gives eight digits of precision, so it rounded it off to
one. In other words, the probability that a 235U atom will survive
for 1 s is very close to one, but not equal to one. The number of
decays in one second is therefore 3.2 x 105, not zero.


Section 2.4   Exponential Decay and Half-Life


55


﻿


   Well, my calculator only does eight digits of precision, just like
yours, so how did I know the right answer? The way to do it is to
use the following approximation:

                    ab~ 1+ b Ina, ifb< 1

(The symbol < means "is much less than.") Using it, we can find
the following approximation:

        (number of decays between t and t + At)

                    = N(O) 1 - 0.5At/t1/2 0.5t/t1/2

                        ti            at
                      N(0) 1 -  1 +    ln 0.5  0.5/t 1/2
                                     tl/2

                      (ln 2)N(0)0.5/1/2At
                        ti/
                                       tl/2
This also gives us a way to calculate the rate of decay, i.e., the
number of decays per unit time. Dividing by At on both sides, we
have

  (decays per unit time)
                       (ln 2)N(0)
                                 0.5/f1/2 , if At< t1/2 -


  The hot potato                                      example 4
  > A nuclear physicist with a demented sense of humor tosses
  you a cigar box, yelling "hot potato." The label on the box says
  "contains 1020 atoms of 17F, half-life of 66 s, produced today in
  our reactor at 1 p.m." It takes you two seconds to read the label,
  after which you toss it behind some lead bricks and run away. The
  time is 1:40 p.m. Will you die?
  > The time elapsed since the radioactive fluorine was produced
  in the reactor was 40 minutes, or 2400 s. The number of elapsed
  half-lives is therefore t/t1/2 = 36. The initial number of atoms
  was N(0) = 1020 . The number of decays per second is now
  about 107 s-1, so it produced about 2 x 107 high-energy electrons
  while you held it in your hands. Although twenty million electrons
  sounds like a lot, it is not really enough to be dangerous.
  By the way, none of the equations we've derived so far was the
actual probability distribution for the time at which a particular
radioactive atom will decay. That probability distribution would be
found by substituting N(O) =1 into the equation for the rate of
decay.
    If the sheer number of equations is starting to seem formidable,
let's pause and think for a second. The simple equation for PZrV is
something you can derive easily from the law of independent prob-
abilities any time you need it. From that, you can quickly find the


56


Chapter 2  Rules of Randomness


﻿


exact equation for the rate of decay. The derivation of the approx-
imate equations for At < t is a little hairier, but note that except
for the factors of in 2, everything in these equations can be found
simply from considerations of logic and units. For instance, a longer
half-life will obviously lead to a slower rate of decays, so it makes
sense that we divide by it. As for the ln 2 factors, they are exactly
the kind of thing that one looks up in a book when one needs to
know them.
Discussion Questions
A     In the medical procedure involving 1311, why is it the gamma rays
that are detected, not the electrons or neutrinos that are also emitted?
B      For 1 s, Fred holds in his hands 1 kg of radioactive stuff with a
half-life of 1000 years. Ginger holds 1 kg of a different substance, with a
half-life of 1 min, for the same amount of time. Did they place themselves
in equal danger, or not?
C     How would you interpret it if you calculated N(t), and found it was
less than one?
D    Does the half-life depend on how much of the substance you have?
Does the expected time until the sample decays completely depend on
how much of the substance you have?


2.5 f Applications of Calculus
The area under the probability distribution is of course an integral.
If we call the random number x and the probability distribution
D(x), then the probability that x lies in a certain range is given by


                                         lb
           (probability of a < x < b) = D (x) dx


What about averages? If x had a finite number of equally probable
values, we would simply add them up and divide by how many we
had. If they weren't equally likely, we'd make the weighted average
xP1 + x2P2+... But we need to generalize this to a variable x that
can take on any of a continuum of values. The continuous version
of a sum is an integral, so the average is


              (average value of x) f xD(x)dzx


where the integral is over all possible values of x.


Section 2.5   f Applications of Calculus


57


﻿


Probability distribution for radioactive decay  example 5
Here is a rigorous justification for the statement in section 2.4
that the probability distribution for radioactive decay is found by
substituting N(0) = 1 into the equation for the rate of decay. We
know that the probability distribution must be of the form

                     D(t) = kO.5t/t /2

where k is a constant that we need to determine. The atom is
guaranteed to decay eventually, so normalization gives us

          (probability of 0 < t < oc) = 1

                                   =jD(t)dt

The integral is most easily evaluated by converting the function
into an exponential with e as the base

                 D(t) = k exp [In (0.5t/t, 2

                      = kexp     1In 0.5
                             pt1 /2

                      = k exp(-  t)
                                 t1 /2

which gives an integral of the familiar form f ecxdx = (1/c)ecx.
We thus have

                      ktl/2       In 2
                1 = -      exp   -    t      ,
                      In 2         t1 /2

which gives the desired result:

                             In2
                         k =.
                             t1 /2


Average lifetime                                     example 6
You might think that the half-life would also be the average life-
time of an atom, since half the atoms' lives are shorter and half
longer. But the half whose lives are longer include some that sur-
vive for many half-lives, and these rare long-lived atoms skew the
average. We can calculate the average lifetime as follows:

               (average lifetime) = jt D(t) dt

Using the convenient base-e form again, we have

      (average lifetime) =  j t exp (Wt7Jdt   .


58


Chapter 2  Rules of Randomness


﻿


This integral is of a form that can either be attacked with in-
tegration by parts or by looking it up in a table. The result is
f xecxdx = X ecx - ecx, and the first term can be ignored for our
purposes because it equals zero at both limits of integration. We
end up with


              (average lifetime) -n2 (1/22

                                   In2

                                = 1.443 t1/2

which is, as expected, longer than one half-life.


Section 2.5   f Applications of Calculus


59


﻿


Summary
Selected Vocabulary
probability . . . . the likelihood that something will happen, ex-
                     pressed as a number between zero and one
 normalization .     the property of probabilities that the sum of
                     the probabilities of all possible outcomes must
                     equal one
 independence . . the lack of any relationship between two ran-
                     dom events
 probability distri- a curve that specifies the probabilities of var-
 bution   . . . . . . ious random values of a variable; areas under
                     the curve correspond to probabilities
 FWHM . . . . . . the full width at half-maximum of a probabil-
                     ity distribution; a measure of the width of the
                     distribution
 half-life . . . . . . the amount of time that a radioactive atom
                     will survive with probability 1/2 without de-
                     caying
Notation
P . . . . . . . . . probability
ti/2 - - - - - - - - half-life
D . . . . . . . . . a probability distribution (used only in op-
                     tional section 2.5; not a standardized notation)
Summary
    Quantum physics differs from classical physics in many ways, the
most dramatic of which is that certain processes at the atomic level,
such as radioactive decay, are random rather than deterministic.
There is a method to the madness, however: quantum physics still
rules out any process that violates conservation laws, and it also
offers methods for calculating probabilities numerically.
    In this chapter we focused on certain generic methods of working
with probabilities, without concerning ourselves with any physical
details. Without knowing any of the details of radioactive decay,
for example, we were still able to give a fairly complete treatment
of the relevant probabilities. The most important of these generic
methods is the law of independent probabilities, which states that if
two random events are not related in any way, then the probability
that they will both occur equals the product of the two probabilities,


probability of A and B
      =         PAPB


[if A and B are independent]


The most important application is to radioactive decay. The time
that a radioactive atom has a 50% chance of surviving is called
the half-life, t1/2. The probability of surviving for two half-lives is
(1/2)(1/2) = 1/4, and so on. In general, the probability of surviving


60


Chapter 2  Rules of Randomness


﻿


a time t is given by


                      Psqjrv(t) = 0.5t/t1/2

Related quantities such as the rate of decay and probability distribu-
tion for the time of decay are given by the same type of exponential
function, but multiplied by certain constant factors.


Summary                61


﻿


Problems


Key
V    A computerized answer check is available online.
f    A problem that requires calculus.
* A difficult problem.
1     If a radioactive substance has a half-life of one year, does this
mean that it will be completely decayed after two years? Explain.


2      What is the probability of rolling a pair of dice and getting
"snake eyes," i.e., both dice come up with ones?
3     (a) Extract half-lives directly from the graphs shown in figures
m and n on page 22.
(b) Check that the ratio between these two numbers really is about
10, as calculated in the text based on relativity.
4      Use a calculator to check the approximation that

                       ab ~     b1+bln a

if b < 1, using some arbitrary numbers. Then see how good the
approximation is for values of b that are not quite as small compared
to one.
5     Make up an example of a numerical problem involving a rate of
decay where At < t12, but the exact expression for the rate of decay
on page 55 can still be evaluated on a calculator without getting
something that rounds off to zero. Check that you get approximately
the same result using both methods on page 55 to calculate the
number of decays between t and t + At. Keep plenty of significant
figures in your results, in order to show the difference between them.


6      Devise a method for testing experimentally the hypothesis
that a gambler's chance of winning at craps is independent of her
previous record of wins and losses.
7     Refer to the probability distribution for people's heights in
figure f on page 50.
(a) Show that the graph is properly normalized.
(b) Estimate the fraction of the population having heights between
140 and 150 cm.                                                 V


62


Chapter 2  Rules of Randomness


﻿


8     (a) A nuclear physicist is studying a nuclear reaction caused in
an accelerator experiment, with a beam of ions from the accelerator
striking a thin metal foil and causing nuclear reactions when a nu-
cleus from one of the beam ions happens to hit one of the nuclei in
the target. After the experiment has been running for a few hours,
a few billion radioactive atoms have been produced, embedded in
the target. She does not know what nuclei are being produced, but
she suspects they are an isotope of some heavy element such as Pb,
Bi, Fr or U. Following one such experiment, she takes the target foil
out of the accelerator, sticks it in front of a detector, measures the
activity every 5 min, and makes a graph (figure). The isotopes she
thinks may have been produced are:


activity
(millions
  of de-
cays per
second)


1.3

1.2


1.1


0.9


0.E
0.7

0.6

0.5

0-4

0.3

0.2


0.C


0 10 20 30
   time (minutes)


40


isotope
211Pb
214Pb
214Bi
223 Fr
239U


half-life (minutes)
36.1
26.8
19.7
21.8
23.5


Which one is it?
(b) Having decided that the original experimental conditions pro-
duced one specific isotope, she now tries using beams of ions travel-
ing at several different speeds, which may cause different reactions.
The following table gives the activity of the target 10, 20 and 30 min-
utes after the end of the experiment, for three different ion speeds.


first ion speed
second ion speed
third ion speed


activity
10 min
1.933
1.200
6.544


(millions of decays/s) after...
20 min 30 min
0.832     0.382
0.545     0.248
1.296     0.248


Problem 8.


Since such a large number of decays is being counted, assume that
the data are only inaccurate due to rounding off when writing down
the table. Which are consistent with the production of a single
isotope, and which imply that more than one isotope was being
created?


Problems


63


﻿


9      All helium on earth is from the decay of naturally occurring
heavy radioactive elements such as uranium. Each alpha particle
that is emitted ends up claiming two electrons, which makes it a
helium atom. If the original 238U atom is in solid rock (as opposed
to the earth's molten regions), the He atoms are unable to diffuse
out of the rock. This problem involves dating a rock using the
known decay properties of uranium 238. Suppose a geologist finds
a sample of hardened lava, melts it in a furnace, and finds that it
contains 1230 mg of uranium and 2.3 mg of helium. 238U decays by
alpha emission, with a half-life of 4.5 x 109 years. The subsequent
chain of alpha and electron (beta) decays involves much shorter half-
lives, and terminates in the stable nucleus 206Pb. (You may want to
review alpha and beta decay.) Almost all natural uranium is 238U,
and the chemical composition of this rock indicates that there were
no decay chains involved other than that of 238U.
(a) How many alphas are emitted in decay chain of a single 238U
atom?
[Hint: Use conservation of mass.]
(b) How many electrons are emitted per decay chain?
[Hint: Use conservation of charge.]
(c) How long has it been since the lava originally hardened?  ~


10     Physicists thought for a long time that bismuth-209 was the
heaviest stable isotope. (Very heavy elements decay by alpha emis-
sion because of the strong electrical repulsion of all their protons.)
However, a 2003 paper by Marcillac et al. describes an experiment
in which bismuth-209 lost its claim to fame - it actually undergoes
alpha decay with a half-life of 1.9 x 1019 years.
(a) After the alpha particle is emitted, what is the isotope left over?
(b) Compare the half-life to the age of the universe, which is about
14 billion years.
(c) A tablespoon of Pepto-Bismol contains about 4 x 1020 bismuth-
209 atoms. Once you've swallowed it, how much time will it take,
on the average, before the first atomic decay?                   V


11      A blindfolded person fires a gun at a circular target of ra-
dius b, and is allowed to continue firing until a shot actually hits it.
Any part of the target is equally likely to get hit. We measure the
random distance r from the center of the circle to where the bullet
went in.
(a) Show that the probability distribution of r must be of the form
D~r) =kr, where k is some constant. (Of course we have D(r) =0
for r > b.)
(b) Determine k by requiring D to be properly normalized. V

(c) Find the average value of r.V


64


Chapter 2  Rules of Randomness


﻿


(d) Interpreting your result from part c, how does it compare with
b/2? Does this make sense? Explain.                           f
12      We are given some atoms of a certain radioactive isotope,
with half-life t1/2. We pick one atom at random, and observe it for
one half-life, starting at time zero. If it decays during that one-half-
life period, we record the time t at which the decay occurred. If it
doesn't, we reset our clock to zero and keep trying until we get an
atom that cooperates. The final result is a time 0 < t < t1/2, with a
distribution that looks like the usual exponential decay curve, but
with its tail chopped off.
(a) Find the distribution D(t), with the proper normalization. V

(b) Find the average value of t.                                 V

(c) Interpreting your result from part b, how does it compare with
ti/2/2? Does this make sense? Explain.                        f
13      The speed, v, of an atom in an ideal gas has a probability
distribution of the form

                       D(v) = bve-V

where 0 < v < oc, c relates to the temperature, and b is determined
by normalization.
(a) Sketch the distribution.
(b) Find b in terms of c.IV

(c) Find the average speed in terms of c, eliminating b. (Don't
try to do the indefinite integral, because it can't be done in closed
form. The relevant definite integral can be found in tables or done
with computer software.)IV
                                                              I
14      Neutrinos interact so weakly with normal matter that, of
the neutrinos from the sun that enter the earth from the day side,
only about 10-10 of them fail to reemerge on the night side. From
this fact, estimate the thickness of matter, in units of light-years,
that would be required in order to block half of them. This "half-
distance" is analogous to a half-life for radioactive decay.


Problems              65


﻿


66          Chapter 2  Rules of Randomness


﻿


Nov. 1,1978


Nov. 1,1992


In recent decades, a huge hole in the ozone layer has spread out from
Antarctica.


Chapter 3


Light as a Particle


      The only thing that interferes with my learning is my educa-
      tion.
                                                Albert Einstein
Radioactivity is random, but do the laws of physics exhibit ran-
domness in other contexts besides radioactivity? Yes. Radioactive
decay was just a good playpen to get us started with concepts of
randomness, because all atoms of a given isotope are identical. By
stocking the playpen with an unlimited supply of identical atom-
toys, nature helped us to realize that their future behavior could be
different regardless of their original identicality. We are now ready
to leave the playpen, and see how randomness fits into the structure
of physics at the most fundamental level.
   The laws of physics describe light and matter, and the quantum
revolution rewrote both descriptions. Radioactivity was a good ex-
ample of matter's behaving in a way that was inconsistent with
classical physics, but if we want to get under the hood and under-
stand how nonclassical things happen, it will be easier to focus on
light rather than matter. A radioactive atom such as uranium-235
is after all an extremely complex system, consisting of 92 protons,
143 neutrons, and 92 electrons. Light, however, can be a simple sine
wave.
   However successful the classical wave theory of light had been
   allowing the creation of radio and radar, for example it still


67


﻿


                                  failed to describe many important phenomena. An example that
                                  is currently of great interest is the way the ozone layer protects us
                                  from the dangerous short-wavelength ultraviolet part of the sun's
                                  spectrum. In the classical description, light is a wave. When a wave
                                  passes into and back out of a medium, its frequency is unchanged,
                                  and although its wavelength is altered while it is in the medium,
                                  it returns to its original value when the wave reemerges. Luckily
                                  for us, this is not at all what ultraviolet light does when it passes
                                  through the ozone layer, or the layer would offer no protection at
                                  all!


                                  3.1 Evidence for Light as a Particle


a / Images made by a digital cam-
era. In each successive image,
the dim spot of light has been
made even dimmer.


                                      For a long time, physicists tried to explain away the problems
                                  with the classical theory of light as arising from an imperfect under-
                                  standing of atoms and the interaction of light with individual atoms
                                  and molecules. The ozone paradox, for example, could have been
                                  attributed to the incorrect assumption that the ozone layer was a
                                  smooth, continuous substance, when in reality it was made of indi-
                                  vidual ozone molecules. It wasn't until 1905 that Albert Einstein
                                  threw down the gauntlet, proposing that the problem had nothing to
                                  do with the details of light's interaction with atoms and everything
                                  to do with the fundamental nature of light itself.
                                      In those days the data were sketchy, the ideas vague, and the
                                  experiments difficult to interpret; it took a genius like Einstein to cut
                                  through the thicket of confusion and find a simple solution. Today,
                                  however, we can get right to the heart of the matter with a piece of
                                  ordinary consumer electronics, the digital camera. Instead of film, a
                                  digital camera has a computer chip with its surface divided up into a
                                  grid of light-sensitive squares, called "pixels." Compared to a grain
                                  of the silver compound used to make regular photographic film, a
                                  digital camera pixel is activated by an amount of light energy orders
                                  of magnitude smaller. We can learn something new about light by
                                  using a digital camera to detect smaller and smaller amounts of
                                  light, as shown in figures a/1 through a/3. Figure 1 is fake, but 2
                                  and 3 are real digital-camera images made by Prof. Lyman Page
                                  of Princeton University as a classroom demonstration. Figure 1 is


68


Chapter 3   Light as a Particle


﻿


what we would see if we used the digital camera to take a picture of
a fairly dim source of light. In figures 2 and 3, the intensity of the
light was drastically reduced by inserting semitransparent absorbers
like the tinted plastic used in sunglasses. Going from 1 to 2 to 3,
more and more light energy is being thrown away by the absorbers.
    The results are dramatically different from what we would expect
based on the wave theory of light. If light was a wave and nothing
but a wave, b, then the absorbers would simply cut down the wave's
amplitude across the whole wavefront. The digital camera's entire
chip would be illuminated uniformly, and weakening the wave with
an absorber would just mean that every pixel would take a long time
to soak up enough energy to register a signal.
    But figures a/2 and a/3 show that some pixels take strong hits
while others pick up no energy at all. Instead of the wave picture,
the image that is naturally evoked by the data is something more
like a hail of bullets from a machine gun, c. Each "bullet" of light
apparently carries only a tiny amount of energy, which is why de-
tecting them individually requires a sensitive digital camera rather
than an eye or a piece of film.
    Although Einstein was interpreting different observations, this
is the conclusion he reached in his 1905 paper: that the pure wave
theory of light is an oversimplification, and that the energy of a beam
of light comes in finite chunks rather than being spread smoothly
throughout a region of space.


b / A water
absorbed.


k


%I


wave is partially


          -  I         -

          .     I
      e I-


          -I
        -

        N I
               _'

c / A stream of bullets is par-
tially absorbed.


d / Einstein and Seurat: twins
separated at birth? Detail from
Seine Grande Jatte by Georges
Seurat, 1886.


We now think of these chunks as particles of light, and call them


Section 3.1   Evidence for Light as a Particle


69


﻿


"photons," although Einstein avoided the word "particle," and the
word "photon" was invented later. Regardless of words, the trou-
ble was that waves and particles seemed like inconsistent categories.
The reaction to Einstein's paper could be kindly described as vig-
orously skeptical. Even twenty years later, Einstein wrote, "There
are therefore now two theories of light, both indispensable, and
as one must admit today despite twenty years of tremendous effort
on the part of theoretical physicists  without any logical connec-
tion." In the remainder of this chapter we will learn how the seeming
paradox was eventually resolved.
Discussion Questions
A    Suppose someone rebuts the digital camera data in figure a, claim-
ing that the random pattern of dots occurs not because of anything fun-
damental about the nature of light but simply because the camera's pixels
are not all exactly the same - some are just more sensitive than others.
How could we test this interpretation?
B     Discuss how the correspondence principle applies to the observa-
tions and concepts discussed in this section.


3.2 How Much Light Is One Photon?

The photoelectric effect
    We have seen evidence that light energy comes in little chunks,
so the next question to be asked is naturally how much energy is
in one chunk. The most straightforward experimental avenue for
addressing this question is a phenomenon known as the photoelec-
tric effect. The photoelectric effect occurs when a photon strikes
the surface of a solid object and knocks out an electron. It occurs
continually all around you. It is happening right now at the surface
of your skin and on the paper or computer screen from which you
are reading these words. It does not ordinarily lead to any observ-
able electrical effect, however, because on the average, free electrons
are wandering back in just as frequently as they are being ejected.
(If an object did somehow lose a significant number of electrons,
its growing net positive charge would begin attracting the electrons
back more and more strongly.)
    Figure e shows a practical method for detecting the photoelec-
tric effect. Two very clean parallel metal plates (the electrodes of a
capacitor) are sealed inside a vacuum tube, and only one plate is ex-
posed to light. Because there is a good vacuum between the plates,
any ejected electron that happens to be headed in the right direc-
tion will almost certainly reach the other capacitor plate without
colliding with any air molecules.
    The illuminated (bottom) plate is left with a net positive charge,
and the unilluminated (top) plate acquires a negative charge from
the electrons deposited on it. There is thus an electric field between


light


e / Apparatus  for   observing
the photoelectric effect. A beam
of light strikes a capacitor plate
inside a vacuum tube, and elec-
trons are ejected (black arrows).


70


Chapter 3  Light as a Particle


﻿


the plates, and it is because of this field that the electrons' paths are
curved, as shown in the diagram. However, since vacuum is a good
insulator, any electrons that reach the top plate are prevented from
responding to the electrical attraction by jumping back across the
gap. Instead they are forced to make their way around the circuit,
passing through an ammeter. The ammeter measures the strength
of the photoelectric effect.

An unexpected dependence on frequency
    The photoelectric effect was discovered serendipitously by Hein-
rich Hertz in 1887, as he was experimenting with radio waves. He
was not particularly interested in the phenomenon, but he did notice
that the effect was produced strongly by ultraviolet light and more
weakly by lower frequencies. Light whose frequency was lower than a
certain critical value did not eject any electrons at all.1 This depen-
dence on frequency didn't make any sense in terms of the classical
wave theory of light. A light wave consists of electric and magnetic
fields. The stronger the fields, i.e., the greater the wave's ampli-
tude, the greater the forces that would be exerted on electrons that
found themselves bathed in the light. It should have been amplitude
(brightness) that was relevant, not frequency. The dependence on
frequency not only proves that the wave model of light needs mod-
ifying, but with the proper interpretation it allows us to determine
how much energy is in one photon, and it also leads to a connec-
tion between the wave and particle models that we need in order to
reconcile them.
    To make any progress, we need to consider the physical process
by which a photon would eject an electron from the metal electrode.
A metal contains electrons that are free to move around. Ordinarily,
in the interior of the metal, such an electron feels attractive forces
from atoms in every direction around it. The forces cancel out. But
if the electron happens to find itself at the surface of the metal,
the attraction from the interior side is not balanced out by any
attraction from outside. In popping out through the surface the
electron therefore loses some amount of energy Es, which depends
on the type of metal used.
    Suppose a photon strikes an electron, annihilating itself and giv-
ing up all its energy to the electron.2 The electron will (1) lose
kinetic energy through collisions with other electrons as it plows
through the metal on its way to the surface; (2) lose an amount of
kinetic energy equal to Es as it emerges through the surface; and
(3) lose more energy on its way across the gap between the plates,

   'In fact this was all prior to Thomson's discovery of the electron, so Hertz
would not have described the effect in terms of electrons  we are discussing
everything with the benefit of hindsight.
   2We now know that this is what always happens in the photoelectric effect,
although it had not yet been established in 1905 whether or not the photon was
completely annihilated.


f / The hamster in her hamster
ball is like an electron emerging
from the metal (tiled kitchen floor)
into the surrounding vacuum
(wood floor). The wood floor is
higher than the tiled floor, so as
she rolls up the step, the hamster
will lose a certain amount of
kinetic energy, analogous to Es.
If her kinetic energy is too small,
she won't even make it up the
step.


Section 3.2   How Much Light Is One Photon?


71


﻿


due to the electric field between the plates. Even if the electron
happens to be right at the surface of the metal when it absorbs the
photon, and even if the electric field between the plates has not yet
built up very much, Es is the bare minimum amount of energy that
the electron must receive from the photon if it is to contribute to
a measurable current. The reason for using very clean electrodes is
to minimize Es and make it have a definite value characteristic of
the metal surface, not a mixture of values due to the various types
of dirt and crud that are present in tiny amounts on all surfaces in
everyday life.
   We can now interpret the frequency dependence of the photo-
electric effect in a simple way: apparently the amount of energy
possessed by a photon is related to its frequency. A low-frequency
red or infrared photon has an energy less than Es, so a beam of
them will not produce any current. A high-frequency blue or violet
photon, on the other hand, packs enough of a punch to allow an
electron to get out of the electrode. At frequencies higher than the
minimum, the photoelectric current continues to increase with the
frequency of the light because of effects (1) and (3).

Numerical relationship between energy and frequency
    Prompted by Einstein's photon paper, Robert Millikan (whom
we encountered in book 4 of this series) figured out how to use the
photoelectric effect to probe precisely the link between frequency
and photon energy. Rather than going into the historical details of
Millikan's actual experiments (a lengthy experimental program that
occupied a large part of his professional career) we will describe a
simple version, shown in figure g, that is used sometimes in college
laboratory courses. The idea is simply to illuminate one plate of
the vacuum tube with light of a single wavelength and monitor the
voltage difference between the two plates as they charge up. Since
the resistance of a voltmeter is very high (much higher than the
resistance of an ammeter), we can assume to a good approximation
that electrons reaching the top plate are stuck there permanently,
so the voltage will keep on increasing for as long as electrons are
making it across the vacuum tube.
    At a moment when the voltage difference has a reached a value
AV, the minimum energy required by an electron to make it out of
the bottom plate and across the gap to the other plate is Es + eAV.
As AV increases, we eventually reach a point at which Es + eAV
equals the energy of one photon. No more electrons can cross the
gap, and the reading on the voltmeter stops rising. The quantity
Es+eAV now tells us the energy of one photon. If we determine this
energy for a variety of frequencies, h, we find the following simple
relationship between the energy of a photon and the frequency of
the light:

                          E= hf ,


light


study-
effect.


g /A different way of
ing  the  photoelectric


e''


Es+eAV


A


                   f
h / The quantity ES + eAV in-
dicates the energy of one photon.
It is found to be proportional to
the frequency of the light.


72


Chapter 3   Light as a Particle


﻿


where h is a constant with a numerical value of 6.63 x 10-34J  s.
Note how the equation brings the wave and particle models of light
under the same roof: the left side is the energy of one particle of
light, while the right side is the frequency of the same light, inter-
preted as a wave. The constant h is known as Planck's constant (see
historical note on page 73).
  self-check A
  How would you extract h from the graph in figure h? What if you didn't
  even know ES in advance, and could only graph eAV versus f?
  Answer, p. 133
    Since the energy of a photon is hf, a beam of light can only have
energies of hf, 2hf, 3hf, etc. Its energy is quantized  there is no
such thing as a fraction of a photon. Quantum physics gets its name
from the fact that it quantizes things like energy, momentum, and
angular momentum that had previously been thought to be smooth,
continuous and infinitely divisible.
  Historical Note
  What I'm presenting in this chapter is a simplified explanation of how
  the photon could have been discovered. The actual history is more
  complex. Max Planck (1858-1947) began the photon saga with a the-
  oretical investigation of the spectrum of light emitted by a hot, glowing
  object. He introduced quantization of the energy of light waves, in multi-
  ples of hf, purely as a mathematical trick that happened to produce the
  right results. Planck did not believe that his procedure could have any
  physical significance. In his 1905 paper Einstein took Planck's quantiza-
  tion as a description of reality, and applied it to various theoretical and
  experimental puzzles, including the photoelectric effect. Millikan then
  subjected Einstein's ideas to a series of rigorous experimental tests. Al-
  though his results matched Einstein's predictions perfectly, Millikan was
  skeptical about photons, and his papers conspicuously omit any refer-
  ence to them. Only in his autobiography did Millikan rewrite history and
  claim that he had given experimental proof for photons.
  Number of photons emitted by a llghtbulb per second example 1
  > Roughly how many photons are emitted by a 100-W lightbulb in
  1 second?
  > People tend to remember wavelengths rather than frequencies
  for visible light. The bulb emits photons with a range of frequen-
  cies and wavelengths, but let's take 600 nm as a typical wave-
  length for purposes of estimation. The energy of a single photon


Section 3.2  How Much Light Is One Photon?


73


﻿


is


                           Ephoton = hf
                                     hc

  A power of 100 W means 100 joules per second, so the number
  of photons is
                  100J      100J           20
                          -          3x10         .
                  Ephoton hc/A

  Momentum of a photon                                   example 2
  > According to the theory of relativity, the momentum of a beam
  of light is given by p = E/c (see homework problem 10 on page
  40). Apply this to find the momentum of a single photon in terms
  of its frequency, and in terms of its wavelength.
  > Combining the equations p = E/c and E = hf, we find
                                E
                            p =c
                                hf
                                c
  To reexpress this in terms of wavelength, we use c = fA:
                                   hf
                              p    f=
                                   h

  The second form turns out to be simpler.
Discussion Questions
A     The photoelectric effect only ever ejects a very tiny percentage of
the electrons available near the surface of an object. How well does this
agree with the wave model of light, and how well with the particle model?
Consider the two different distance scales involved: the wavelength of the
light, and the size of an atom, which is on the order of 10-10 or 10-9 m.
B    What is the significance of the fact that Planck's constant is numeri-
cally very small? How would our everyday experience of light be different
if it was not so small?
C    How would the experiments described above be affected if a single
electron was likely to get hit by more than one photon?
D     Draw some representative trajectories of electrons for AV = 0, AV
less than the maximum value, and A V greater than the maximum value.
E Explain based on the photon theory of light why ultraviolet light would
be more likely than visible or infrared light to cause cancer by damaging
DNA molecules. How does this relate to discussion question C?
F Does E = hf imply that a photon changes its energy when it passes
from one transparent material into another substance with a different in-
dex of refraction?


74


Chapter 3  Light as a Particle


﻿


3.3 Wave-Particle Duality


i / Wave interference  patterns
photographed by Prof. Lyman
Page with a digital camera. Laser
light with a single well-defined
wavelength passed through a
series of absorbers to cut down
its intensity, then through a set of
slits to produce interference, and
finally into a digital camera chip.
(A triple slit was actually used,
but for conceptual simplicity we
discuss the results in the main
text as if it was a double slit.) In
panel 2 the intensity has been
reduced relative to 1, and even
more so for panel 3.


    How can light be both a particle and a wave? We are now
ready to resolve this seeming contradiction. Often in science when
something seems paradoxical, it's because we either don't define our
terms carefully, or don't test our ideas against any specific real-world
situation. Let's define particles and waves as follows:

    " Waves exhibit superposition, and specifically interference phe-
      nomena.

    " Particles can only exist in whole numbers, not fractions

    As a real-world check on our philosophizing, there is one partic-
ular experiment that works perfectly. We set up a double-slit inter-
ference experiment that we know will produce a diffraction pattern
if light is an honest-to-goodness wave, but we detect the light with
a detector that is capable of sensing individual photons, e.g., a dig-
ital camera. To make it possible to pick out individual dots from
individual photons, we must use filters to cut down the intensity of
the light to a very low level, just as in the photos by Prof. Page in
section 3.1. The whole thing is sealed inside a light-tight box. The
results are shown in figure i. (In fact, the similar figures in section
3.1 are simply cutouts from these figures.)
    Neither the pure wave theory nor the pure particle theory can
explain the results. If light was only a particle and not a wave, there
would be no interference effect. The result of the experiment would
be like firing a hail of bullets through a double slit, j. Only two
spots directly behind the slits would be hit.
    If, on the other hand, light was only a wave and not a particle,
we would get the same kind of diffraction pattern that would happen


j / Bullets pass
slit.


through a double


k / A water wave passes through
a double slit.


Section 3.3   Wave-Particle Duality


75


﻿


with a water wave, k. There would be no discrete dots in the photo,
only a diffraction pattern that shaded smoothly between light and
dark.
    Applying the definitions to this experiment, light must be both
a particle and a wave. It is a wave because it exhibits interference
effects. At the same time, the fact that the photographs contain
discrete dots is a direct demonstration that light refuses to be split
into units of less than a single photon. There can only be whole
numbers of photons: four photons in figure i/3, for example.

A wrong interpretation: photons interfering with each other
    One possible interpretation of wave-particle duality that occurred
to physicists early in the game was that perhaps the interference ef-
fects came from photons interacting with each other. By analogy, a
water wave consists of moving water molecules, and interference of
water waves results ultimately from all the mutual pushes and pulls
of the molecules. This interpretation was conclusively disproved by
G.I. Taylor, a student at Cambridge. The demonstration by Prof.
Page that we've just been discussing is essentially a modernized
version of Taylor's work. Taylor reasoned that if interference effects
came from photons interacting with each other, a bare minimum of
two photons would have to be present at the same time to produce
interference. By making the light source extremely dim, we can be
virtually certain that there are never two photons in the box at the
same time. In figure i, the intensity of the light has been cut down
so much by the absorbers that if it was in the open, the average
separation between photons would be on the order of a kilometer!
At any given moment, the number of photons in the box is most
likely to be zero. It is virtually certain that there were never two
photons in the box at once.

The concept of a photon's path is undefined.
    If a single photon can demonstrate double-slit interference, then
which slit did it pass through? The unavoidable answer must be that
it passes through both! This might not seem so strange if we think
of the photon as a wave, but it is highly counterintuitive if we try
to visualize it as a particle. The moral is that we should not think
in terms of the path of a photon. Like the fully human and fully
divine Jesus of Christian theology, a photon is supposed to be 100%
wave and 100% particle. If a photon had a well defined path, then it
would not demonstrate wave superposition and interference effects,
contradicting its wave nature. (In the next chapter we will discuss
the Heisenberg uncertainty principle, which gives a numerical way
of approaching this issue.)


        4      44


k-'


I /A  single photon
through both slits.


can go


76


Chapter 3   Light as a Particle


﻿


Another wrong interpretation: the pilot wave hypothesis
    A second possible explanation of wave-particle duality was taken
seriously in the early history of quantum mechanics. What if the
photon particle is like a surfer riding on top of its accompanying
wave? As the wave travels along, the particle is pushed, or "piloted"
by it. Imagining the particle and the wave as two separate entities
allows us to avoid the seemingly paradoxical idea that a photon is
both at once. The wave happily does its wave tricks, like super-
position and interference, and the particle acts like a respectable
particle, resolutely refusing to be in two different places at once. If
the wave, for instance, undergoes destructive interference, becoming
nearly zero in a particular region of space, then the particle simply
is not guided into that region.
    The problem with the pilot wave interpretation is that the only
way it can be experimentally tested or verified is if someone manages
to detach the particle from the wave, and show that there really are
two entities involved, not just one. Part of the scientific method is
that hypotheses are supposed to be experimentally testable. Since
nobody has ever managed to separate the wavelike part of a photon
from the particle part, the interpretation is not useful or meaningful
in a scientific sense.

The probability interpretation
    The correct interpretation of wave-particle duality is suggested
by the random nature of the experiment we've been discussing: even
though every photon wave/particle is prepared and released in the
same way, the location at which it is eventually detected by the
digital camera is different every time. The idea of the probability
interpretation of wave-particle duality is that the location of the
photon-particle is random, but the probability that it is in a certain
location is higher where the photon-wave's amplitude is greater.
    More specifically, the probability distribution of the particle must
be proportional to the square of the wave's amplitude,

           (probability distribution) oc (amplitude)2

This follows from the correspondence principle and from the fact
that a wave's energy density is proportional to the square of its am-
plitude. If we run the double-slit experiment for a long enough time,
the pattern of dots fills in and becomes very smooth as would have
been expected in classical physics. To preserve the correspondence
between classical and quantum physics, the amount of energy de-
posited in a given region of the picture over the long run must be
proportional to the square of the wave's amplitude. The amount of
energy deposited in a certain area depends on the number of pho-


Section 3.3   Wave-Particle Duality


77


﻿


                                  tons picked up, which is proportional to the probability of finding
                                  any given photon there.
                                    'A microwave oven                                   example 3
                                    D The figure shows two-dimensional (top) and one-dimensional
                                    (bottom) representations of the standing wave inside a microwave
                                    oven. Gray represents zero field, and white and black signify the
                                    strongest fields, with white being a field that is in the opposite di-
                                    rection compared to black. Compare the probabilities of detecting
                                    a microwave photon at points A, B, and C.
                                    A and C are both extremes of the wave, so the probabilities of
                                    detecting a photon at A and C are equal. It doesn't matter that we
                                    have represented C as negative and A as positive, because it is
                                    the square of the amplitude that is relevant. The amplitude at B is
                                    about 1/2 as much as the others, so the probability of detecting a
              A B   C               photon there is about 1/4 as much.

m / Example 3.                        The probability interpretation was disturbing to physicists who
                                  had spent their previous careers working in the deterministic world
                                  of classical physics, and ironically the most strenuous objections
                                  against it were raised by Einstein, who had invented the photon
                                  concept in the first place. The probability interpretation has never-
                                  theless passed every experimental test, and is now as well established
                                  as any part of physics.
                                      An aspect of the probability interpretation that has made many
                                  people uneasy is that the process of detecting and recording the
                                  photon's position seems to have a magical ability to get rid of the
                                  wavelike side of the photon's personality and force it to decide for
                                  once and for all where it really wants to be. But detection or mea-
                                  surement is after all only a physical process like any other, governed
                                  by the same laws of physics. We will postpone a detailed discussion
                                  of this issue until the following chapter, since a measuring device
                                  like a digital camera is made of matter, but we have so far only
                                  discussed how quantum mechanics relates to light.
                                    'What is the proportionality constant?         example 4
                                    > What is the proportionality constant that would make an actual
                                    equation out of (probability distribution) oc (amplitude)2?
                                    > The probability that the photon is in a certain small region of
                                    volume v should equal the fraction of the wave's energy that is
                                    within that volume:


                                                       F = energy in volume v
                                                            energy of photon
                                                            energy in volume v
                                                                   hf

                                    We assume v is small enough so that the electric and magnetic


78             Chapter 3   Light as a Particle


﻿


  fields are nearly constant throughout it. We then have

                   P=(        E+1B2)V
                                 hf
  We can simplify this formidable looking expression by recogniz-
  ing that in an electromagnetic wave, |El and |BI are related by
  |El = clBl. With some algebra, it turns out that the electric and
  magnetic fields each contribute half the total energy, so we can
  simplify this to

                        P2(        |E2) v
                                   hf
                                V-2
                             4mkhf|E|
  As advertised, the probability is proportional to the square of the
  wave's amplitude.
Discussion Questions
A In example 3 on page 78, about the carrot in the microwave oven,
show that it would be nonsensical to have probability be proportional to
the field itself, rather than the square of the field.
B     Einstein did not try to reconcile the wave and particle theories of
light, and did not say much about their apparent inconsistency. Einstein
basically visualized a beam of light as a stream of bullets coming from
a machine gun. In the photoelectric effect, a photon "bullet" would only
hit one atom, just as a real bullet would only hit one person. Suppose
someone reading his 1905 paper wanted to interpret it by saying that
Einstein's so-called particles of light are simply short wave-trains that only
occupy a small region of space. Comparing the wavelength of visible light
(a few hundred nm) to the size of an atom (on the order of 0.1 nm), explain
why this poses a difficulty for reconciling the particle and wave theories.
C    Can a white photon exist?
D    In double-slit diffraction of photons, would you get the same pattern
of dots on the digital camera image if you covered one slit? Why should it
matter whether you give the photon two choices or only one?


Section 3.3


Wave-Particle Duality               79


﻿


      volyme
   under surface

         area

K                 y


3.4 Photons in Three Dimensions
Up until now I've been sneaky and avoided a full discussion of the
three-dimensional aspects of the probability interpretation. The ex-
ample of the carrot in the microwave oven, for example, reduced
to a one-dimensional situation because we were considering three
points along the same line and because we were only comparing ra-
tios of probabilities. The purpose of bringing it up now is to head
off any feeling that you've been cheated conceptually rather than to
prepare you for mathematical problem solving in three dimensions,
which would not be appropriate for the level of this course.
    A typical example of a probability distribution in section 2.3
was the distribution of heights of human beings. The thing that
varied randomly, height, h, had units of meters, and the probabil-
ity distribution was a graph of a function D(h). The units of the
probability distribution had to be m-1 (inverse meters) so that ar-
eas under the curve, interpreted as probabilities, would be unitless:
(area) = (height) (width) = m-- m.
    Now suppose we have a two-dimensional problem, e.g., the prob-
ability distribution for the place on the surface of a digital camera
chip where a photon will be detected. The point where it is detected
would be described with two variables, x and y, each having units
of meters. The probability distribution will be a function of both
variables, D(x, y). A probability is now visualized as the volume
under the surface described by the function D(x, y), as shown in
figure n. The units of D must be m-2 so that probabilities will be
unitless: (probability) = (depth) (length) (width) = m-2 m. m.
    Generalizing finally to three dimensions, we find by analogy that
the probability distribution will be a function of all three coordi-
nates, D(x, y, z), and will have units of m-3. It is, unfortunately,
impossible to visualize the graph unless you are a mutant with a nat-
ural feel for life in four dimensions. If the probability distribution
is nearly constant within a certain volume of space v, the probabil-
ity that the photon is in that volume is simply vD. If you know
enough calculus, it should be clear that this can be generalized to
P = f Ddxdydz if D is not constant.


n / The
face.


volume under a sur-


80


Chapter 3   Light as a Particle


﻿


Summary
Selected Vocabulary
photon . . . . . . . . . . .  a particle of light
photoelectric effect . . . .  the ejection, by a photon, of an elec-
                              tron from the surface of an object
 wave-particle duality . . .  the idea that light is both a wave and
                              a particle
Summary
    Around the turn of the twentieth century, experiments began to
show problems with the classical wave theory of light. In any exper-
iment sensitive enough to detect very small amounts of light energy,
it becomes clear that light energy cannot be divided into chunks
smaller than a certain amount. Measurements involving the pho-
toelectric effect demonstrate that this smallest unit of light energy
equals hf, where f is the frequency of the light and h is a number
known as Planck's constant. We say that light energy is quantized
in units of hf, and we interpret this quantization as evidence that
light has particle properties as well as wave properties. Particles of
light are called photons.
    The only method of reconciling the wave and particle natures
of light that has stood the test of experiment is the probability
interpretation: the probability that the particle is at a given location
is proportional to the square of the amplitude of the wave at that
location.
    One important consequence of wave-particle duality is that we
must abandon the concept of the path the particle takes through
space. To hold on to this concept, we would have to contradict the
well established wave nature of light, since a wave can spread out in
every direction simultaneously.


Summary               81


﻿


Problems


Key
V    A computerized answer check is available online.
f    A problem that requires calculus.
* A difficult problem.
For some of these homework problems, you may find it convenient
to refer to the diagram of the electromagnetic spectrum shown in
section 6.4 of Electricity and Magnetism.
1     Give a numerical comparison of the number of photons per
second emitted by a hundred-watt FM radio transmitter and a
hundred-watt lightbulb.
2     Two different flashes of light each have the same energy. One
consists of photons with a wavelength of 600 nm, the other 400 nm.
If the number of photons in the 600-nm flash is 3.0 x 1018, how many
photons are in the 400-nm flash?                                V


3      When light is reflected from a mirror, perhaps only 80% of
the energy comes back. The rest is converted to heat. One could
try to explain this in two different ways: (1) 80% of the photons are
reflected, or (2) all the photons are reflected, but each loses 20% of
its energy. Based on your everyday knowledge about mirrors, how
can you tell which interpretation is correct? [Based on a problem
from PSSC Physics.]
4     Suppose we want to build an electronic light sensor using an
apparatus like the one described in the section on the photoelectric
effect. How would its ability to detect different parts of the spectrum
depend on the type of metal used in the capacitor plates?
5     The photoelectric effect can occur not just for metal cathodes
but for any substance, including living tissue. Ionization of DNA
molecules can cause cancer or birth defects. If the energy required to
ionize DNA is on the same order of magnitude as the energy required
to produce the photoelectric effect in a metal, which of these types
of electromagnetic waves might pose such a hazard? Explain.
60 Hz waves from power lines
100 MHz FM radio
microwaves from a microwave oven
visible light
ultraviolet light

x-rays
6 The beam of a 100-W overhead projector covers an area of
1 m x 1 m when it hits the screen 3 m away. Estimate the number
of photons that are in flight at any given time. (Since this is only


82


Chapter 3  Light as a Particle


﻿


an estimate, we can ignore the fact that the beam is not parallel.)

7     The two diffraction patterns were made by sending a flash of
light through the same double slit. Give a numerical comparison of
the amounts of energy in the two flashes.


Problems 8 and 9 were switched after edition 3.0, because doing 8
first makes it easier to do 9.
8     Three of the four graphs are properly normalized to represent
single photons. Which one isn't? Explain.
9     Photon Fred has a greater energy than photon Ginger. For
each of the following quantities, explain whether Fred's value of
that quantity is greater than Ginger's, less than Ginger's, or equal
to Ginger's. If there is no way to tell, explain why.
frequency
speed
wavelength
period
electric field strength
magnetic field strength


10      Give experimental evidence to disprove the following inter-
pretation of wave-particle duality: A photon is really a particle, but
it travels along a wavy path, like a zigzag with rounded corners. Cite
a specific, real experiment.
11      In the photoelectric effect, electrons are observed with virtu-
ally no time delay (~ 10 ns), even when the light source is very weak.
(A weak light source does however only produce a small number of
ejected electrons.) The purpose of this problem is to show that the
lack of a significant time delay contradicted the classical wave the-
ory of light, so throughout this problem you should put yourself in
the shoes of a classical physicist and pretend you don't know about
photons at all. At that time, it was thought that the electron might
have a radius on the order of 10-15 m. (Recent experiments have
shown that if the electron has any finite size at all, it is far smaller.)
(a) Estimate the power that would be soaked up by a single electron
in a beam of light with an intensity of 1 mW/in2.V
(b) The energy, Es, required for the electron to escape through the
surface of the cathode is on the order of 10-19 J. Find how long it
would take the electron to absorb this amount of energy, and explain
why your result constitutes strong evidence that there is something
wrong with the classical theory.V


Problem 7.


        ---.-- -.-------


Problem 8.


Problems              83


﻿


84             Chapter 3   Light as a Particle


﻿


                                                                  Dorothy melts the Wicked Witch
                                                                  of the West.


Chapter 4


Matter as a Wave


      [In] a few minutes I shall be all melted... I have been wicked
      in my day, but I never thought a little girl like you would ever
      be able to melt me and end my wicked deeds. Look out -
      here I go!
                                The Wicked Witch of the West
   As the Wicked Witch learned the hard way, losing molecular
cohesion can be unpleasant. That's why we should be very grate-
ful that the concepts of quantum physics apply to matter as well
as light. If matter obeyed the laws of classical physics, molecules
wouldn't exist.
   Consider, for example, the simplest atom, hydrogen. Why does
one hydrogen atom form a chemical bond with another hydrogen
atom? Roughly speaking, we'd expect a neighboring pair of hy-
drogen atoms, A and B, to exert no force on each other at all,
attractive or repulsive: there are two repulsive interactions (proton
A with proton B and electron A with electron B) and two attractive
interactions (proton A with electron B and electron A with proton
B). Thinking a little more precisely, we should even expect that once
the two atoms got close enough, the interaction would be repulsive.
For instance, if you squeezed them so close together that the two
protons were almost on top of each other, there would be a tremen-
dously strong repulsion between them due to the 1/r2 nature of the


85


﻿


electrical force. The repulsion between the electrons would not be
as strong, because each electron ranges over a large area, and is not
likely to be found right on top of the other electron. This was only
a rough argument based on averages, but the conclusion is validated
by a more complete classical analysis: hydrogen molecules should
not exist according to classical physics.
    Quantum physics to the rescue! As we'll see shortly, the whole
problem is solved by applying the same quantum concepts to elec-
trons that we have already used for photons.


4.1 Electrons as Waves
We started our journey into quantum physics by studying the ran-
dom behavior of matter in radioactive decay, and then asked how
randomness could be linked to the basic laws of nature governing
light. The probability interpretation of wave-particle duality was
strange and hard to accept, but it provided such a link. It is now
natural to ask whether the same explanation could be applied to
matter. If the fundamental building block of light, the photon, is
a particle as well as a wave, is it possible that the basic units of
matter, such as electrons, are waves as well as particles?
    A young French aristocrat studying physics, Louis de Broglie
(pronounced "broylee"), made exactly this suggestion in his 1923
Ph.D. thesis. His idea had seemed so farfetched that there was
serious doubt about whether to grant him the degree. Einstein was
asked for his opinion, and with his strong support, de Broglie got
his degree.
    Only two years later, American physicists C.J. Davisson and L.
Germer confirmed de Broglie's idea by accident. They had been
studying the scattering of electrons from the surface of a sample
of nickel, made of many small crystals. (One can often see such a
crystalline pattern on a brass doorknob that has been polished by
repeated handling.) An accidental explosion occurred, and when
they put their apparatus back together they observed something
entirely different: the scattered electrons were now creating an in-
terference pattern! This dramatic proof of the wave nature of matter
came about because the nickel sample had been melted by the ex-
plosion and then resolidified as a single crystal. The nickel atoms,
now nicely arranged in the regular rows and columns of a crystalline
lattice, were acting as the lines of a diffraction grating. The new
crystal was analogous to the type of ordinary diffraction grating in
which the lines are etched on the surface of a mirror (a reflection
grating) rather than the kind in which the light passes through the
transparent gaps between the lines (a transmission grating).
    Although we will concentrate on the wave-particle duality of elec-
trons because it is important in chemistry and the physics of atoms,


86             Chapter 4    Matter as a Wave


﻿


all the other "particles" of matter you've learned about show wave
properties as well. Figure a, for instance, shows a wave interference
pattern of neutrons.
   It might seem as though all our work was already done for us,
and there would be nothing new to understand about electrons:
they have the same kind of funny wave-particle duality as photons.
That's almost true, but not quite. There are some important ways
in which electrons differ significantly from photons:


   1. Electrons have mass, and photons don't.

   2. Photons always move at the speed of light, but electrons can
     move at any speed less than c.

   3. Photons don't have electric charge, but electrons do, so electric
     forces can act on them. The most important example is the
     atom, in which the electrons are held by the electric force of
     the nucleus.

   4. Electrons cannot be absorbed or emitted as photons are. De-
     stroying an electron, or creating one out of nothing, would
     violate conservation of charge.


(In chapter 5 we will learn of one more fundamental way in which
electrons differ from photons, for a total of five.)


   5000                                                             a / A double-slit interference pat-
                                                                    tern made with neutrons. (A.
                                                                    Zeilinger, R. GAhler, C.G. Shull,
  4000                                                              W. Treimer, and W. Mampe, Re-
                                                                    views of Modern Physics, Vol. 60,
                                                                    1988.)
  .3000-


  2000-
  oo

  14U,.                        Vi
     h   CNNN 1000

                        100 pm

         SCANNING SLIT POSITION


   Because electrons are different from photons, it is not immedi-
ately obvious which of the photon equations from chapter 3 can be
applied to electrons as well. A particle property, the energy of one
photon, is related to its wave properties via E = hf or, equivalently,
E = hc/A. The momentum of a photon was given by p = hf/c or


Section 4.1   Electrons as Waves


87


﻿


p = h/A (example 2 on page 74). Ultimately it was a matter of ex-
periment to determine which of these equations, if any, would work
for electrons, but we can make a quick and dirty guess simply by
noting that some of the equations involve c, the speed of light, and
some do not. Since c is irrelevant in the case of an electron, we
might guess that the equations of general validity are those that do
not have c in them:

                             E=hf
                                  h
                               p A

This is essentially the reasoning that de Broglie went through, and
experiments have confirmed these two equations for all the funda-
mental building blocks of light and matter, not just for photons and
electrons.
    The second equation, which I soft-pedaled in chapter 3, takes
on a greater important for electrons. This is first of all because the
momentum of matter is more likely to be significant than the mo-
mentum of light under ordinary conditions, and also because force
is the transfer of momentum, and electrons are affected by electrical
forces.
  The wavelength of an elephant                        example 1
  > What is the wavelength of a trotting elephant?
  > One may doubt whether the equation should be applied to an
  elephant, which is not just a single particle but a rather large col-
  lection of them. Throwing caution to the wind, however, we esti-
  mate the elephant's mass at 103 kg and its trotting speed at 10
  m/s. Its wavelength is therefore roughly

                        h
                        p
                        h
                        my
                        6.63 x 10-34 J-s
                        (103 kg)(10 m/s)

                          l-37 (kg -m2/s2) -.
                                  kg-m/s
                      =10-37m


    The wavelength found in this example is so fantastically small
that we can be sure we will never observe any measurable wave
phenomena with elephants. The result is numerically small because
Planck's constant is so small, and as in some examples encountered
previously, this smallness is in accord with the correspondence prin-
ciple.


88             Chapter 4   Matter as a Wave


﻿


    Although a smaller mass in the equation A = h/mv does re-
sult in a longer wavelength, the wavelength is still quite short even
for individual electrons under typical conditions, as shown in the
following example.
  The typical wavelength of an electron         example 2
  > Electrons in circuits and in atoms are typically moving through
  voltage differences on the order of 1 V, so that a typical energy is
  (e)(1 V), which is on the order of 10-19 J. What is the wavelength
  of an electron with this amount of kinetic energy?
  > This energy is nonrelativistic, since it is much less than mc2.
  Momentum and energy are therefore related by the nonrelativistic
  equation KE = p2/2m. Solving for p and substituting in to the
  equation for the wavelength, we find

                              h
                            2m-KE
                        =1.6x 10-9m

  This is on the same order of magnitude as the size of an atom,
  which is no accident: as we will discuss in the next chapter in
  more detail, an electron in an atom can be interpreted as a stand-
  ing wave. The smallness of the wavelength of a typical electron
  also helps to explain why the wave nature of electrons wasn't dis-
  covered until a hundred years after the wave nature of light. To
  scale the usual wave-optics devices such as diffraction gratings
  down to the size needed to work with electrons at ordinary ener-
  gies, we need to make them so small that their parts are compa-
  rable in size to individual atoms. This is essentially what Davisson
  and Germer did with their nickel crystal.
  self-check A
  These remarks about the inconvenient smallness of electron wavelengths
  apply only under the assumption that the electrons have typical ener-
  gies. What kind of energy would an electron have to have in order to
  have a longer wavelength that might be more convenient to work with?
  > Answer, p. 133

What kind of wave is it?
    If a sound wave is a vibration of matter, and a photon is a
vibration of electric and magnetic fields, what kind of a wave is
an electron made of? The disconcerting answer is that there is
no experimental "observable," i.e., directly measurable quantity, to
correspond to the electron wave itself. In other words, there are
devices like microphones that detect the oscillations of air pressure
in a sound wave, and devices such as radio receivers that measure
the oscillation of the electric and magnetic fields in a light wave,
but nobody has ever found any way to measure an electron wave
directly.


Section 4.1   Electrons as Waves


89


﻿


b / These two electron waves
are not distinguishable by any
measuring device.


    We can of course detect the energy (or momentum) possessed by
an electron just as we could detect the energy of a photon using a
digital camera. (In fact I'd imagine that an unmodified digital cam-
era chip placed in a vacuum chamber would detect electrons just as
handily as photons.) But this only allows us to determine where the
wave carries high probability and where it carries low probability.
Probability is proportional to the square of the wave's amplitude,
but measuring its square is not the same as measuring the wave
itself. In particular, we get the same result by squaring either a
positive number or its negative, so there is no way to determine the
positive or negative sign of an electron wave.
    Most physicists tend toward the school of philosophy known as
operationalism, which says that a concept is only meaningful if we
can define some set of operations for observing, measuring, or test-
ing it. According to a strict operationalist, then, the electron wave
itself is a meaningless concept. Nevertheless, it turns out to be one
of those concepts like love or humor that is impossible to measure
and yet very useful to have around. We therefore give it a symbol,
' (the capital Greek letter psi), and a special name, the electron
wavefunction (because it is a function of the coordinates x, y, and
z that specify where you are in space). It would be impossible, for
example, to calculate the shape of the electron wave in a hydro-
gen atom without having some symbol for the wave. But when the
calculation produces a result that can be compared directly to ex-
periment, the final algebraic result will turn out to involve only i2,
which is what is observable, not P itself.
    Since i, unlike E and B, is not directly measurable, we are free
to make the probability equations have a simple form: instead of
having the probability density equal to some funny constant multi-
plied by j2, we simply define P so that the constant of proportion-
ality is one:
                  (probability density) = P2
Since the probability density has units of m-3, the units of P must
be m-3/2.
Discussion Question
A    Frequency is oscillations per second, whereas wavelength is meters
per oscillation. How could the equations E = hf and p = h/A be made
to look more alike by using quantities that were more closely analogous?
(This more symmetric treatment makes it easier to incorporate relativity
into quantum mechanics, since relativity says that space and time are not
entirely separate.)


90             Chapter 4    Matter as a Wave


﻿


412   f * Dispersive Waves
A colleague of mine who teaches chemistry loves to tell the story
about an exceptionally bright student who, when told of the equa-
tion p = h/A, protested, "But when I derived it, it had a factor of
2!" The issue that's involved is a real one, albeit one that could be
glossed over (and is, in most textbooks) without raising any alarms
in the mind of the average student. The present optional section
addresses this point; it is intended for the student who wishes to
delve a little deeper.
    Here's how the now-legendary student was presumably reason-
ing. We start with the equation v= f A, which is valid for any sine
wave, whether it's quantum or classical. Let's assume we already
know E = hf, and are trying to derive the relationship between
wavelength and momentum:

                                V

                                vh
                                £
                                  vh
                                2mv2
                                2h
                                my
                                2h
                                p
The reasoning seems valid, but the result does contradict the ac-
cepted one, which is after all solidly based on experiment.
    The mistaken assumption is that we can figure everything out in
terms of pure sine waves. Mathematically, the only wave that has a
perfectly well defined wavelength and frequency is a sine wave, and
not just any sine wave but an infinitely long one, c. The unphysical
thing about such a wave is that it has no leading or trailing edge, so
it can never be said to enter or leave any particular region of space.
Our derivation made use of the velocity, v, and if velocity is to be a
meaningful concept, it must tell us how quickly stuff (mass, energy,
momentum,...) is transported from one region of space to another.
Since an infinitely long sine wave doesn't remove any stuff from one
region and take it to another, the "velocity of its stuff" is not a well
defined concept.

    Of course the individual wave peaks do travel through space, and
one might think that it would make sense to associate their speed
with the "speed of stuff," but as we will see, the two velocities are
in general unequal when a wave's velocity depends on wavelength.
Such a wave is called a dispersive wave, because a wave pulse consist-
ing of a superposition of waves of different wavelengths will separate
(disperse) into its separate wavelengths as the waves move through
space at different speeds. Nearly all the waves we have encountered


c / Part of an infinite sine wave.


Section 4.2   f * Dispersive Waves


91


﻿


d / A finite-length sine wave.


e / A beat pattern created by
superimposing two sine waves
with  slightly different wave-
lengths.


have been nondispersive. For instance, sound waves and light waves
(in a vacuum) have speeds independent of wavelength. A water wave
is one good example of a dispersive wave. Long-wavelength water
waves travel faster, so a ship at sea that encounters a storm typi-
cally sees the long-wavelength parts of the wave first. When dealing
with dispersive waves, we need symbols and words to distinguish
the two speeds. The speed at which wave peaks move is called the
phase velocity, vp, and the speed at which "stuff" moves is called
the group velocity, vg.
   An infinite sine wave can only tell us about the phase velocity,
not the group velocity, which is really what we would be talking
about when we referred to the speed of an electron. If an infinite
sine wave is the simplest possible wave, what's the next best thing?
We might think the runner up in simplicity would be a wave train
consisting of a chopped-off segment of a sine wave, d. However, this
kind of wave has kinks in it at the end. A simple wave should be
one that we can build by superposing a small number of infinite
sine waves, but a kink can never be produced by superposing any
number of infinitely long sine waves.
   Actually the simplest wave that transports stuff from place to
place is the pattern shown in figure e. Called a beat pattern, it is
formed by superposing two sine waves whose wavelengths are similar
but not quite the same. If you have ever heard the pulsating howling
sound of musicians in the process of tuning their instruments to each
other, you have heard a beat pattern. The beat pattern gets stronger
and weaker as the two sine waves go in and out of phase with each
other. The beat pattern has more "stuff" (energy, for example)
in the areas where constructive interference occurs, and less in the
regions of cancellation. As the whole pattern moves through space,
stuff is transported from some regions and into other ones.
   If the frequency of the two sine waves differs by 10%, for in-
stance, then ten periods will be occur between times when they are
in phase. Another way of saying it is that the sinusoidal "envelope"
(the dashed lines in figure e) has a frequency equal to the difference
in frequency between the two waves. For instance, if the waves had
frequencies of 100 Hz and 110 Hz, the frequency of the envelope
would be 10 Hz.
   To apply similar reasoning to the wavelength, we must define a
quantity z = 1/A that relates to wavelength in the same way that
frequency relates to period. In terms of this new variable, the z of
the envelope equals the difference between the z's of the two sine
waves.
   The group velocity is the speed at which the envelope moves
through space. Let Af and Az be the differences between the
frequencies and z's of the two sine waves, which means that they
equal the frequency and z of the envelope. The group velocity is


92            Chapter 4    Matter as a Wave


﻿


Vg = femveiopeAemveiope = Af/Az. If Af and Az are sufficiently
small, we can approximate this expression as a derivative,

                                df
                             Vg dz

This expression is usually taken as the definition of the group veloc-
ity for wave patterns that consist of a superposition of sine waves
having a narrow range of frequencies and wavelengths. In quan-
tum mechanics, with f = E/h and z = p/h, we have vg = dE/dp.
In the case of a nonrelativistic electron the relationship between
energy and momentum is E = p2/2m, so the group velocity is
dE/dp = p/m = v, exactly what it should be. It is only the phase
velocity that differs by a factor of two from what we would have
expected, but the phase velocity is not the physically important
thing.


Bound States


Electrons are at their most interesting when they're in atoms, that
is, when they are bound within a small region of space. We can
understand a great deal about atoms and molecules based on simple
arguments about such bound states, without going into any of the
realistic details of atom. The simplest model of a bound state is
known as the particle in a box: like a ball on a pool table, the
electron feels zero force while in the interior, but when it reaches
an edge it encounters a wall that pushes back inward on it with
a large force. In particle language, we would describe the electron
as bouncing off of the wall, but this incorrectly assumes that the
electron has a certain path through space. It is more correct to
describe the electron as a wave that undergoes 100% reflection at
the boundaries of the box.
    Like a generation of physics students before me, I rolled my
eyes when initially introduced to the unrealistic idea of putting a
particle in a box. It seemed completely impractical, an artificial
textbook invention. Today, however, it has become routine to study
electrons in rectangular boxes in actual laboratory experiments. The
"box" is actually just an empty cavity within a solid piece of silicon,
amounting in volume to a few hundred atoms. The methods for
creating these electron-in-a-box setups (known as "quantum dots")
were a by-product of the development of technologies for fabricating
computer chips.
    For simplicity let's imagine a one-dimensional electron in a box,
i.e., we assume that the electron is only free to move along a line.
The resulting standing wave patterns, of which the first three are
shown in figure f, are just like some of the patterns we encountered
with sound waves in musical instruments. The wave patterns must
be zero at the ends of the box, because we are assuming the walls
are impenetrable, and there should therefore be zero probability of


  n=1


  n=2


  n=3


f / Three  possible   standing-
wave patterns for a particle in a
box.


Section 4.3   Bound States              93


﻿


finding the electron outside the box. Each wave pattern is labeled
according to n, the number of peaks and valleys it has. In quan-
tum physics, these wave patterns are referred to as "states" of the
particle-in-the-box system.
    The following seemingly innocuous observations about the par-
ticle in the box lead us directly to the solutions to some of the most
vexing failures of classical physics:
The particle's energy is quantized (can only have certain values).
Each wavelength corresponds to a certain momentum, and a given
momentum implies a definite kinetic energy, E = p2/2m. (This is
the second type of energy quantization we have encountered. The
type we studied previously had to do with restricting the number
of particles to a whole number, while assuming some specific wave-
length and energy for each particle. This type of quantization refers
to the energies that a single particle can have. Both photons and
matter particles demonstrate both types of quantization under the
appropriate circumstances.)
The particle has a minimum kinetic energy. Long wavelengths cor-
respond to low momenta and low energies. There can be no state
with an energy lower than that of the n = 1 state, called the ground
state.
The smaller the space in which the particle is confined, the higher
its kinetic energy must be. Again, this is because long wavelengths
give lower energies.
  Spectra of thin gases                               example 3
  A fact that was inexplicable by classical physics was that thin
  gases absorb and emit light only at certain wavelengths. This
  was observed both in earthbound laboratories and in the spectra
  of stars. Figure g shows the example of the spectrum of the star
  Sirius, in which there are "gap teeth" at certain wavelengths. Tak-
  ing this spectrum as an example, we can give a straightforward
  explanation using quantum physics.
  Energy is released in the dense interior of the star, but the outer
  layers of the star are thin, so the atoms are far apart and electrons
  are confined within individual atoms. Although their standing-
  wave patterns are not as simple as those of the particle in the
  box, their energies are quantized.
  When a photon is on its way out through the outer layers, it can be
  absorbed by an electron in an atom, but only if the amount of en-
  ergy it carries happens to be the right amount to kick the electron
  from one of the allowed energy levels to one of the higher lev-
  els. The photon energies that are missing from the spectrum are
  the ones that equal the difference in energy between two elec-
  tron energy levels. (The most prominent of the absorption lines in
  Sirius's spectrum are absorption lines of the hydrogen atom.)


blue                      red

g / The spectrum of the light
from the star Sirius. Photograph
by the author.


94             Chapter 4   Matter as a Wave


﻿


  The stability of atoms                              example 4
  In many Star Trek episodes the Enterprise, in orbit around a planet,
  suddenly lost engine power and began spiraling down toward the
  planet's surface. This was utter nonsense, of course, due to con-
  servation of energy: the ship had no way of getting rid of energy,
  so it did not need the engines to replenish it.
  Consider, however, the electron in an atom as it orbits the nu-
  cleus. The electron does have a way to release energy: it has an
  acceleration due to its continuously changing direction of motion,
  and according to classical physics, any accelerating charged par-
  ticle emits electromagnetic waves. According to classical physics,
  atoms should collapse!
  The solution lies in the observation that a bound state has a min-
  imum energy. An electron in one of the higher-energy atomic
  states can and does emit photons and hop down step by step in
  energy. But once it is in the ground state, it cannot emit a photon
  because there is no lower-energy state for it to go to.
  Chemical bonds in hydrogen molecules        example 5
  I began this chapter with a classical argument that chemical
  bonds, as in an H2 molecule, should not exist. Quantum physics
  explains why this type of bonding does in fact occur. When the
  atoms are next to each other, the electrons are shared between
  them. The "box" is about twice as wide, and a larger box allows a
  smaller energy. Energy is required in order to separate the atoms.   2
  (A qualitatively different type of bonding is discussed in on page
  123.)
Discussion Questions
A Neutrons attract each other via the strong nuclear force, so according
to classical physics it should be possible to form nuclei out of clusters of h / Example 5: Two hydro-
two or more neutrons, with no protons at all. Experimental searches,  gen atoms bond to form an H2
however, have failed to turn up evidence of a stable two-neutron system molecule. In the molecule, the
(dineutron) or larger stable clusters. These systems are apparently not  two electrons' wave patterns
just unstable in the sense of being able to beta decay but unstable in overlap, and are about twice as
the sense that they don't hold together at all. Explain based on quantum  wide.
physics why a dineutron might spontaneously fly apart.
B    The following table shows the energy gap between the ground state
and the first excited state for four nuclei, in units of picojoules. (The nuclei
were chosen to be ones that have similar structures, e.g., they are all
spherical in shape.)
nucleus energy gap (picojoules)
4He        3.234
160        0.968
40 Ca   0.536
208Pb   0.418
Explain the trend in the data.


Section 4.3


Bound States             95


﻿


The Uncertainty Principle


The uncertainty principle
Eliminating randomness through measurement?
    A common reaction to quantum physics, among both early-
twentieth-century physicists and modern students, is that we should
be able to get rid of randomness through accurate measurement. If
I say, for example, that it is meaningless to discuss the path of a
photon or an electron, you might suggest that we simply measure
the particle's position and velocity many times in a row. This series
of snapshots would amount to a description of its path.
    A practical objection to this plan is that the process of measure-
ment will have an effect on the thing we are trying to measure. This
may not be of much concern, for example, when a traffic cop mea-
sures your car's motion with a radar gun, because the energy and
momentum of the radar pulses aren't enough to change the car's
motion significantly. But on the subatomic scale it is a very real
problem. Making a videotape of an electron orbiting a nucleus is
not just difficult, it is theoretically impossible, even with the video
camera hooked up to the best imaginable microscope. The video
camera makes pictures of things using light that has bounced off
them and come into the camera. If even a single photon of the
right wavelength was to bounce off of the electron we were trying to
study, the electron's recoil would be enough to change its behavior
significantly (see homework problem 4).

The Heisenberg uncertainty principle
    This insight, that measurement changes the thing being mea-
sured, is the kind of idea that clove-cigarette-smoking intellectuals
outside of the physical sciences like to claim they knew all along. If
only, they say, the physicists had made more of a habit of reading
literary journals, they could have saved a lot of work. The anthro-
pologist Margaret Mead has recently been accused of inadvertently
encouraging her teenaged Samoan informants to exaggerate the free-
dom of youthful sexual experimentation in their society. If this is
considered a damning critique of her work, it is because she could
have done better: other anthropologists claim to have been able to
eliminate the observer-as-participant problem and collect untainted
data.
    The German physicist Werner Heisenberg, however, showed that
in quantum physics, any measuring technique runs into a brick wall
when we try to improve its accuracy beyond a certain point. Heisen-
berg showed that the limitation is a question of what there is to be
known, even in principle, about the system itself, not of the inabil-
ity of a particular measuring device to ferret out information that
is knowable.


96             Chapter 4    Matter as a Wave


﻿


    Suppose, for example, that we have constructed an electron in a
box (quantum dot) setup in our laboratory, and we are able to adjust
the length L of the box as desired. All the standing wave patterns
pretty much fill the box, so our knowledge of the electron's position
is of limited accuracy. If we write Ax for the range of uncertainty
in our knowledge of its position, then Ax is roughly the same as the
length of the box:
                             A   ~x L
If we wish to know its position more accurately, we can certainly
squeeze it into a smaller space by reducing L, but this has an unin-
tended side-effect. A standing wave is really a superposition of two
traveling waves going in opposite directions. The equation p = h/A
only gives the magnitude of the momentum vector, not its direc-
tion, so we should really interpret the wave as a 50/50 mixture of
a right-going wave with momentum p = h/A and a left-going one
with momentum p = -h/A. The uncertainty in our knowledge of
the electron's momentum is Ap = 2h/A, covering the range between
these two values. Even if we make sure the electron is in the ground
state, whose wavelength A = 2L is the longest possible, we have an
uncertainty in momentum of Ap = h/L. In general, we find

                         Ap > h/L

with equality for the ground state and inequality for the higher-
energy states. Thus if we reduce L to improve our knowledge of the
electron's position, we do so at the cost of knowing less about its
momentum. This trade-off is neatly summarized by multiplying the
two equations to give

                         ApAx > h

Although we have derived this in the special case of a particle in a
box, it is an example of a principle of more general validity:

    the Heisenberg uncertainty principle
It is not possible, even in principle, to know the momentum and the
position of a particle simultaneously and with perfect accuracy. The
uncertainties in these two quantities are always such that

                         ApAx> h


(This approximation can be made into a strict inequality, ApAzx>
h/4w, but only with more careful definitions, which we will not
bother with.1)
    Note that although I encouraged you to think of this derivation
in terms of a specific real-world system, the quantum dot, I never
   Se homework problems 6 and 7.


Section 4.4   The Uncertainty Principle


97


﻿


made any reference to specific measuring equipment. The argument
is simply that we cannot know the particle's position very accurately
unless it has a very well defined position, it cannot have a very well
defined position unless its wave-pattern covers only a very small
amount of space, and its wave-pattern cannot be thus compressed
without giving it a short wavelength and a correspondingly uncer-
tain momentum. The uncertainty principle is therefore a restriction
on how much there is to know about a particle, not just on what we
can know about it with a certain technique.
  An estimate for electrons i atoms                     example 6
  > A typical energy for an electron in an atom is on the order of
  (1 volt).- e, which corresponds to a speed of about 1 % of the speed
  of light. If a typical atom has a size on the order of 0.1 nm, how
  close are the electrons to the limit imposed by the uncertainty
  principle?
  > If we assume the electron moves in all directions with equal
  probability, the uncertainty in its momentum is roughly twice its
  typical momentum. This only an order-of-magnitude estimate, so
  we take Ap to be the same as a typical momentum:
             ApAx = ptypicaiAx
                    = (melectron)(0.01c)(0.1 x 10-9m)
                    = 3 x 10-34 j.s
  This is on the same order of magnitude as Planck's constant, so
  evidently the electron is "right up against the wall." (The fact that
  it is somewhat less than h is of no concern since this was only an
  estimate, and we have not stated the uncertainty principle in its
  most exact form.)
  self-check B
  If we were to apply the uncertainty principle to human-scale objects,
  what would be the significance of the small numerical value of Planck's
  constant?                                        > Answer, p. 133
  self-check C
  Suppose rain is falling on your roof, and there is a tiny hole that lets
  raindrops into your living room now and then. All these drops hit the
  same spot on the floor, so they have the same value of x. Not only
  that, but if the rain is falling straight down, they all have zero horizontal
  momentum. Thus it seems that the raindrops have Ap = 0, Ax = 0,
  and ApAx = 0, violating the uncertainty principle. To look for the hole in
  this argument, consider how it would be acted out on the microscopic
  scale: an electron wave comes along and hits a narrow slit. What really
  happens?                                         > Answer, p. 133
  Historical Note
  The true nature of Heisenberg's role in the Nazi atomic bomb effort is
  a fascinating question, and dramatic enough to have inspired a well-
  received 1998 theatrical play, "Copenhagen." The real story, however,


98             Chapter 4    Matter as a Wave


﻿


  may never be completely unraveled. Heisenberg was the scientific leader
  of the German bomb program up until its cancellation in 1942, when the
  German military decided that it was too ambitious a project to undertake
  in wartime, and too unlikely to produce results.
      Some historians believe that Heisenberg intentionally delayed and
  obstructed the project because he secretly did not want the Nazis to
  get the bomb. Heisenberg's apologists point out that he never joined
  the Nazi party, and was not anti-Semitic. He actively resisted the gov-
  ernment's Deutsche-Physik policy of eliminating supposed Jewish in-
  fluences from physics, and as a result was denounced by the S.S. as a
  traitor, escaping punishment only because Himmler personally declared
  him innocent. One strong piece of evidence is a secret message carried
  to the U.S. in 1941, by one of the last Jews to escape from Berlin, and
  eventually delivered to the chairman of the Uranium Committee, which
  was then studying the feasibility of a bomb. The message stated "...that
  a large number of German physicists are working intensively on the
  problem of the uranium bomb under the direction of Heisenberg, [and]
  that Heisenberg himself tries to delay the work as much as possible,
  fearing the catastrophic results of success. But he cannot help fulfill-
  ing the orders given to him, and if the problem can be solved, it will be
  solved probably in the near future. So he gave the advice to us to hurry
  up if U.S.A. will not come too late." The message supports the view that
  Heisenberg intentionally misled his government about the bomb's tech-
  nical feasibility; German Minister of Armaments Albert Speer wrote that
  he was convinced to drop the project after a 1942 meeting with Heisen-
  berg because "the physicists themselves didn't want to put too much
  into it." Heisenberg also may have warned Danish physicist Niels Bohr
  personally in September 1941 about the existence of the Nazi bomb
  effort.
     On the other side of the debate, critics of Heisenberg say that he
  clearly wanted Germany to win the war, that he visited German-occupied
  territories in a semi-official role, and that he simply may not have been
  very good at his job directing the bomb project. On a visit to the oc-
  cupied Netherlands in 1943, he told a colleague, "Democracy cannot
  develop sufficient energy to rule Europe. There are, therefore, only two
  alternatives: Germany and Russia. And then a Europe under German
  leadership would be the lesser evil." Some historians2 argue that the
  real point of Heisenberg's meeting with Bohr was to try to convince the
  U.S. not to try to build a bomb, so that Germany, possessing a nuclear
  monopoly, would defeat the Soviets - this was after the June 1941 en-
  try of the U.S.S.R. into the war, but before the December 1941 Pearl
  Harbor attack brought the U.S. in. Bohr apparently considered Heisen-
  berg's account of the meeting, published after the war was over, to be
  inaccurate.3 The secret 1941 message also has a curious moral pas-
  sivity to it, as if Heisenberg was saying "I hope you stop me before I do
  something bad," but we should also consider the great risk Heisenberg
  would have been running if he actually originated the message.


  2A Historical Perspective on Copenhagen, David C. Cassidy, Physics Today,
July 2000, p. 28, http://www.aip.org/pt/vol-53/iss-7/p28.html
   3Bohr drafted several replies, but never published them for fear of hurting
Heisenberg and his family. Bohr's papers were to be sealed for 50 years after his
death, but his family released them early, in February 2002.


i / Werner Heisenberg.


Section 4.4  The Uncertainty Principle


99


﻿


Measurement and Schrodinger's cat
    In chapter 3 I briefly mentioned an issue concerning measure-
ment that we are now ready to address carefully. If you hang around
a laboratory where quantum-physics experiments are being done
and secretly record the physicists' conversations, you'll hear them
say many things that assume the probability interpretation of quan-
tum mechanics. Usually they will speak as though the randomness
of quantum mechanics enters the picture when something is mea-
sured. In the digital camera experiments of chapter 3, for example,
they would casually describe the detection of a photon at one of the
pixels as if the moment of detection was when the photon was forced
to "make up its mind." Although this mental cartoon usually works
fairly well as a description of things one experiences in the lab, it
cannot ultimately be correct, because it attributes a special role to
measurement, which is really just a physical process like all other
physical processes.
    If we are to find an interpretation that avoids giving any spe-
cial role to measurement processes, then we must think of the entire
laboratory, including the measuring devices and the physicists them-
selves, as one big quantum-mechanical system made out of protons,
neutrons, electrons, and photons. In other words, we should take
quantum physics seriously as a description not just of microscopic
objects like atoms but of human-scale ("macroscopic") things like
the apparatus, the furniture, and the people.
    The most celebrated example is called the Schr6dinger's cat ex-
periment. Luckily for the cat, there probably was no actual ex-
periment     it was simply a "thought experiment" that the physi-
cist the German theorist Schrodinger discussed with his colleagues.
Schrodinger wrote:
  One can even construct quite burlesque cases. A cat is shut up in a steel
  container, together with the following diabolical apparatus (which one
  must keep out of the direct clutches of the cat): In a [radiation detector]
  there is a tiny mass of radioactive substance, so little that in the course
  of an hour perhaps one atom of it disintegrates, but also with equal
  probability not even one; if it does happen, the [detector] responds and
  ... activates a hammer that shatters a little flask of prussic acid [filling
  the chamber with poison gas]. If one has left this entire system to itself
  for an hour, then one will say to himself that the cat is still living, if
  in that time no atom has disintegrated. The first atomic disintegration
  would have poisoned it.
Now comes the strange part. Quantum mechanics says that the
particles the cat is made of have wave properties, including the
property of superposition. Schrodinger describes the wavefunction
of the box's contents at the end of the hour:
  The wavefunction of the entire system would express this situation by
  having the living and the dead cat mixed ... in equal parts [50/50 pro-
  portions]. The uncertainty originally restricted to the atomic domain


j / Schrcdinger's cat.


100


Chapter 4  Matter as a Wave


﻿


  has been transformed into a macroscopic uncertainty...
    At first Schrodinger's description seems like nonsense. When you
opened the box, would you see two ghostlike cats, as in a doubly
exposed photograph, one dead and one alive? Obviously not. You
would have a single, fully material cat, which would either be dead
or very, very upset. But Schrodinger has an equally strange and
logical answer for that objection. In the same way that the quantum
randomness of the radioactive atom spread to the cat and made its
wavefunction a random mixture of life and death, the randomness
spreads wider once you open the box, and your own wavefunction
becomes a mixture of a person who has just killed a cat and a person
who hasn't.
Discussion Questions
A     Compare Ap and Ax for the two lowest energy levels of the one-
dimensional particle in a box, and discuss how this relates to the uncer-
tainty principle.
B    On a graph of Ap versus Ax, sketch the regions that are allowed and
forbidden by the Heisenberg uncertainty principle. Interpret the graph:
Where does an atom lie on it? An elephant? Can either p or x be mea-
sured with perfect accuracy if we don't care about the other?


4.5 Electrons in Electric Fields
    So far the only electron wave patterns we've considered have
been simple sine waves, but whenever an electron finds itself in an
electric field, it must have a more complicated wave pattern. Let's
consider the example of an electron being accelerated by the elec-
tron gun at the back of a TV tube. The electron is moving from
a region of low voltage into a region of higher voltage. Since its
charge is negative, it loses PE by moving to a higher voltage, so its
KE increases. As its potential energy goes down, its kinetic energy
goes up by an equal amount, keeping the total energy constant. In-
creasing kinetic energy implies a growing momentum, and therefore
a shortening wavelength, k.
    The wavefunction as a whole does not have a single well-defined
wavelength, but the wave changes so gradually that if you only look
at a small part of it you can still pick out a wavelength and relate
it to the momentum and energy. (The picture actually exagger-
ates by many orders of magnitude the rate at which the wavelength
changes.)
    But what if the electric field was stronger? The electric field in
a TV is only ~ 105 N/C, but the electric field within an atom is
more like 1012 N/C. In figure 1, the wavelength changes so rapidly
that there is nothing that looks like a sine wave at all. We could
get a general idea of the wavelength in a given region by measuring
the distance between two peaks, but that would only be a rough


direction of motion
(speeding up)


high PE                  low PE
low KE                  high KE
low momentum     high momentum
long wavelength  short wavelength
weak curvature   strong curvature

k / An  electron  in  a  gentle
electric field gradually shortens
its wavelength as it gains energy.


Section 4.5   Electrons in Electric Fields


101


﻿


approximation. Suppose we want to know the wavelength at point
P. The trick is to construct a sine wave, like the one shown with the
dashed line, which matches the curvature of the actual wavefunction
as closely as possible near P. The sine wave that matches as well as
possible is called the "osculating" curve, from a Latin word meaning
"to kiss." The wavelength of the osculating curve is the wavelength
that will relate correctly to conservation of energy.


r+   r


t   t


'    t


P


I / A typical wavefunction of
and the osculating sine wave
at point P.


an electron in an atom (heavy curve)
(dashed curve) that matches its curvature


2


m / 1.   Kinks  like this
happen. 2. The wave act
penetrates into the class
forbidden region.


         Tunneling
             We implicitly assumed that the particle-in-a-box wavefunction
         would cut off abruptly at the sides of the box, m/1, but that would
         be unphysical. A kink has infinite curvature, and curvature is related
         to energy, so it can't be infinite. A physically realistic wavefunction
         must always "tail off" gradually, m/2. In classical physics, a particle
         can never enter a region in which its potential energy would be
         greater than the amount of energy it has available. But in quantum
don't    physics the wavefunction will always have a tail that reaches into
ually    the classically forbidden region. If it was not for this effect, called
ically
         tunneling, the fusion reactions that power the sun would not occur
         due to the high potential energy that nuclei need in order to get
         close together! Tunneling is discussed in more detail in the next
         section.


         4.6 f* The Schrodinger Equation
         In section 4.5 we were able to apply conservation of energy to an elec-
         tron's wavefunction, but only by using the clumsy graphical tech-
         nique of osculating sine waves as a measure of the wave's curvature.
         You have learned a more convenient measure of curvature in calcu-
         lus: the second derivative. To relate the two approaches, we take
         the second derivative of a sine wave:
                         d2  .   2x\      d /2,      2x
                         dxsmnKA        dxKA     cos A
                                            (27\2     27x
                                              -    smA


102


Chapter 4   Matter as a Wave


﻿


Taking the second derivative gives us back the same function, but
with a minus sign and a constant out in front that is related to
the wavelength. We can thus relate the second derivative to the
osculating wavelength:

                       d2P       2w 2
[1]                    d2     -A
                        d2

This could be solved for A in terms of T, but it will turn out to be
more convenient to leave it in this form.
   Using conservation of energy, we have

                       E=KE+PE
                       E= K + PE
                           _P2
[2]                         2m+PE

                         _ (h/A)2 +PE
                             2m


Note that both equation [1] and equation [2] have A2 in the denom-
inator. We can simplify our algebra by multiplying both sides of
equation [2] by T to make it look more like equation [1]:


             E T - (h/A)2PEP
                       2m

                     2m  (h)2 (f22           E"
                         1h 2 2x
                         1 (h 2d2P
                       - m(2I         +PE-P
                       2m \2w dz2


Further simplification is achieved by using the symbol h (h with a
slash through it, read "h-bar") as an abbreviation for h/27. We then
have the important result known as the Schrodinger equation:


                              $d2P
                  E          m= d+PEPT
                            2m nd"-2


(Actually this is a simplified version of the Schr~dinger equation,
applying only to standing waves in one dimension.) Physically it is
a statement of conservation of energy. The total energy £ must be
constant, so the equation tells us that a change in potential energy
must be accompanied by a change in the curvature of the wavefunc-
tion. This change in curvature relates to a change in wavelength,
which corresponds to a change in momentum and kinetic energy.


Section 4.6   f * The Schrbdinger Equation


103


﻿


  self-check D
  Considering the assumptions that were made in deriving the Schrodinger
  equation, would it be correct to apply it to a photon? To an electron mov-
  ing at relativistic speeds?                           > Answer, p.
  134
    Usually we know right off the bat how the potential energy de-
pends on x, so the basic mathematical problem of quantum physics
is to find a function T (x) that satisfies the Schrodinger equation
for a given function PE(x). An equation, such as the Schrodinger
equation, that specifies a relationship between a function and its
derivatives is known as a differential equation.
    The study of differential equations in general is beyond the math-
ematical level of this book, but we can gain some important insights
by considering the easiest version of the Schrodinger equation, in
which the potential energy is constant. We can then rearrange the
Schrodinger equation as follows:


d21
dx2


2m(PE - E)
     h2


which boils down to

                    d21
                    dx2


where, according to our assumptions, a is independent of x. We need
to find a function whose second derivative is the same as the original
function except for a multiplicative constant. The only functions
with this property are sine waves and exponentials:


PE


Wi


d2
2 [ q sin(rz + s) ]


-qr2 sin(rx + s)


d2


+AAAF1 ~fVW\


X        The sine wave gives negative values of a, a = -r2, and the exponen-
         tial gives positive ones, a = r2. The former applies to the classically
         allowed region with PE < E.
             This leads us to a quantitative calculation of the tunneling ef-
 X       fect discussed briefly in the preceding subsection. The wavefunction
         evidently tails off exponentially in the classically forbidden region.
         Suppose, as shown in figure n, a wave-particle traveling to the right
bar-     encounters a barrier that it is classically forbidden to enter. Al-
         though the form of the Schrodinger equation we're using technically
         does not apply to traveling waves (because it makes no reference
         to time), it turns out that we can still use it to make a reasonable
         calculation of the probability that the particle will make it through
         the barrier. If we let the barrier's width be w, then the ratio of the


n / Tunneling
rier.


through a


104


Chapter 4  Matter as a Wave


﻿


wavefunction on the left side of the barrier to the wavefunction on
the right is
                        qerxs
                      ger(x+w)+s
Probabilities are proportional to the squares of wavefunctions, so
the probability of making it through the barrier is

                 P _ -2rw

                   = exp (         2m(PE - B))


  self-check E
  If we were to apply this equation to find the probability that a person can
  walk through a wall, what would the small value of Planck's constant
  imply?                                           > Answer, p. 134

Use of complex numbers
    In a classically forbidden region, a particle's total energy, PE +
KE, is less than its PE, so its KE must be negative. If we want to
keep believing in the equation KE = p2/2m, then apparently the
momentum of the particle is the square root of a negative number.
This is a symptom of the fact that the Schrodinger equation fails
to describe all of nature unless the wavefunction and various other
quantities are allowed to be complex numbers. In particular it is not
possible to describe traveling waves correctly without using complex
wavefunctions.
    This may seem like nonsense, since real numbers are the only
ones that are, well, real! Quantum mechanics can always be re-
lated to the real world, however, because its structure is such that
the results of measurements always come out to be real numbers.
For example, we may describe an electron as having non-real mo-
mentum in classically forbidden regions, but its average momentum
will always come out to be real (the imaginary parts average out to
zero), and it can never transfer a non-real quantity of momentum
to another particle.
    A complete investigation of these issues is beyond the scope of
this book, and this is why we have normally limited ourselves to
standing waves, which can be described with real-valued wavefunc-
tions.


Section 4.6   f * The Schrbdinger Equation


105


﻿


Summary
Selected Vocabulary
wavefunction . . the numerical measure of an electron wave, or
                     in general of the wave corresponding to any
                     quantum mechanical particle
Notation
h . . . . . . . . . . Planck's constant divided by 27r (used only in
                     optional section 4.6)
 . . . . . . . . .   the wavefunction of an electron
 Summary
    Light is both a particle and a wave. Matter is both a particle and
a wave. The equations that connect the particle and wave properties
are the same in all cases:

                            E=hf
                            p = h/A

Unlike the electric and magnetic fields that make up a photon-
wave, the electron wavefunction is not directly measurable. Only
the square of the wavefunction, which relates to probability, has
direct physical significance.
    A particle that is bound within a certain region of space is a
standing wave in terms of quantum physics. The two equations
above can then be applied to the standing wave to yield some im-
portant general observations about bound particles:


   1. The particle's energy is quantized (can only have certain val-
      ues).

   2. The particle has a minimum energy.

   3. The smaller the space in which the particle is confined, the
      higher its kinetic energy must be.


These immediately resolve the difficulties that classical physics had
encountered in explaining observations such as the discrete spectra
of atoms, the fact that atoms don't collapse by radiating away their
energy, and the formation of chemical bonds.
    A standing wave confined to a small space must have a short
wavelength, which corresponds to a large momentum in quantum
physics. Since a standing wave consists of a superposition of two
traveling waves moving in opposite directions, this large momentum
should actually be interpreted as an equal mixture of two possible
momenta: a large momentum to the left, or a large momentum to
the right. Thus it is not possible for a quantum wave-particle to
be confined to a small space without making its momentum very


106


Chapter 4  Matter as a Wave


﻿


uncertain. In general, the Heisenberg uncertainty principle states
that it is not possible to know the position and momentum of a
particle simultaneously with perfect accuracy. The uncertainties in
these two quantities must satisfy the approximate inequality

                         ApAx> h

    When an electron is subjected to electric forces, its wavelength
cannot be constant. The "wavelength" to be used in the equation
p = h/A should be thought of as the wavelength of the sine wave
that most closely approximates the curvature of the wavefunction
at a specific point.
    Infinite curvature is not physically possible, so realistic wave-
functions cannot have kinks in them, and cannot just cut off abruptly
at the edge of a region where the particle's energy would be in-
sufficient to penetrate according to classical physics. Instead, the
wavefunction "tails off" in the classically forbidden region, and as a
consequence it is possible for particles to "tunnel" through regions
where according to classical physics they should not be able to pen-
etrate. If this quantum tunneling effect did not exist, there would
be no fusion reactions to power our sun, because the energies of
the nuclei would be insufficient to overcome the electrical repulsion
between them.
Exploring Further
The New World of Mr. Tompkins: George Gamow's Clas-
sic Mr. Tompkins in Paperback, George Gamow. Mr. Tomp-
kins finds himself in a world where the speed of light is only 30 miles
per hour, making relativistic effects obvious. Later parts of the book
play similar games with Planck's constant.
The First Three Minutes: A Modern View of the Origin of
the Universe, Steven Weinberg. Surprisingly simple ideas allow
us to understand the infancy of the universe surprisingly well.
Three Roads to Quantum Gravity, Lee Smolin. The great-
est embarrassment of physics today is that we are unable to fully
reconcile general relativity (the theory of gravity) with quantum
mechanics. This book does a good job of introducing the lay reader
to a difficult, speculative subject, and showing that even though
we don't have a full theory of quantum gravity, we do have a clear
outline of what such a theory must look like.


Summary               107


﻿


Problems


Key
     A computerized answer check is available online.
 f A problem that requires calculus.
 * A difficult problem.
 1 In a television, suppose the electrons are accelerated from rest
 through a voltage difference of 104 V. What is their final wavelength?


 2      Use the Heisenberg uncertainty principle to estimate the
 minimum velocity of a proton or neutron in a 208Pb nucleus, which
 has a diameter of about 13 fm (1 fm = 10-15m ). Assume that
 the speed is nonrelativistic, and then check at the end whether this
 assumption was warranted.                                      V


 3     A free electron that contributes to the current in an ohmic
material typically has a speed of 105 m/s (much greater than the
drift velocity).
(a) Estimate its de Broglie wavelength, in nm.                  V
(b) If a computer memory chip contains 108 electric circuits in a
1 cm2 area, estimate the linear size, in nm, of one such circuit. V
(c) Based on your answers from parts a and b, does an electrical
engineer designing such a chip need to worry about wave effects
such as diffraction?
(d) Estimate the maximum number of electric circuits that can fit on
a 1 cm2 computer chip before quantum-mechanical effects become
important.
4     On page 96, I discussed the idea of hooking up a video cam-
era to a visible-light microscope and recording the trajectory of an
electron orbiting a nucleus. An electron in an atom typically has a
speed of about 1% of the speed of light.
(a) Calculate the momentum of the electron.
(b) When we make images with photons, we can't resolve details
that are smaller than the photons' wavelength. Suppose we wanted
to map out the trajectory of the electron with an accuracy of 0.01
nm. What part of the electromagnetic spectrum would we have to
use?
(c) As found in homework problem 10 on page 40, the momentum of
a photon is given by p = E/c. Estimate the momentum of a photon
of having the necessary wavelength.V
(d) Comparing your answers from parts a and c, what would be the
effect on the electron if the photon bounced off of it? What does
this tell you about the possibility of mapping out an electron's orbit
around a nucleus?


5 Find the energy of a particle in a one-dimensional box of length


108


Chapter 4  Matter as a Wave


﻿


L, expressing your result in terms of L, the particle's mass m, the
number of peaks and valleys n in the wavefunction, and fundamental
constants.


6     The Heisenberg uncertainty principle, ApAz x> h, can only be
made into a strict inequality if we agree on a rigorous mathematical
definition of Ax and Ap. Suppose we define the deltas in terms of the
full width at half maximum (FWHM), which we first encountered in
Vibrations and Waves, and revisited on page 51 of this book. Now
consider the lowest-energy state of the one-dimensional particle in
a box. As argued on page 97, the momentum has equal probability
of being h/L or -h/L, so the FWHM definition gives Ap = 2h/L.
(a) Find Ax using the FWHM definition. Keep in mind that the
probability distribution depends on the square of the wavefunction.
(b) Find AxAp.
7     If x has an average value of zero, then the standard deviation
of the probability distribution D(x) is defined by

                    o2 =    JD(x)x2dx       ,


where the integral ranges over all possible values of x.
Interpretation: if x only has a high probability of having values close
to the average (i.e., small positive and negative values), the thing
being integrated will always be small, because x2 is always a small
number; the standard deviation will therefore be small. Squaring
x makes sure that either a number below the average (x < 0) or a
number above the average (x > 0) will contribute a positive amount
to the standard deviation. We take the square root of the whole
thing so that it will have the same units as x, rather than having
units of 2.
Redo problem 6 using the standard deviation rather than the FWHM.
Hints: (1) You need to determine the amplitude of the wave based
on normalization. (2) You'll need the following definite integral:
a_ u2 cos2 udu = (7r3 - 67r)/24.                         V   f
8      In section 4.6 we derived an expression for the probability
that a particle would tunnel through a rectangular potential bar-
rier. Generalize this to a barrier of any shape. [Hints: First try
generalizing to two rectangular barriers in a row, and then use a
series of rectangular barriers to approximate the actual curve of an
arbitrary potential. Note that the width and height of the barrier in
the original equation occur in such a way that all that matters is the
area under the PE-versus-z curve. Show that this is still true for
a series of rectangular barriers, and generalize using an integral.] If
you had done this calculation in the 1930's you could have become
a famous physicist.                                         1 *


Problems              109


﻿


110          Chapter 4 Matter as aWave


﻿


                                                                    A wavefunction of an electron in a
                                                                    hydrogen atom.


Chapter 5


The Atom


You can learn a lot by taking a car engine apart, but you will have
learned a lot more if you can put it all back together again and make
it run. Half the job of reductionism is to break nature down into
its smallest parts and understand the rules those parts obey. The
second half is to show how those parts go together, and that is our
goal in this chapter. We have seen how certain features of all atoms
can be explained on a generic basis in terms of the properties of
bound states, but this kind of argument clearly cannot tell us any
details of the behavior of an atom or explain why one atom acts
differently from another.
   The biggest embarrassment for reductionists is that the job of
putting things back together is usually much harder than the taking
them apart. Seventy years after the fundamentals of atomic physics
were solved, it is only beginning to be possible to calculate accu-
rately the properties of atoms that have many electrons. Systems
consisting of many atoms are even harder. Supercomputer manu-
facturers point to the folding of large protein molecules as a process
whose calculation is just barely feasible with their fastest machines.
The goal of this chapter is to give a gentle and visually oriented
guide to some of the simpler results about atoms.


111


﻿


5.1 Classifying States
We'll focus our attention first on the simplest atom, hydrogen, with
one proton and one electron. We know in advance a little of what
we should expect for the structure of this atom. Since the electron
is bound to the proton by electrical forces, it should display a set
of discrete energy states, each corresponding to a certain standing
wave pattern. We need to understand what states there are and
what their properties are.
   What properties should we use to classify the states? The most
sensible approach is to used conserved quantities. Energy is one
conserved quantity, and we already know to expect each state to
have a specific energy. It turns out, however, that energy alone is
not sufficient. Different standing wave patterns of the atom can
have the same energy.
   Momentum is also a conserved quantity, but it is not particularly
appropriate for classifying the states of the electron in a hydrogen
atom. The reason is that the force between the electron and the pro-
ton results in the continual exchange of momentum between them.
(Why wasn't this a problem for energy as well? Kinetic energy and
momentum are related by KE = p2/2m, so the much more mas-
sive proton never has very much kinetic energy. We are making an
approximation by assuming all the kinetic energy is in the electron,
but it is quite a good approximation.)
   Angular momentum does help with classification. There is no
transfer of angular momentum between the proton and the electron,
since the force between them is a center-to-center force, producing
no torque.
   Like energy, angular momentum is quantized in quantum physics.
As an example, consider a quantum wave-particle confined to a cir-
cle, like a wave in a circular moat surrounding a castle. A sine
wave in such a "quantum moat" cannot have any old wavelength,
because an integer number of wavelengths must fit around the cir-
cumference, C, of the moat. The larger this integer is, the shorter
the wavelength, and a shorter wavelength relates to greater momen-
tum and angular momentum. Since this integer is related to angular
momentum, we use the symbol e for it:
                                 C

The angular momentum is
                          L =rp      .
Here, r = C/27, and p = h/A = ht/C, so
                               C hf
                          L = 2
                              2w C
                              hf
                              2w


a / Eight wavelengths fit around
this circle: t = 8.


112            Chapter 5    The Atom


﻿


   In the example of the quantum moat, angular momentum is
quantized in units of h/27, and this turns out to be a completely
general fact about quantum physics. That makes h/27 a pretty
important number, so we define the abbreviation h = h/27. This
symbol is read "h-bar."

   quantization of angular momentum
The angular momentum of a particle due to its motion through
space is quantized in units of h.


  self-check A
  What is the angular momentum of the wavefunction shown on page
  111?                                          D Answer, p. 134


5.2 Angular Momentum in Three Dimensions
Up until now we've only worked with angular momentum in the
context of rotation in a plane, for which we could simply use pos-
itive and negative signs to indicate clockwise and counterclockwise
directions of rotation. A hydrogen atom, however, is unavoidably
three-dimensional. Let's first consider the generalization of angu-
lar momentum to three dimensions in the classical case, and then
consider how it carries over into quantum physics.

Three-dimensional angular momentum in classical physics                             L
   If we are to completely specify the angular momentum of a clas-
sical object like a top, b, in three dimensions, it's not enough to
say whether the rotation is clockwise or counterclockwise. We must
also give the orientation of the plane of rotation or, equivalently, the
direction of the top's axis. The convention is to specify the direction
of the axis. There are two possible directions along the axis, and
as a matter of convention we use the direction such that if we sight
along it, the rotation appears clockwise.
                                                                   b / The  angular   momentum
   Angular momentum can, in fact, be defined as a vector pointing vector of a spinning top.
along this direction. This might seem like a strange definition, since
nothing actually moves in that direction, but it wouldn't make sense
to define the angular momentum vector as being in the direction of
motion, because every part of the top has a different direction of
motion. Ultimately it's not just a matter of picking a definition
that is convenient and unambiguous: the definition we're using is
the only one that makes the total angular momentum of a system a
conserved quantity if we let "total" mean the vector sum.
   As with rotation in one dimension, we cannot define what we
mean by angular momentum in a particular situation unless we pick
a point as an axis. This is really a different use of the word "axis"
than the one in the previous paragraphs. Here we simply mean a
point from which we measure the distance r. In the hydrogen atom,


Section 5.2  Angular Momentum in Three Dimensions


113


﻿


Z


  y


-'E~z~-- x


Z


       the nearly immobile proton provides a natural choice of axis.

       Three-dimensional angular momentum in quantum physics
          Once we start to think more carefully about the role of angular
       momentum in quantum physics, it may seem that there is a basic
       problem: the angular momentum of the electron in a hydrogen atom
       depends on both its distance from the proton and its momentum,
       so in order to know its angular momentum precisely it would seem
       we would need to know both its position and its momentum simul-
       taneously with good accuracy. This, however, might seem to be
1      forbidden by the Heisenberg uncertainty principle.
          Actually the uncertainty principle does place limits on what can
       be known about a particle's angular momentum vector, but it does
       not prevent us from knowing its magnitude as an exact integer mul-
       tiple of h. The reason is that in three dimensions, there are really
       three separate uncertainty principles:


                                ApAx  h
                                ApyAy > h
                                ApzAz > h

          Now consider a particle, c/i, that is moving along the x axis at
2      position x and with momentum px. We may not be able to know
       both x and px with unlimited accuracy, but we can still know the
       particle's angular momentum about the origin exactly. Classically,
       it is zero, because the particle is moving directly away from the
       origin: if it was to be nonzero, we would need both a nonzero x and
       a nonzero py. In quantum terms, the uncertainty principle does not
       place any constraint on AxApy.
          Suppose, on the other hand, a particle finds itself, as in figure
       c/2, at a position x along the x axis, and it is moving parallel to the
       y axis with momentum py. It has angular momentum xpy about the
       z axis, and again we can know its angular momentum with unlimited
nd     accuracy, because the uncertainty principle on relates x to px and y
m.     to py. It does not relate x to py.
lar       As shown by these examples, the uncertainty principle does
       not restrict the accuracy of our knowledge of angular momenta as
       severely as might be imagined. However, it does prevent us from
       knowing all three components of an angular momentum vector si-
       multaneously. The most general statement about this is the follow-
       ing theorem, which we present without proof:

          the angular momentum vector in quantum physics
       The most that can be known about an angular momentum vector
       is its magnitude and one of its three vector components. Both are
       quantized in units of h.


          Y


                        >X


c / 1. This particle is movi
directly away from the axis, a
has  no   angular momentu
2.   This particle has angu
momentum.


114            Chapter 5   The Atom


﻿


5.3 The Hydrogen Atom
Deriving the wavefunctions of the states of the hydrogen atom from
first principles would be mathematically too complex for this book,
but it's not hard to understand the logic behind such a wavefunction
in visual terms. Consider the wavefunction from the beginning of
the chapter, which is reproduced below. Although the graph looks
three-dimensional, it is really only a representation of the part of the
wavefunction lying within a two-dimensional plane. The third (up-
down) dimension of the plot represents the value of the wavefunction
at a given point, not the third dimension of space. The plane chosen
for the graph is the one perpendicular to the angular momentum
vector.


d / A wavefunction of a hydrogen
atom.


   Each ring of peaks and valleys has eight wavelengths going around
in a circle, so this state has L = 8h, i.e., we label it e = 8. The wave-
length is shorter near the center, and this makes sense because when
the electron is close to the nucleus it has a lower PE, a higher KE,
and a higher momentum.
   Between each ring of peaks in this wavefunction is a nodal cir-
cle, i.e., a circle on which the wavefunction is zero. The full three-
dimensional wavefunction has nodal spheres: a series of nested spher-
ical surfaces on which it is zero. The number of radii at which nodes
occur, including r = oc, is called n, and n turns out to be closely
related to energy. The ground state has n  1 (a single node only
at r   oc), and higher-energy states have higher n values. There
is a simple equation relating n to energy, which we will discuss in
section 5.4.
   The numbers n and f, which identify the state, are called its
 quantum numbers. A state of a given n and e can be oriented
 in a variety of directions in space. We might try to indicate the


u b dn=3


n=2


21
C


                n=1
e /The energy of a state in
the hydrogen atom depends only
on its n quantum number.


Section 5.3  The Hydrogen Atom


115


﻿


orientation using the three quantum numbers f, = L,/h, fy = LY /h,
and fz = Lz/h. But we have already seen that it is impossible to
know all three of these simultaneously. To give the most complete
possible description of a state, we choose an arbitrary axis, say the
z axis, and label the state according to n, f, and fz.


                                              z


n=1, t=O


y


x


z


a


n=2, 1=0


y


z


n=2, 1=1


y


r 0/70/77e ter


x


f / The three lowest-energy states of hydrogen.


   Angular momentum requires motion, and motion implies kinetic
energy. Thus it is not possible to have a given amount of angular
momentum without having a certain amount of kinetic energy as
well. Since energy relates to the n quantum number, this means
that for a given n value there will be a maximum possible f. It


116            Chapter 5    The Atom


﻿


turns out that this maximum value of £ equals n - 1.

    In general, we can list the possible combinations of quantum
numbers as follows:
     n can equal 1, 2, 3, .. .
     £ can range from 0 to n - 1, in steps of 1
     4zcan range from -E to £, in steps of 1
     Applying these rules, we have the following list of states:
     n = 1, £ = 0, 4=0                  one state
     n = 2, £=0, £=O                    one state
     n = 2, £ = 1, 4z = -1, 0, or 1 three states


  self-check B
  Continue the list for n = 3.                      > Answer, p. 134
    Figure f shows the lowest-energy states of the hydrogen atom.
The left-hand column of graphs displays the wavefunctions in the
x - y plane, and the right-hand column shows the probability dis-
tribution in a three-dimensional representation.

Discussion Questions
A     The quantum number n is defined as the number of radii at which
the wavefunction is zero, including r = oo. Relate this to the features of
the figures on the facing page.
B     Based on the definition of n, why can't there be any such thing as
an n = 0 state?
C    Relate the features of the wavefunction plots in figure f to the corre-
sponding features of the probability distribution pictures.
D    How can you tell from the wavefunction plots in figure f which ones
have which angular momenta?
E    Criticize the following incorrect statement: "The £ = 8 wavefunction
in figure d has a shorter wavelength in the center because in the center
the electron is in a higher energy level."
F    Discuss the implications of the fact that the probability cloud in of the
n = 2, £ = 1 state is split into two parts.


5.4 * Energies of States in Hydrogen

History
    The experimental technique for measuring the energy levels of
an atom accurately is spectroscopy: the study of the spectrum of
light emitted (or absorbed) by the atom. Only photons with cer-
tain energies can be emitted or absorbed by a hydrogen atom, for
example, since the amount of energy gained or lost by the atom
must equal the difference in energy between the atom's initial and
final states. Spectroscopy had actually become a highly developed
art several decades before Einstein even proposed the photon, and


Section 5.4   * Energies of States in Hydrogen


117


﻿


the Swiss spectroscopist Johann Balmer determined in 1885 that
there was a simple equation that gave all the wavelengths emitted
by hydrogen. In modern terms, we think of the photon wavelengths
merely as indirect evidence about the underlying energy levels of
the atom, and we rework Balmer's result into an equation for these
atomic energy levels:

                            2.2 x 10-18 j
                     En           2

This energy includes both the kinetic energy of the electron and
the electrical energy. The zero-level of the electrical energy scale
is chosen to be the energy of an electron and a proton that are
infinitely far apart. With this choice, negative energies correspond
to bound states and positive energies to unbound ones.
    Where does the mysterious numerical factor of 2.2 x 10-18 J come
from? In 1913 the Danish theorist Niels Bohr realized that it was
exactly numerically equal to a certain combination of fundamental
physical constants:

                             mk2e4    1
                       En     2h2    n2

where m is the mass of the electron, and k is the Coulomb force
constant for electric forces.
    Bohr was able to cook up a derivation of this equation based on
the incomplete version of quantum physics that had been developed
by that time, but his derivation is today mainly of historical interest.
It assumes that the electron follows a circular path, whereas the
whole concept of a path for a particle is considered meaningless in
our more complete modern version of quantum physics. Although
Bohr was able to produce the right equation for the energy levels,
his model also gave various wrong results, such as predicting that
the atom would be flat, and that the ground state would have £ = 1
rather than the correct £ = 0.

Approximate treatment
    A full and correct treatment is impossible at the mathematical
level of this book, but we can provide a straightforward explanation
for the form of the equation using approximate arguments.

    A typical standing-wave pattern for the electron consists of a
central oscillating area surrounded by a region in which the wave-
function tails off. As discussed in section 4.6, the oscillating type
of pattern is typically encountered in the classically allowed region,
while the tailing off occurs in the classically forbidden region where
the electron has insufficient kinetic energy to penetrate according to
classical physics. We use the symbol r for the radius of the spherical
boundary between the classically allowed and classically forbidden
regions. Classically, r would be the distance from the proton at


118             Chapter 5    The Atom


﻿


which the electron would have to stop, turn around, and head back
in.
    If r had the same value for every standing-wave pattern, then
we'd essentially be solving the particle-in-a-box problem in three
dimensions, with the box being a spherical cavity. Consider the
energy levels of the particle in a box compared to those of the hy-
drogen atom, g. They're qualitatively different. The energy levels
of the particle in a box get farther and farther apart as we go higher
in energy, and this feature doesn't even depend on the details of
whether the box is two-dimensional or three-dimensional, or its ex-
act shape. The reason for the spreading is that the box is taken to
be completely impenetrable, so its size, r, is fixed. A wave pattern
with n humps has a wavelength proportional to r/n, and therefore a
momentum proportional to n, and an energy proportional to n2. In
the hydrogen atom, however, the force keeping the electron bound
isn't an infinite force encountered when it bounces off of a wall, it's
the attractive electrical force from the nucleus. If we put more en-
ergy into the electron, it's like throwing a ball upward with a higher
energy     it will get farther out before coming back down. This
means that in the hydrogen atom, we expect r to increase as we go
to states of higher energy. This tends to keep the wavelengths of
the high energy states from getting too short, reducing their kinetic
energy. The closer and closer crowding of the energy levels in hydro-
gen also makes sense because we know that there is a certain energy
that would be enough to make the electron escape completely, and
therefore the sequence of bound states cannot extend above that
energy.
    When the electron is at the maximum classically allowed distance
r from the proton, it has zero kinetic energy. Thus when the electron
is at distance r, its energy is purely electrical:


particle in a box hydrogen atom

g / The  energy   levels of   a
particle in a box, contrasted with
those of the hydrogen atom.


[1]


       ke2
E_=
        r


Now comes the approximation. In reality, the electron's wavelength
cannot be constant in the classically allowed region, but we pretend
that it is. Since n is the number of nodes in the wavefunction, we
can interpret it approximately as the number of wavelengths that
fit across the diameter 2r. We are not even attempting a derivation
that would produce all the correct numerical factors like 2 and r
and so on, so we simply make the approximation


[2]


r


Finally we assume that the typical kinetic energy of the electron is
on the same order of magnitude as the absolute value of its total
energy. (This is true to within a factor of two for a typical classical
system like a planet in a circular orbit around the sun.) We then


Section 5.4   * Energies of States in Hydrogen


119


﻿


have


[3]


absolute value of total energy
           ke2
           T,


= p2/2m
  (h/A)2/2m
h2n2/2mr2


We now solve the equation ke2/r ~ h2n2/2mr2 for r and throw
away numerical factors we can't hope to have gotten right, yielding


[4]


    h2n2
r' ~
    mke2


   Plugging n = 1 into this equation gives r = 2 nm, which is indeed
on the right order of magnitude. Finally we combine equations [4]
and [1] to find
                             mk2e4
                       E ~ -h2n2
which is correct except for the numerical factors we never aimed to
find.
Discussion Questions
A    States of hydrogen with n greater than about 10 are never observed
in the sun. Why might this be?
B    Sketch graphs of r and E versus n for the hydrogen, and compare
with analogous graphs for the one-dimensional particle in a box.


5.5 Electron Spin
It's disconcerting to the novice ping-pong player to encounter for
the first time a more skilled player who can put spin on the ball.
Even though you can't see that the ball is spinning, you can tell
something is going on by the way it interacts with other objects in
its environment. In the same way, we can tell from the way electrons
interact with other things that they have an intrinsic spin of their
own. Experiments show that even when an electron is not moving
through space, it still has angular momentum amounting to h/2.
   This may seem paradoxical because the quantum moat, for in-
stance, gave only angular momenta that were integer multiples of
h, not half-units, and I claimed that angular momentum was al-
ways quantized in units of h, not just in the case of the quantum
moat. That whole discussion, however, assumed that the angular
momentum would come from the motion of a particle through space.
The h/2 angular momentum of the electron is simply a property of
the particle, like its charge or its mass. It has nothing to do with


                   spin, s

                orbital angular
                momentum, R


h / The top has angular mo-
mentum both because of the
motion of its center of mass
through space and due to its
internal rotation. Electron spin is
roughly analogous to the intrinsic
spin of the top.


120            Chapter 5    The Atom


﻿


whether the electron is moving or not, and it does not come from any
internal motion within the electron. Nobody has ever succeeded in
finding any internal structure inside the electron, and even if there
was internal structure, it would be mathematically impossible for it
to result in a half-unit of angular momentum.
    We simply have to accept this h/2 angular momentum, called
the "spin" of the electron  Mother Nature rubs our noses in it as
an observed fact.  Protons and neutrons have the same h/2 spin,
while photons have an intrinsic spin of h. In general, half-integer
spins are typical of material particles. Integral values are found for
the particles that carry forces: photons, which embody the electric
and magnetic fields of force, as well as the more exotic messengers
of the nuclear and gravitational forces.
    As was the case with ordinary angular momentum, we can de-
scribe spin angular momentum in terms of its magnitude, and its
component along a given axis. We notate these quantities, in units
of h, as s and sz, so an electron has s = 1/2 and sz = +1/2 or -1/2.
    Taking electron spin into account, we need a total of four quan-
tum numbers to label a state of an electron in the hydrogen atom: n,
£, £z, and sz. (We omit s because it always has the same value.) The
symbols £ and 4z include only the angular momentum the electron
has because it is moving through space, not its spin angular mo-
mentum. The availability of two possible spin states of the electron
leads to a doubling of the numbers of states:
n = 1, £ = 0, Ez = 0,               sz = +1/2 or -1/2    two states
n = 2, £ = 0, Ez = 0,               sz = +1/2 or -1/2    two states
n = 2, £ = 1, £z = -1, 0, or 1, sz = +1/2 or -1/2  six states


5i6 Atoms With More Than One Electron
What about other atoms besides hydrogen? It would seem that
things would get much more complex with the addition of a second
electron. A hydrogen atom only has one particle that moves around
much, since the nucleus is so heavy and nearly immobile. Helium,
with two, would be a mess. Instead of a wavefunction whose square
tells us the probability of finding a single electron at any given lo-
cation in space, a helium atom would need to have a wavefunction
whose square would tell us the probability of finding two electrons
at any given combination of points. Ouch! In addition, we would
have the extra complication of the electrical interaction between the
two electrons, rather than being able to imagine everything in terms
of an electron moving in a static field of force created by the nucleus
alone.

    Despite all this, it turns out that we can get a surprisingly good
description of many-electron atoms simply by assuming the elec-


Section 5.6   Atoms With More Than One Electron


121


﻿


trons can occupy the same standing-wave patterns that exist in a
hydrogen atom. The ground state of helium, for example, would
have both electrons in states that are very similar to the n = 1
states of hydrogen. The second-lowest-energy state of helium would
have one electron in an n = 1 state, and the other in an n = 2 states.
The relatively complex spectra of elements heavier than hydrogen
can be understood as arising from the great number of possible com-
binations of states for the electrons.
    A surprising thing happens, however, with lithium, the three-
electron atom. We would expect the ground state of this atom to
be one in which all three electrons settle down into n = 1 states.
What really happens is that two electrons go into n = 1 states, but
the third stays up in an n = 2 state. This is a consequence of a new
principle of physics:

    the Pauli exclusion principle
 Only one electron can ever occupy a given state.


    There are two n = 1 states, one with sz = +1/2 and one with
sz = -1/2, but there is no third n = 1 state for lithium's third
electron to occupy, so it is forced to go into an n = 2 state.
    It can be proved mathematically that the Pauli exclusion prin-
ciple applies to any type of particle that has half-integer spin. Thus
two neutrons can never occupy the same state, and likewise for two
protons. Photons, however, are immune to the exclusion principle
because their spin is an integer. Material objects can't pass through
each other, but beams of light can, and the basic reason is that the
exclusion principle applies to one but not to the other.1


   1There are also electrical forces between atoms, but as argued on page 86,
the attractions and repulsions tend to cancel out.


122              Chapter 5   The Atom


﻿


Deriving the periodic table
    We can now account for the structure of the periodic table, which
seemed so mysterious even to its inventor Mendeleev. The first row
consists of atoms with electrons only in the n = 1 states:
  H 1 electron in an n = 1 state
  He 2 electrons in the two n = 1 states
The next row is built by filling the n = 2 energy levels:


                           0
  1   II III IV V  VI VII 2
  H                        He
  3   4  5   6   7  8   9  10
  Li Be  B   C  N   O   F  Ne
  11
  Na  ---

i/ The beginning of the peri-
odic table.


Li
Be

O
F
Ne


2 electrons in n
2 electrons in n

2 electrons in n
2 electrons in n
2 electrons in n


1 states, 1 electron in an n = 2 state
1 states, 2 electrons in n = 2 states


1 states,
1 states,
1 states,


6 electrons in n
7 electrons in n
8 electrons in n


2 states
2 states
2 states


In the third row we start in on the n = 3 levels:


Na 2 electrons in n
     electron in an n


1 states, 8 electrons in n
3 state


= 2 states, 1


We can now see a logical link between the filling of the energy levels
and the structure of the periodic table. Column 0, for example,
consists of atoms with the right number of electrons to fill all the
available states up to a certain value of n. Column I contains atoms
like lithium that have just one electron more than that.
    This shows that the columns relate to the filling of energy levels,
but why does that have anything to do with chemistry? Why, for
example, are the elements in columns I and VII dangerously reac-
tive? Consider, for example, the element sodium (Na), which is so
reactive that it may burst into flames when exposed to air. The
electron in the n = 3 state has an unusually high energy. If we let
a sodium atom come in contact with an oxygen atom, energy can
be released by transferring the n = 3 electron from the sodium to
one of the vacant lower-energy n = 2 states in the oxygen. This
energy is transformed into heat. Any atom in column I is highly
reactive for the same reason: it can release energy by giving away
the electron that has an unusually high energy.

    Column VII is spectacularly reactive for the opposite reason:
these atoms have a single vacancy in a low-energy state, so energy
is released when these atoms steal an electron from another atom.
    It might seem as though these arguments would only explain
reactions of atoms that are in different rows of the periodic table,
because only in these reactions can a transferred electron move from
a higher-n state to a lower-n state. This is incorrect. An n = 2 elec-
tron in fluorine (F), for example, would have a different energy than
an n = 2 electron in lithium (Li), due to the different number of
protons and electrons with which it is interacting. Roughly speak-
ing, the n = 2 electron in fluorine is more tightly bound (lower in


j / Hydrogen is highly reactive.


Section 5.6   Atoms With More Than One Electron


123


﻿


energy) because of the larger number of protons attracting it. The
effect of the increased number of attracting protons is only partly
counteracted by the increase in the number of repelling electrons,
because the forces exerted on an electron by the other electrons are
in many different directions and cancel out partially.
  Neutron stars                                        example 1
  Here's an exotic example that doesn't even involve atoms. When
  a star runs out of fuel for its nuclear reactions, it begins to collapse
  under its own weight. Since Newton's law of gravity depends on
  the inverse square of the distance, the gravitational forces be-
  come stronger as the star collapses, which encourages it to col-
  lapse even further. The final result depends on the mass of the
  star, but let's consider a star that's only a little more massive than
  our own sun. Such a star will collapse to the point where the grav-
  itational energy being released is sufficient to cause the reaction
  p + e- - n +-v to occur. (As you found in homework problem 7 on
  page 39, this reaction can only occur when there is some source
  of energy, because the mass-energy of the products is greater
  than the mass-energy of the things being consumed.) The neu-
  trinos fly off and are never heard from again, so we're left with a
  star consisting only of neutrons!
  Now the exclusion principle comes into play. The collapse can't
  continue indefinitely. The situation is in fact closely analogous
  to that of an atom. A lead atom's cloud of 82 electrons can't
  shrink down to the size of a hydrogen atom, because only two
  electrons can have the lowest-energy wave pattern. The same
  happens with the neutron star. No physical repulsion keeps the
  neutrons apart. They're electrically neutral, so they don't repel or
  attract one another electrically. The gravitational force is attrac-
  tive, and as the collapse proceeds to the point where the neutrons
  come within range of the strong nuclear force (~ 10-15 m), we
  even start getting nuclear attraction. The only thing that stops the
  whole process is the exclusion principle. The star ends up being
  only a few kilometers across, and has the same billion-ton-per-
  teaspoon density as an atomic nucleus. Indeed, we can think of it
  as one big nucleus (with atomic number zero, because there are
  no protons).
  As with a spinning figure skater pulling her arms in, conservation
  of angular momentum makes the star spin faster and faster. The
  whole object may end up with a rotational period of a fraction of a
  second! Such a star sends out radio pulses with each revolution,
  like a sort of lighthouse. The first time such a signal was detected,
  radio astronomers thought that it was a signal from aliens.


124             Chapter 5    The Atom


﻿


Summary
Selected Vocabulary
quantum number      a numerical label used to classify a quantum
                    state
 spin . . . . . . . .  the built-in angular momentum possessed by
                    a particle even when at rest
Notation
n . . . . . . . . . .  the number of radial nodes in the wavefunc-
                    tion, including the one at r = oc
 h........... ..    h/2,r
 L . . . . . . . . . . the angular momentum vector of a particle,
                    not including its spin
 . . . . . . . . . . the magnitude of the L vector, divided by h
 z .     .the z component of the L vector, divided by
                    h; this is the standard notation in nuclear
                    physics, but not in atomic physics
 s . . . . . . . . . . the magnitude of the spin angular momentum
                    vector, divided by h
 sz . . . . . . . . .  the z component of the spin angular momen-
                    tum vector, divided by h; this is the standard
                    notation in nuclear physics, but not in atomic
                    physics


Other Terminology
m    . . . . . . . . .


and Notation
a less obvious notation for £z, standard in
atomic physics
a less obvious notation for sz, standard in
atomic physics


Summary
   Hydrogen, with one proton and one electron, is the simplest
atom, and more complex atoms can often be analyzed to a reason-
ably good approximation by assuming their electrons occupy states
that have the same structure as the hydrogen atom's. The electron
in a hydrogen atom exchanges very little energy or angular mo-
mentum with the proton, so its energy and angular momentum are
nearly constant, and can be used to classify its states. The energy
of a hydrogen state depends only on its n quantum number.
   In quantum physics, the angular momentum of a particle moving
in a plane is quantized in units of h. Atoms are three-dimensional,
however, so the question naturally arises of how to deal with angu-
lar momentum in three dimensions. In three dimensions, angular
momentum is a vector in the direction perpendicular to the plane
of motion, such that the motion appears clockwise if viewed along
the direction of the vector. Since angular momentum depends on
both position and momentum, the Heisenberg uncertainty principle
limits the accuracy with which one can know it. The most the can


Summary               125


﻿


be known about an angular momentum vector is its magnitude and
one of its three vector components, both of which are quantized in
units of h.
    In addition to the angular momentum that an electron carries by
virtue of its motion through space, it possesses an intrinsic angular
momentum with a magnitude of h/2. Protons and neutrons also
have spins of h/2, while the photon has a spin equal to h.
    Particles with half-integer spin obey the Pauli exclusion princi-
ple: only one such particle can exist is a given state, i.e., with a
given combination of quantum numbers.

    We can enumerate the lowest-energy states of hydrogen as fol-
lows:
n= 1, £=0,         z = 0,            sz =+1/2 or -1/2     two states
n = 2, £ = 0, £z = 0,                sz = +1/2 or -1/2    two states
n = 2, £ = 1 , Ez = -1, 0, or 1, sz = +1/2 or -1/2 six states


The periodic table can be understood in terms of the filling of these
states. The nonreactive noble gases are those atoms in which the
electrons are exactly sufficient to fill all the states up to a given n
value. The most reactive elements are those with one more electron
than a noble gas element, which can release a great deal of energy
by giving away their high-energy electron, and those with one elec-
tron fewer than a noble gas, which release energy by accepting an
electron.


126             Chapter 5    The Atom


﻿


Problems


Key
V A computerized answer check is available online.
f A problem that requires calculus.
* A difficult problem.
1      (a) A distance scale is shown below the wavefunctions and
probability densities illustrated in section 5.3. Compare this with
the order-of-magnitude estimate derived in section 5.4 for the radius
r at which the wavefunction begins tailing off. Was the estimate in
section 5.4 on the right order of magnitude?
(b) Although we normally say the moon orbits the earth, actually
they both orbit around their common center of mass, which is below
the earth's surface but not at its center. The same is true of the
hydrogen atom. Does the center of mass lie inside the proton or
outside it?
2    The figure shows eight of the possible ways in which an electron
in a hydrogen atom could drop from a higher energy state to a state
of lower energy, releasing the difference in energy as a photon. Of
these eight transitions, only D, E, and F produce photons with
wavelengths in the visible spectrum.
(a) Which of the visible transitions would be closest to the violet
end of the spectrum, and which would be closest to the red end?
Explain.
(b) In what part of the electromagnetic spectrum would the photons
from transitions A, B, and C lie? What about G and H? Explain.
(c) Is there an upper limit to the wavelengths that could be emitted
by a hydrogen atom going from one bound state to another bound
state? Is there a lower limit? Explain.


3      Before the quantum theory, experimentalists noted that in
many cases, they would find three lines in the spectrum of the same
atom that satisfied the following mysterious rule: 1/A1l= 1/A2 +
1/A3. Explain why this would occur. Do not use reasoning that
only works for hydrogen  such combinations occur in the spectra
of all elements. [Hint: Restate the equation in terms of the energies
of photons.]
4      Find an equation for the wavelength of the photon emitted
when the electron in a hydrogen atom makes a transition from en-
ergy level rni to level rn2. [You will need to have read optional section
5.4.]


5 (a) Verify that Planck's constant has the same units as angular
momentum.
(b) Estimate the angular momentum of a spinning basketball, in
units of h.


     n=4
     n=3             G H
     n=2
               D E F


     n=1"
     PolmA B C

Problem 2.


Problems              127


﻿


6      Assume that the kinetic energy of an electron in the n = 1
state of a hydrogen atom is on the same order of magnitude as the
absolute value of its total energy, and estimate a typical speed at
which it would be moving. (It cannot really have a single, definite
speed, because its kinetic and potential energy trade off at different
distances from the proton, but this is just a rough estimate of a
typical speed.) Based on this speed, were we justified in assuming
that the electron could be described nonrelativistically?
7       The wavefunction of the electron in the ground state of a
hydrogen atom is

                    T= =r-1/2a-3/2e-r/a

where r is the distance from the proton, and a = h2/kme2
5.3 x 10-11 m is a constant that sets the size of the wave.
(a) Calculate symbolically, without plugging in numbers, the prob-
ability that at any moment, the electron is inside the proton. As-
sume the proton is a sphere with a radius of b = 0.5 fm. [Hint:
Does it matter if you plug in r =0 or r= b in the equation for the
wavefunction?]
(b) Calculate the probability numerically.
(c) Based on the equation for the wavefunction, is it valid to think
of a hydrogen atom as having a finite size? Can a be interpreted
as the size of the atom, beyond which there is nothing? Or is there
any limit on how far the electron can be from the proton?


8      Use physical reasoning to explain how the equation for the
energy levels of hydrogen,

                             mk2e4    1
                       En     22     g2

should be generalized to the case of a heavier atom with atomic
number Z that has had all its electrons stripped away except for
one.                                                            *
9       This question requires that you read optional section 5.4.
A muon is a subatomic particle that acts exactly like an electron
except that its mass is 207 times greater. Muons can be created by
cosmic rays, and it can happen that one of an atom's electrons is
displaced by a muon, forming a muonic atom. If this happens to
a hydrogen atom, the resulting system consists simply of a proton
plus a muon.
(a) How would the size of a muonic hydrogen atom in its ground
state compare with the size of the normal atom?
(b) If you were searching for muonic atoms in the sun or in the
earth's atmosphere by spectroscopy, in what part of the electromag-
netic spectrum would you expect to find the absorption lines?


128             Chapter 5    The Atom


﻿


10      Consider a classical model of the hydrogen atom in which
the electron orbits the proton in a circle at constant speed. In this
model, the electron and proton can have no intrinsic spin. Using
the result of problem 17 from book 4, ch. 6, show that in this
model, the atom's magnetic dipole moment Dm is related to its an-
gular momentum by Dm = (-e/2m)L, regardless of the details of
the orbital motion. Assume that the magnetic field is the same as
would be produced by a circular current loop, even though there
is really only a single charged particle. [Although the model is
quantum-mechanically incorrect, the result turns out to give the cor-
rect quantum mechanical value for the contribution to the atom's
dipole moment coming from the electron's orbital motion. There
are other contributions, however, arising from the intrinsic spins of
the electron and proton.]


Problems              129


﻿


Appendix 1: Exercises


Exercise 1A: Sports in Slowlightland

In Slowlightland, the speed of light is 20 mi/hr = 32 km/hr = 9 m/s. Think of an example of
how relativistic effects would work in sports. Things can get very complex very quickly, so try
to think of a simple example that focuses on just one of the following effects:

   " relativistic momentum

   " relativistic kinetic energy

   " relativistic addition of velocities

   " time dilation and length contraction

   " Doppler shifts of light

   " equivalence of mass and energy

   " time it takes for light to get to an athlete's eye


e deflection of light rays by gravity


﻿


Exercise 5A: Quantum Versus Classical Randomness


1. Imagine the classical version of the particle in a one-dimensional box. Suppose you insert
the particle in the box and give it a known, predetermined energy, but a random initial position
and a random direction of motion. You then pick a random later moment in time to see where
it is. Sketch the resulting probability distribution by shading on top of a line segment. Does
the probability distribution depend on energy?
2. Do similar sketches for the first few energy levels of the quantum mechanical particle in a
box, and compare with 1.
3. Do the same thing as in 1, but for a classical hydrogen atom in two dimensions, which acts
just like a miniature solar system. Assume you're always starting out with the same fixed values
of energy and angular momentum, but a position and direction of motion that are otherwise
random. Do this for L = 0, and compare with a real L = 0 probability distribution for the
hydrogen atom.
4. Repeat 3 for a nonzero value of L, say L=h.
5. Summarize: Are the classical probability distributions accurate? What qualitative features
are possessed by the classical diagrams but not by the quantum mechanical ones, or vice-versa?


131


﻿


Appendix 2: Photo Credits


Except as specifically noted below or in a parenthetical credit in the caption of a figure, all the illustrations in
this book are under my own copyright, and are copyleft licensed under the same license as the rest of the book.

    In some cases it's clear from the date that the figure is public domain, but I don't know the name of the
artist or photographer; I would be grateful to anyone who could help me to give proper credit. I have assumed
that images that come from U.S. government web pages are copyright-free, since products of federal agencies fall
into the public domain. I've included some public-domain paintings; photographic reproductions of them are not
copyrightable in the U.S. (Bridgeman Art Library, Ltd. v. Corel Corp., 36 F. Supp. 2d 191, S.D.N.Y. 1999).

    When "PSSC Physics" is given as a credit, it indicates that the figure is from the first edition of the textbook
entitled Physics, by the Physical Science Study Committee. The early editions of these books never had their
copyrights renewed, and are now therefore in the public domain. There is also a blanket permission given in
the later PSSC College Physics edition, which states on the copyright page that "The materials taken from the
original and second editions and the Advanced Topics of PSSC PHYSICS included in this text will be available
to all publishers for use in English after December 31, 1970, and in translations after December 31, 1975."

    Credits to Millikan and Gale refer to the textbooks Practical Physics (1920) and Elements of Physics (1927).
Both are public domain. (The 1927 version did not have its copyright renewed.) Since is possible that some of
the illustrations in the 1927 version had their copyrights renewed and are still under copyright, I have only used
them when it was clear that they were originally taken from public domain sources.

    In a few cases, I have made use of images under the fair use doctrine. However, I am not a lawyer, and the
laws on fair use are vague, so you should not assume that it's legal for you to use these images. In particular,
fair use law may give you less leeway than it gives me, because I'm using the images for educational purposes,
and giving the book away for free. Likewise, if the photo credit says "courtesy of ...," that means the copyright
owner gave me permission to use it, but that doesn't mean you have permission to use it.

Cover Colliding nuclei: courtesy of RHIC.  13 Einstein: "Professor Einstein's Visit to the United States," The
Scientific Monthly 12:5 (1921), p. 483, public domain.  13 Trinity test: U.S. military, public domain.  17
Michelson: 1887, public domain.   17 Lorentz: painting by Arnhemensis, public domain (Wikimedia Commons).
17 FitzGerald: before 1901, public domain. 16 Michelson-Morley experiment: Line art: redrawn from the 1887
paper by Michelson and Morley. Photo of the 1887 apparatus: public domain. Photos of the Miller experiment:
ca. 1930, believed to be public domain.   26 Colliding nuclei: courtesy of RHIC.   32 Eclipse: 1919, public
domain.     32 Newspaper headline: 1919, public domain.  43 Mount St. Helens: public-domain image by
Austin Post, USGS.    67 Ozone maps: NASA/GSFC TOMS Team.         68 Digital camera image: courtesy of
Lyman Page.     75 Diffracted photons: courtesy of Lyman Page.  99 Werner Heisenberg: ca. 1927, believed to
be public domain.  85 Wicked Witch: art by W.W. Denslow, 1900. Quote from The Wizard of Oz, L. Frank
Baum, 1900.    123 Hindenburg: Public domain product of the U.S. Navy.


﻿


Appendix 3: Hints and Solutions


Answers to Self-Checks


Answers to Self-Checks for Chapter 1
Page 19, self-check A: At v = 0, we get -y= 1, so t = T. There is no time distortion unless
the two frames of reference are in relative motion.
Page 29, self-check B: The total momentum is zero before the collision. After the collision,
the two momenta have reversed their directions, but they still cancel. Neither object has changed
its kinetic energy, so the total energy before and after the collision is also the same.
Page 35, self-check C: At v = 0, we have y = 1, so the mass-energy is mc2 as claimed. As v
approaches c, y approaches infinity, so the mass energy becomes infinite as well.

Answers to Self-Checks for Chapter 2
Page 49, self-check A: (1) Most people would think they were positively correlated, but
they could be independent. (2) These must be independent, since there is no possible physical
mechanism that could make one have any effect on the other. (3) These cannot be independent,
since dying today guarantees that you won't die tomorrow.
Page 51, self-check B: The area under the curve from 130 to 135 cm is about 3/4 of a
rectangle. The area from 135 to 140 cm is about 1.5 rectangles. The number of people in the
second range is about twice as much. We could have converted these to actual probabilities
(1 rectangle = 5 cm x 0.005 cm--1= 0.025), but that would have been pointless, because we
were just going to compare the two areas.

Answers to Self-Checks for Chapter 3
Page 73, self-check A: The axes of the graph are frequency and photon energy, so its slope
is Planck's constant. It doesn't matter if you graph eAV rather than W + eAV, because that
only changes the y-intercept, not the slope.

Answers to Self-Checks for Chapter 4
Page 89, self-check A: Wavelength is inversely proportional to momentum, so to produce a
large wavelength we would need to use electrons with very small momenta and energies. (In
practical terms, this isn't very easy to do, since ripping an electron out of an object is a violent
process, and it's not so easy to calm the electron down afterward.)
Page 98, self-check B: Under the ordinary circumstances of life, the accuracy with which
we can measure the position and momentum of an object doesn't result in a value of ApAz
that is anywhere near the tiny order of magnitude of Planck's constant. We run up against the
ordinary limitations on the accuracy of our measuring techniques long before the uncertainty
principle becomes an issue.
Page 98, self-check C: The electron wave will suffer single-slit diffraction, and spread out to


﻿


the sides after passing through the slit. Neither Ap nor Ax is zero for the diffracted wave.
Page 104, self-check D: No. The equation KE = p2/2m is nonrelativistic, so it can't be
applied to an electron moving at relativistic speeds. Photons always move at relativistic speeds,
so it can't be applied to them, either.
Page 105, self-check E: Dividing by Planck's constant, a small number, gives a large negative
result inside the exponential, so the probability will be very small.

Answers to Self-Checks for Chapter 5
Page 113, self-check A: If you trace a circle going around the center, you run into a series of
eight complete wavelengths. Its angular momentum is 8h.
Page 117, self-check B: n = 3, £ = 0, £z = 0: one state
n = 3, £ = 1, £z = -1, 0, or 1: three states
n = 3, £ = 2, £z = -2, -1, 0, 1, or 2: five states


134


Appendix 3: Hints and Solutions


﻿


Index


absorption spectrum, 94
angular momentum
    and the uncertainty principle, 113
    in three dimensions, 113
    quantization of, 112
atoms
    helium, 121
    lithium, 122
    sodium, 123
    with many electrons, 121
averages, 48
    rule for calculating, 48

Balmer, Johann, 118
black hole, 33
Bohr, Niels, 118
bound states, 93
box
    particle in a, 93

carbon-14 dating, 54
cat
    Schrodinger's, 100
chemical bonds
    quantum explanation for, 95
classical physics, 44
complex numbers
    use in quantum physics, 105
correspondence principle
    defined, 19
    for mass-energy, 35
    for relativistic addition of velocities, 28
    for relativistic momentum, 31
    for time dilation, 19
cosmic rays, 22

Darwin, Charles, 45
Davisson, C.J., 86
de Broglie, Louis, 86
decay
    exponential, 52
digital camera, 68
duality
    wave-particle, 75


Eddington, Arthur, 45
Einstein, Albert, 44, 67
    and randomness, 45
    biography, 13
electron
    as a wave, 86
    spin of, 120
    wavefunction, 89
emission spectrum, 94
energy
    equivalence to mass, 31
    quantization of for bound states, 94
Enlightenment, 45
ether, 14, 16
evolution
    randomness in, 45
exclusion principle, 122
exponential decay, 52
    rate of, 55

gamma factor, 19
garage paradox, 21
gas
    spectrum of, 94
geothermal vents, 44
Germer, L., 86
goiters, 52
group velocity, 92

half-life, 52
Heisenberg uncertainty principle, 96
    in three dimensions, 114
Heisenberg, Werner
    Nazi bomb effort, 98
    uncertainty principle, 96
helium, 121
Hertz, Heinrich, 71
Hindenburg, 123
hydrogen atom, 115
    angular momentum in, 112
    classification of states, 112
    energies of states in, 117
    energy in, 112
    L quantum number, 115


﻿


    momentum in, 112
    n quantum number, 115

independence
    statistical, 46
independent probabilities
    law of, 46
inertia
    principle of relativity, 15
iodine, 52

Jeans, James, 45

Laplace, Pierre Simon de, 43
light
    momentum of, 40
    speed of, 15

mass
    equivalence to energy, 31
mass-energy
    conservation of, 33
    correspondence principle, 35
    of a moving particle, 35
matter
    as a wave, 85
measurement in quantum physics, 100
Michelson-Morley experiment, 16
Millikan, Robert, 72
molecules
    nonexistence in classical physics, 85
momentum
    of light, 40
    relativistic, 28
muons, 22


phase velocity, 92
photoelectric effect, 70
photon
    Einstein's early theory, 70
    energy of, 72
    in three dimensions, 80
    spin of, 121
pilot wave hypothesis, 77
Planck's constant, 73
Planck, Max, 73
positron, 34
probabilities
    addition of, 47
    normalization of, 48
probability distributions
    averages of, 51
    widths of, 51
probability distributions, 50
probability interpretation, 77
protein molecules, 111
proton
    spin of, 121

quantum dot, 93
quantum moat, 112
quantum numbers, 115
    £, 115
    ez, 115
    mf, 125
    m, 125
    n, 115
    s, 121
    sz, 121
quantum physics, 44

radar, 67
radio, 67
radioactivity, 52
randomness, 45
relativity
    principle of, 15
RHIC accelerator, 25
Russell, Bertrand, 45

Schr~dinger equation, 102
Schr~dinger's cat, 100
Schr~dinger, Erwin, 100
simultaneity, 20
Sirius, 94


neutron
    spin of, 121
neutron stars, 124
Newton, Isaac, 44
normalization, 48

ozone layer, 68

particle
    definition of, 75
particle in a box, 93
path of a photon undefined, 76
Pauli exclusion principle, 122
periodic table, 123


136             Index


﻿


sodium, 123
space
    relativistic effects, 20
spectrum
    absorption, 94
    emission, 94
spin, 120
    electron's, 120
    neutron's, 121
    photon's, 121
    proton's, 121
Star Trek, 95
states
    bound, 93
supernovae, 23

Taylor, G.I., 76
time
    relativistic effects, 18
tunneling, 102
twin paradox, 23

ultraviolet light, 68
uncertainty principle, 96
    in three dimensions, 114

velocity
    addition of, 16
      relativistic, 27
    group, 92
    phase, 92
Voyager space probe, 39

wave
    definition of, 75
    dispersive, 91
wave-particle duality, 75
    pilot-wave interpretation of, 77
    probability interpretation of, 77
wavefunction
    complex numbers in, 105
    of the electron, 89
Wicked Witch of the West, 85


Index             137


﻿


Useful Data


M-                mega-   106
k-                kilo-   103
m-               milli-   10-3
pu- (Greek mu)    micro-  10-6
n-                nano-   10-9
p-                pico-   10-12
f-                femto-  10-15

(Centi-, 10-2, is used only in the centimeter.)


quantity
distance
time
mass
density
velocity
acceleration
force
pressure
energy
power
momentum
angular momentum
period
wavelength
frequency
gamma factor
probability
prob. distribution
electron wavefunction


unit
meter, m
second, s
kilogram, kg
kg/m3
m/s
m/s2
N    kg-m/s2
Pa=1 N/m2
J = kg -m2/s2
W  =1 J/s
kg -m/s
kg-m2/s or J-s
s
m
s-1 or Hz
unitless
unitless
various
m- 3/2


symbol
x, Ax
t, At
m
p
v
a
F
P
E
P
p
L
T
A
f
7
P
D


a A
#3B
y    '
3 A
E E
( Z
r/ H
o 8
LI
k K
A A
y M


alpha
beta
gamma
delta
epsilon
zeta
eta
theta
iota
kappa
lambda
mu


v

0
7r
p
a
T
V

x

w


N

0
H
P

T
Y

X

Q


nu
xi
omicron
pi
rho
sigma
tau
upsilon
phi
chi
psi
omega


body
earth
moon
sun


mass (kg)
5.97 x 1024
7.35 x 1022
1.99 x 1030


radius (km)
6.4 x 103
1.7 x 103
7.0 x 105


radius of orbit (km)
1.49 x 108
3.84 x 105


particle
electron
proton
neutron


mass (kg)
9.109 x 10-31
1.673 x 10-27
1.675 x 10-27


radius (fm)
< 0.01
N1.1
N1.1


The radii of protons and neutrons can only be given approx-
imately, since they have fuzzy surfaces. For comparison, a
typical atom is about a million fm in radius.


gravitational constant
Coulomb constant
quantum of charge
speed of light
Planck's constant


G
k


h


= 6.67 x 10-11 N-m2/kg2
8.99 x 109 N-m2/C2
1.60 x 10-19 C
3.00 x 108 m/s
6.63 x 10-34 J-s


138                Index