CH 7
Learning
The topic of this chapter is learning—the relatively permanent change in knowledge or behavior
that is the result of experience. Although you might think of learning in terms of what you need
to do before an upcoming exam, the knowledge that you take away from your classes, or new
skills that you acquire through practice, these changes represent only one component of learning.
Learning is perhaps the most important human capacity. Learning allows us to create effective
lives by being able to respond to changes. We learn to avoid touching hot stoves, to find our way
home from school, and to remember which people have helped us in the past and which people
have been unkind. Without the ability to learn from our experiences, our lives would be
remarkably dangerous and inefficient. The principles of learning can also be used to explain a
wide variety of social interactions, including social dilemmas in which people make important,
and often selfish, decisions about how to behave by calculating the costs and benefits of different
outcomes.
The study of learning is closely associated with the behaviorist school of psychology, which
regarded it as a scientific alternative to the failed method of introspection. The behaviorists,
including John B. Watson and B. F. Skinner, focused their research entirely on behavior, to the
exclusion of any kinds of mental processes. For behaviorists, the fundamental aspect of learning
is the process of conditioning—the ability to connect stimuli (the changes that occur in the
environment) with responses (behaviors or other actions).
But conditioning is just one type of learning. We will also consider other types, including
learning through insight, as well as observational learning (also known as modeling). In each
case we will see not only what psychologists have learned about the topics but also the important
influence that learning has on many aspects of our everyday lives.
With his team of researchers, Pavlov began studying this process in more detail. He conducted a
series of experiments in which, over a number of trials, dogs were exposed to a sound
immediately before receiving food. He systematically controlled the onset of the sound and the
timing of the delivery of the food, and recorded the amount of the dogs’ salivation. Initially the
dogs salivated only when they saw or smelled the food, but after several pairings of the sound
and the food, the dogs began to salivate as soon as they heard the sound. The animals had learned
to associate the sound with the food that followed.
Pavlov had identified a fundamental associative learning process called classical conditioning.
Classical conditioning refers to learning that occurs when a neutral stimulus (e.g., a tone)
becomes associated with a stimulus (e.g., food) that naturally produces a behavior. After the
association is learned, the previously neutral stimulus is sufficient to produce the behavior.
As you can see in Figure 7.3 "4-Panel Image of Whistle and Dog", psychologists use specific
terms to identify the stimuli and the responses in classical conditioning.
The unconditioned stimulus (US) is something (such as food) that triggers a naturally occurring
response, and the unconditioned response (UR) is the naturally occurring response (such as
salivation) that follows the unconditioned stimulus. The conditioned stimulus (CS) is a neutral
stimulus that, after being repeatedly presented prior to the unconditioned stimulus, evokes a
response similar to the one the unconditioned stimulus produces. In Pavlov’s experiment, the sound of the tone
served as the conditioned stimulus that, after learning, produced the conditioned response (CR),
which is the acquired response to the formerly neutral stimulus. Note that the UR and the CR are
Top left: Before conditioning, the unconditioned stimulus (US) naturally produces the unconditioned response (UR).
Top right: Before conditioning, the neutral stimulus (the whistle) does not produce the salivation response. Bottom
left: The unconditioned stimulus (US), in this case the food, is repeatedly presented immediately after the neutral
stimulus. Bottom right: After learning, the neutral stimulus (now known as the conditioned stimulus, or CS) is
sufficient to produce the conditioned response (CR).
After he had demonstrated that learning could occur through association, Pavlov moved on to
study the variables that influenced the strength and the persistence of conditioning. In some
studies, after the conditioning had taken place, Pavlov presented the sound repeatedly but
Acquisition: The CS and the US are repeatedly paired together and behavior increases. Extinction: The CS is
repeatedly presented alone, and the behavior slowly decreases. Spontaneous recovery: After a pause, when the CS
is again presented alone, the behavior may again occur and then again show extinction.
Although at the end of the first extinction period the CS was no longer producing salivation, the
effects of conditioning had not entirely disappeared. Pavlov found that, after a pause, sounding
the tone again elicited salivation, although to a lesser extent than before extinction took
place. The increase in responding to the CS following a pause after extinction is known as
spontaneous recovery. When Pavlov again presented the CS alone, the behavior again showed
extinction until it disappeared again.
Although the behavior has disappeared, extinction is never complete. If conditioning is again
attempted, the animal will learn the new associations much faster than it did the first time.
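The rise and fall of the conditioned response during acquisition and extinction can be illustrated with a small simulation. The sketch below is only an illustration, not a model that Pavlov himself used. It assumes a simple error-correction rule: the strength of the CS-US association moves a fixed fraction of the way toward 1 on each trial in which the US follows the CS, and toward 0 on each trial in which the CS is presented alone. The function names, the learning rate of 0.3, and the 0-to-1 strength scale are all assumptions made for the example. This simple rule does not reproduce spontaneous recovery or the faster relearning described above.

```python
def update_strength(strength, us_present, learning_rate=0.3):
    """Apply one conditioning trial using an assumed error-correction rule.

    strength: current CS-US association on a hypothetical 0-to-1 scale.
    us_present: True if the US (e.g., food) follows the CS on this trial.
    """
    target = 1.0 if us_present else 0.0
    return strength + learning_rate * (target - strength)


def run_trials(strength, n_trials, us_present, learning_rate=0.3):
    """Run several identical trials and return the strength after each one."""
    history = []
    for _ in range(n_trials):
        strength = update_strength(strength, us_present, learning_rate)
        history.append(strength)
    return history


# Acquisition: the CS (tone) is repeatedly followed by the US (food).
acquisition = run_trials(0.0, n_trials=10, us_present=True)

# Extinction: the CS is then presented alone, starting from the learned strength.
extinction = run_trials(acquisition[-1], n_trials=10, us_present=False)

print("Acquisition:", [round(s, 2) for s in acquisition])
print("Extinction: ", [round(s, 2) for s in extinction])
```

In this sketch the association grows quickly at first and then levels off during acquisition, and it decays gradually once the CS is presented alone.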
Pavlov also experimented with presenting new stimuli that were similar, but not identical, to the
original conditioned stimulus. For instance, if the dog had been conditioned to being scratched
Saylor URL: http://www.saylor.org/books Saylor.org
before the food arrived, the stimulus would be changed to being rubbed rather than scratched. He
found that the dogs also salivated upon experiencing the similar stimulus, a process known as
generalization. Generalization refers to the tendency to respond to stimuli that resemble the
original conditioned stimulus. The ability to generalize has important evolutionary significance.
If we eat some red berries and they make us sick, it would be a good idea to think twice before
we eat some purple berries. Although the berries are not exactly the same, they nevertheless are
similar and may have the same negative properties.
In some cases, an existing conditioned stimulus can serve as an unconditioned stimulus for a
pairing with a new conditioned stimulus—a process known as second-order conditioning. In one
of Pavlov’s studies, for instance, he first conditioned the dogs to salivate to a sound, and then
repeatedly paired a new CS, a black square, with the sound. Eventually he found that the dogs
would salivate at the sight of the black square alone, even though it had never been directly
associated with the food. Secondary conditioners in everyday life include our attractions to
things that stand for or remind us of something else, such as when we feel good on a Friday
because it has become associated with the paycheck that we receive on that day, which itself is a
conditioned stimulus for the pleasures that the paycheck buys us.
Clinical psychologists make use of classical conditioning to explain the learning of a phobia—a
strong and irrational fear of a specific object, activity, or situation. For example, driving a car is
a neutral event that would not normally elicit a fear response in most people. But if a person
were to experience a panic attack in which he suddenly experienced strong negative emotions
while driving, he may learn to associate driving with the panic response. The driving has become
the CS that now creates the fear response.
Psychologists have also discovered that people do not develop phobias to just anything.
Although people may in some cases develop a driving phobia, they are more likely to develop
phobias toward objects (such as snakes, spiders, heights, and open spaces) that have been
dangerous to people in the past. In modern life, it is rare for humans to be bitten by spiders or
snakes, to fall from trees or buildings, or to be attacked by a predator in an open area. Being
injured while riding in a car or being cut by a knife are much more likely. But in our
evolutionary past, the potential of being bitten by snakes or spiders, falling out of a tree, or being
trapped in an open space were important evolutionary concerns, and therefore humans are still
evolutionarily prepared to learn these associations over others (Öhman & Mineka, 2001; LoBue
& DeLoache, 2010). [1]
Psychological Review, 108(3), 483–522; LoBue, V., & DeLoache, J. S. (2010). Superior detection of threat-relevant stimuli in
[2] Garcia, J., Kimeldorf, D. J., & Koelling, R. A. (1955). Conditioned aversion to saccharin resulting from exposure to gamma
radiation. Science, 122, 157–158; Garcia, J., Ervin, F. R., & Koelling, R. A. (1966). Learning with prolonged delay of
2. Explain how learning can be shaped through the use of reinforcement schedules and secondary reinforcers.
In classical conditioning the organism learns to associate new stimuli with natural, biological
responses such as salivation or fear. The organism does not learn something new but rather
begins to perform an existing behavior in the presence of a new signal. Operant conditioning,
on the other hand, is learning that occurs based on the consequences of behavior and can involve
the learning of new actions. Operant conditioning occurs when a dog rolls over on command
because it has been praised for doing so in the past, when a schoolroom bully threatens his
classmates because doing so allows him to get his way, and when a child gets good grades
because her parents threaten to punish her if she doesn’t. In operant conditioning the organism
learns from the consequences of its own actions.
When Thorndike placed his cats in a puzzle box, he found that they learned to engage in the
important escape behavior faster after each trial. Thorndike described the learning that follows
reinforcement in terms of the law of effect.
The most basic of Skinner’s experiments was quite similar to Thorndike’s research with cats. A
rat placed in the chamber reacted as one might expect, scurrying about the box and sniffing and
clawing at the floor and walls. Eventually the rat chanced upon a lever, which it pressed to
release pellets of food. The next time around, the rat took a little less time to press the lever, and
Skinner studied, in detail, how animals changed their behavior through reinforcement and
punishment, and he developed terms that explained the processes of operant learning (Table 7.1
"How Positive and Negative Reinforcement and Punishment Influence Behavior"). Skinner used
the term reinforcer to refer to any event that strengthens or increases the likelihood of a
behavior and the term punisher to refer to any event that weakens or decreases the likelihood of a
behavior. And he used the terms positive and negative to refer to whether a stimulus was
presented or removed, respectively. Thus positive reinforcement strengthens a response by
presenting something pleasant after the response and negative reinforcement strengthens a
response by reducing or removing something unpleasant. For example, giving a child praise for
completing his homework represents positive reinforcement, whereas taking aspirin to reduce
the pain of a headache represents negative reinforcement. In both cases, the reinforcement makes
it more likely that the behavior will occur again in the future.
Table 7.1 How Positive and Negative Reinforcement and Punishment Influence Behavior
Operant conditioning term | Description | Outcome | Example
Negative reinforcement | Reduce or remove an unpleasant stimulus | Behavior is strengthened | Taking painkillers that eliminate pain increases the likelihood that you will take painkillers again
Negative punishment | Reduce or remove a pleasant stimulus | Behavior is weakened | Taking away a teen’s computer after he misses curfew
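The four terms in Skinner's scheme follow from two yes-or-no questions: is the stimulus pleasant or unpleasant, and is it presented or removed? The short sketch below encodes that logic. The function name and its two parameters are hypothetical, chosen only for this illustration. The fourth combination, presenting something unpleasant, is not spelled out in the text above; it is inferred here from the definitions already given (positive means presented, and a punisher weakens behavior).

```python
def classify_consequence(stimulus_is_pleasant, stimulus_is_presented):
    """Name the operant conditioning term for a consequence of behavior.

    "positive" means a stimulus is presented; "negative" means it is removed.
    Behavior is strengthened (reinforcement) when something pleasant is
    presented or something unpleasant is removed; otherwise it is weakened
    (punishment).
    """
    sign = "positive" if stimulus_is_presented else "negative"
    strengthens = stimulus_is_pleasant == stimulus_is_presented
    kind = "reinforcement" if strengthens else "punishment"
    return f"{sign} {kind}"


# Praise for completing homework: pleasant stimulus, presented.
print(classify_consequence(True, True))    # positive reinforcement

# Aspirin removes headache pain: unpleasant stimulus, removed.
print(classify_consequence(False, False))  # negative reinforcement

# Taking away a teen's computer: pleasant stimulus, removed.
print(classify_consequence(True, False))   # negative punishment
```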
Although the distinction between reinforcement (which increases behavior) and punishment
(which decreases it) is usually clear, in some cases it is difficult to determine whether a
reinforcer is positive or negative. On a hot day a cool breeze could be seen as a positive
reinforcer (because it brings in cool air) or a negative reinforcer (because it removes hot air). In
other cases, reinforcement can be both positive and negative. One may smoke a cigarette both
because it brings pleasure (positive reinforcement) and because it eliminates the craving for
nicotine (negative reinforcement).
It is also important to note that reinforcement and punishment are not simply opposites. The use
of positive reinforcement in changing behavior is almost always more effective than using
punishment. This is because positive reinforcement makes the person or animal feel better,
helping create a positive relationship with the person providing the reinforcement. Types of
positive reinforcement that are effective in everyday life include verbal praise or approval, the
awarding of status or prestige, and direct financial payment. Punishment, on the other hand, is
more likely to create only temporary changes in behavior because it is based on coercion and
typically creates a negative and adversarial relationship with the person providing the
punishment. When the person who provides the punishment leaves the situation, the unwanted
behavior is likely to return.
Perhaps you remember watching a movie or being at a show in which an animal—maybe a dog,
a horse, or a dolphin—did some pretty amazing things. The trainer gave a command and the
dolphin swam to the bottom of the pool, picked up a ring on its nose, jumped out of the water
through a hoop in the air, dived again to the bottom of the pool, picked up another ring, and then
took both of the rings to the trainer at the edge of the pool. The animal was trained to do the
One way to expand the use of operant learning is to modify the schedule on which the
reinforcement is applied. To this point we have only discussed a
continuous reinforcement schedule, in which the desired response is reinforced every time it
occurs; whenever the dog rolls over, for instance, it gets a biscuit. Continuous reinforcement
results in relatively fast learning but also rapid extinction of the desired behavior once the
reinforcer disappears. The problem is that because the organism is used to receiving the
reinforcement after every behavior, the responder may give up quickly when it doesn’t appear.
Fixed-ratio | Behavior is reinforced after a specific number of responses | Factory workers who are paid according to the number of products they produce
Variable-ratio | Behavior is reinforced after an average, but unpredictable, number of responses | Payoffs from slot machines and other games of chance
Variable-interval | Behavior is reinforced for the first response after an average, but unpredictable, amount of time has passed | Person who checks voice mail for messages
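The difference between the two ratio schedules can be made concrete with a short sketch. The code below is an illustration under stated assumptions rather than a description of any laboratory procedure. The function names are hypothetical, and the variable-ratio schedule is assumed to draw each required number of responses uniformly from 1 to twice the mean minus 1, which gives the stated average. Continuous reinforcement corresponds to a fixed ratio of 1. Interval schedules, which depend on elapsed time, are not modeled here.

```python
import random


def fixed_ratio(n_responses, ratio):
    """Return the response numbers that earn a reinforcer on a fixed-ratio
    schedule: every `ratio`-th response is reinforced."""
    return [r for r in range(1, n_responses + 1) if r % ratio == 0]


def variable_ratio(n_responses, mean_ratio, rng):
    """Return the response numbers that earn a reinforcer on a variable-ratio
    schedule. The number of responses required varies unpredictably but
    averages `mean_ratio` (assumed uniform on 1 .. 2 * mean_ratio - 1)."""
    reinforced = []
    required = rng.randint(1, 2 * mean_ratio - 1)
    count = 0
    for r in range(1, n_responses + 1):
        count += 1
        if count >= required:
            reinforced.append(r)
            count = 0
            required = rng.randint(1, 2 * mean_ratio - 1)
    return reinforced


# Fixed-ratio 5: like a factory worker paid for every fifth product.
print(fixed_ratio(20, 5))  # [5, 10, 15, 20]

# Variable-ratio averaging 5: like a slot machine, the payoffs are unpredictable.
rng = random.Random(42)  # seeded only so the illustration is repeatable
print(variable_ratio(20, 5, rng))
```

Because the responder on the variable-ratio schedule cannot tell which response will pay off, a run of unreinforced responses looks no different from the ordinary gaps between reinforcers.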
Figure 7.7 Examples of Response Patterns by Animals Trained Under Different Partial Reinforcement Schedules
Schedules based on the number of responses (ratio types) induce greater response rate than do schedules based on
elapsed time (interval types). Also, unpredictable schedules (variable types) produce stronger responses than do
Website: http://wps.prenhall.com/hss_kassin_essentials_1/15/3933/1006917.cw/index.html.
Complex behaviors are also created through shaping, the process of guiding an organism’s
behavior to the desired outcome through the use of successive approximation to a final desired
behavior. Skinner made extensive use of this procedure in his boxes. For instance, he could train
a rat to press a bar two times to receive food, by first providing food when the animal moved
near the bar. Then when that behavior had been learned he would begin to provide food only
when the rat touched the bar. Further shaping limited the reinforcement to only when the rat
pressed the bar, to when it pressed the bar and touched it a second time, and finally, to only when
it pressed the bar twice. Although it can take a long time, in this way operant conditioning can
create chains of behaviors that are reinforced only when they are completed.
Reinforcing animals if they correctly discriminate between similar stimuli allows scientists to
test the animals’ ability to learn, and the discriminations that they can make are sometimes quite
remarkable. Pigeons have been trained to distinguish between images of Charlie Brown and the
other Peanuts characters (Cerella, 1980), [3] and between different styles of music and art (Porter
& Neuringer, 1984; Watanabe, Sakamoto & Wakita, 1995). [4]
[1] Thorndike, E. L. (1898). Animal intelligence: An experimental study of the associative processes in animals. Washington, DC:
[2] Thorndike, E. L. (1911). Animal intelligence: Experimental studies. New York, NY: Macmillan. Retrieved
from http://www.archive.org/details/animalintelligen00thor
[3] Cerella, J. (1980). The pigeon’s analysis of pictures. Pattern Recognition, 12, 1–6.
[4] Porter, D., & Neuringer, A. (1984). Music discriminations by pigeons. Journal of Experimental Psychology: Animal Behavior
Processes, 10(2), 138–148; Watanabe, S., Sakamoto, J., & Wakita, M. (1995). Pigeons’ discrimination of painting by Monet and
One type of learning that is not determined only by conditioning occurs when we suddenly find
the solution to a problem, as if the idea just popped into our head. This type of learning is known
Edward Tolman (Tolman & Honzik, 1930) [2] studied the behavior of three groups of rats that
were learning to navigate through mazes. The first group always received a reward of food at the
end of the maze. The second group never received any reward, and the third group received a
reward, but only beginning on the 11th day of the experimental period. As you might expect
when considering the principles of conditioning, the rats in the first group quickly learned to
negotiate the maze, while the rats of the second group seemed to wander aimlessly through it.
The rats in the third group, however, although they wandered aimlessly for the first 10 days,
quickly learned to navigate to the end of the maze as soon as they received food on day 11. By
the next day, the rats in the third group had caught up in their learning to the rats that had been
rewarded from the beginning.
It was clear to Tolman that the rats that had been allowed to experience the maze, even without
any reinforcement, had nevertheless learned something, and Tolman called this latent
learning. Latent learning refers to learning that is not reinforced and not demonstrated until
there is motivation to do so. Tolman argued that the rats had formed a “cognitive map” of the
maze but did not demonstrate this knowledge until they received reinforcement.
The idea of latent learning suggests that animals, and people, may learn simply by experiencing
or watching. Observational learning (modeling) is learning by observing the behavior of others.
To demonstrate the importance of observational learning in children, Bandura, Ross, and Ross
The researchers first let the children view one of the three types of modeling, and then let them
play in a room in which there were some really fun toys. To create some frustration in the
children, Bandura let the children play with the fun toys for only a couple of minutes before
taking them away. Then Bandura gave the children a chance to play with the Bobo doll.
If you guessed that most of the children imitated the model, you would be correct. Regardless of
which type of modeling the children had seen, and regardless of the sex of the model or the child,
the children who had seen the model behaved aggressively—just as the model had done. They
also punched, kicked, sat on the doll, and hit it with the hammer. Bandura and his colleagues had
demonstrated that these children had learned new behaviors, simply by observing and imitating
others.
Observational learning is useful for animals and for people because it allows us to learn without
having to actually engage in what might be a risky behavior. Monkeys that see other monkeys
respond with fear to the sight of a snake learn to fear the snake themselves, even if they have
been raised in a laboratory and have never actually seen a snake (Cook & Mineka, 1990). [4] As
Bandura put it,
the prospects for [human] survival would be slim indeed if one could learn only by suffering the
consequences of trial and error. For this reason, one does not teach children to swim,
adolescents to drive automobiles, and novice medical students to perform surgery by having
them discover the appropriate behavior through the consequences of their successes and
Although modeling is normally adaptive, it can be problematic for children who grow up in
violent families. These children are not only the victims of aggression, but they also see it
happening to their parents and siblings. Because children learn how to be parents in large part by
modeling the actions of their own parents, it is no surprise that there is a strong correlation
between family violence in childhood and violence as an adult. Children who witness their
parents being violent or who are themselves abused are more likely as adults to inflict abuse on
intimate partners or their children, and to be victims of intimate violence (Heyman & Slep,
2002). [6] In turn, their children are more likely to interact violently with each other and to
aggress against their parents (Patterson, Dishion, & Bank, 1984). [7]
[1] Köhler, W. (1925). The mentality of apes (E. Winter, Trans.). New York, NY: Harcourt Brace Jovanovich.
[2] Tolman, E. C., & Honzik, C. H. (1930). Introduction and removal of reward, and maze performance in rats. University of
[3] Bandura, A., Ross, D., & Ross, S. A. (1963). Imitation of film-mediated aggressive models. The Journal of Abnormal and Social
[4] Cook, M., & Mineka, S. (1990). Selective associations in the observational conditioning of fear in rhesus monkeys. Journal of
[5] Bandura, A. (1977). Self-efficacy: Toward a unifying theory of behavior change. Psychological Review, 84, 191–215.
[6] Heyman, R. E., & Slep, A. M. S. (2002). Do child abuse and interparental violence lead to adulthood family violence? Journal
[7] Patterson, G. R., Dishion, T. J., & Bank, L. (1984). Family interaction: A process model of deviancy training. Aggressive