So if the name Ivan Pavlov rings a bell, it’s
because his experiments are among the most famous in the history of psychology. His work contributed to the foundation of
the behaviorist school of thought that viewed psychology as an empirically rigorous science
focused on observable behaviors and not unobservable internal mental processes.
Even though today we view psychology as the science of both behavior AND mental processes,
Pavlov’s influence was tremendous. His research helped pave the path for more experimental
rigor in behavioral research, right up to the present day.
Born in 1849 in Russia, Pavlov was never much for psychology. After giving up on his original
aspirations to become a Russian Orthodox priest like his father, he instead earned a medical
degree and spent nearly twenty years studying the digestive system, earning Russia’s first
Nobel Prize in his mid-50s for his research expanding our understanding of how stomachs
worked. He didn’t study human stomachs though…cause
the procedures were terrible and cruel…he studied dog stomachs. And while researching those dogs, he noticed
how the animals would salivate at a mere whiff of their dinner. At first he found all that slobber annoying,
but soon started to suspect that this behavior was actually a simple but important form of
learning. For us scholars of psychology, we can define
learning as the process of acquiring, through experience, new and relatively enduring information
or behaviors. Whether through association, observation,
or just plain thinking, learning is what allows us to adapt to our environments and to survive.
And as Pavlov began to discover, it wasn’t only humans who learned. Soon enough he was turning out his famous
series of experiments, in which he paired the presence of meat powder – yummy – which
got the dogs to drooling, with lots of different neutral stimuli — things that wouldn’t
normally make you drool, like a certain sound, a shining light, or a touch on the leg.
Then Pavlov observed, after several of these pairings, a dog would start to drool just
at the sound or the light or the touch, even if there wasn’t any slobber-inducing meat
powder around. Animals, he found, can exhibit associative
learning. That’s when a subject links certain events, behaviors, or stimuli together in
the process of conditioning. This may be the most elemental, basic form
of learning a brain can do. But that doesn’t mean that the processes behind conditioning
are, or ever were, obvious. Or, for that matter, simple. In fact, the research that’s gone into how
we’re conditioned by our environments has helped shape the science of psychology, from
a still-kinda-subjective-thought-exercise into the more rigorous discipline we know
today. And it also starred some of psychology’s
most notable, and often controversial, figures, including Pavlov, B. F. Skinner — aaaand
that guy who trained kids to be terrified of furry animals…
[INTRO] OK, I’m not a licensed dog-trainer – do
they license dog trainers? But I can break down for you the sequence of steps in Pavlov’s
famous experiment, to help you get a sense of how conditioning works:
First, before conditioning, the dog just drools when it smells food. That smell is the unconditioned
stimulus, and the slobbering, the unconditioned, or natural response. The ringing sound, which
at this point means nothing to the dog, is the neutral stimulus, and it produces no drooling.
During conditioning, the unconditioned stimulus — that food smell — is paired with the neutral
stimulus — the bell sound — and results in drooling. This is repeated many times until
the association between the two stimuli is made, in a stage called acquisition.
By the time you get to the after-conditioning phase, that old neutral stimulus has become
a conditioned stimulus, because it now elicits the conditioned response of drooling.
Sounds super simple, right? If you have a dog, you’ve probably seen it tapdance at
the sight of a leash, but in Pavlov’s day, this whole series of steps hadn’t really
been studied in a lab setting, or thought about in scientific terms.
Pavlov’s work suggested that classical conditioning — as this kind of learning came to be known
— could be an adaptive form of learning that helps an animal survive by changing its behavior
to better suit its environment. In this case, a bell means food, and food means survival.
So get ready! Not only that, but methodologically, classical
conditioning shows how a process like learning can actually be studied through direct observation
of behavior, in real time, without all those messy feelings and emotions.
This was something Pavlov especially appreciated given his disdain for “mentalistic” concepts
like consciousness and introspection championed by Freud.
Behaviorist psychologists, like Pavlov’s younger American analogues B.F. Skinner and
John B. Watson, also embraced the notion that psychology was all about objective, observable
behavior. In his 1930 book Behaviorism, he argued that
given a dozen healthy infants he could train any one of them to be a doctor, artist, lawyer,
or even a thief, regardless of their talents, tendencies, or ancestry. Whoa there, Watson! Thankfully no one gave
him any infants. In his most famous and, yes, controversial
experiment, Watson conditioned a young child, dubbed “Little Albert,” to fear a white
rat. Maybe that doesn’t sound so bad, but he accomplished this by pairing the rat with
a loud, scary noise, over and over and then demonstrated that the child’s terror could
branch out and be generalized to include other furry, white objects… like bunnies, dogs,
or even fur coats. So yeah, that’d never fly today, obviously,
but Watson’s research did make other psychologists wonder whether adults, too, were just holding
tanks of conditioned emotions — and if so, whether new conditioning could be used to
undo old conditioning. Like, if you’re terrified of roller coasters, but you made yourself
ride one ten times a day for two weeks, would your fears fade?
For the record, recent exploration has revealed that the boy known at Little Albert sadly
died a few years after these experiments, while Watson eventually left academia and
got into advertising, where he put all that associative learning to lucrative use. So that’s classical conditioning. But we’ve
also got another kind of associative learning: operant conditioning.
If classical conditioning is all about forming associations between stimuli, operant conditioning
involves associating our own behavior with consequences. The kid who gets a cookie for
saying please, or the aquarium seal that gets a sardine for balancing a ball on its nose,
they’ve both been trained with operant conditioning. The basic premise here is that behaviors increase
when followed by a reinforcement, or reward, but they decrease when followed by a punishment.
And the most well-known champion of operant conditioning is American behaviorist B.F.
Skinner. He designed the famous operant chamber, or “Skinner Box”–a confined space containing
a lever or button that an animal could touch to get some sort of reward, typically food,
along with a device that keeps track of its responses.
Okay, time for a debunking break! Other than maybe Freud, no other figure in
psychology seems to be as shrouded in lore and misinformation as B. F. Skinner. So I’m
just going to tell you straight that, no, Skinner never put any kids in this box. And
no, he didn’t raise his children without love or affection, and his daughter didn’t
hate his guts until the day she committed suicide. Deborah Skinner is alive and well,
and she loved her dad plenty. Skinner DID, however, invent something called
an air crib–a climate controlled box with a window on the front that was meant to keep
babies warm and safe while moms ran around doing their 1950’s-lady thang. It’s not
exactly where I’d like to spend the night, but it wasn’t remotely the same as the Skinner
Box. No one knows where all of these myths came
from, but being a somewhat controversial guy, Skinner had a lot of haters, some of whom
were probably happy to perpetuate misinformation. But back to the rat in the box. Basically,
the box provided an observable stage to demonstrate Skinner’s concept of reinforcement, which
is anything that increases the behavior that it follows. In other words, you push the lever,
you get a snack, and then you want to keep pushing the lever.
But most rats aren’t going to push a lever for no reason. I mean, there aren’t food-dispensing
levers in a natural environment, so operant-conditioning behavior requires shaping. Maybe you give the rat a nibble of food each
time it gets closer to the bar, then only when it touches the bar, until little by little,
in a series of successive approximations to the desired behavior, you only reward them
only when they do what you’re trying to shape them to do.
In everyday life, we’re all continually reinforcing, shaping, and refining each other’s
behaviors, both intentionally and accidentally. We do this with both positive and negative
reinforcement. Positive reinforcement obviously strengthens
responses by giving rewards after a desired event, like the rat snack after a lever push,
or getting a cookie when you say please. Negative reinforcement is a little trickier.
It’s what increases a behavior by taking away an aversive or upsetting stimulus. Like,
say, you get in your car and it does that infernal beeping thing until you fasten your
seatbelt. The car is reinforcing your seatbelt-wearing by getting rid of that horrible beeping. And
it’s good, because you should wear your seatbelt.
It’s important to recognize here that negative reinforcement is NOT the same as punishment. Punishment decreases a behavior either positively,
by say, giving a speeding ticket, or negatively, by taking away a driver’s license.
But negative reinforcement removes the punishing event to increase a behavior. So, painkillers
negatively reinforce the behavior of swallowing them by ending the headache. So by now hopefully you’re getting the picture.
There are things that we want and things that we don’t want, and we can be taught by way
of those impulses to behave certain ways. But it’s worth pointing out that conditioning
is way more complex than just the cookie and the beeping car.
For one thing ending annoyance or getting a cookie, are types of primary reinforcers–you
don’t have to learn that, they just make innate biological sense. Beeping is annoying,
cookies are delicious. But there are other kinds of reinforcers that
we only recognize after we learn to associate them with primary reinforcers. Like, a paycheck
is a conditioned reinforcer–we want money because we need food and shelter, which are
still the primary drivers. Plus, just as there are different kinds of
reinforcers, so are there various reinforcement schedules. Like, those boxed rats were getting
continuous reinforcement when they got a treat every single time they hit that lever, so
they picked it up pretty quickly. But if one day the rat chow doesn’t come,
that connection quickly dwindles, and the rat stops hitting the lever. This is a process
called extinction. And it is important, because that’s how
real life works. Outside of a Skinner box, you’re not gonna get continuous reinforcement.
All of life is a series of partial, or intermittent reinforcements, that occur only sometimes.
Learning under these conditions takes longer, but it holds up better in the long run and
is less susceptible to that extinction. So, say a cafe gives out a free cup of coffee
for every ten you buy, while another shop pours a free double shots every Tuesday morning,
and yet another has a free-coffee lottery that customers win at random. These are all
different kinds of intermittent reinforcement techniques that get customers coming back
for more. Now, Pavlov, Watson, and Skinner’s ideas
were definitely controversial — as well as the whole scary-rat experiments. Plenty of
folks disagreed with their insistence that only external influences, and not internal
thoughts and feelings, shaped behavior. It was clear to many of the behaviorists’
rivals that our cognitive processes – our thoughts, perceptions, feelings, memories
– also influence the way we learn. We’re going talk about how these other things
factor into learning next week when we look more at conditioning, cognition and observational
learning — and yeah, also watch kids beat the face iff blow-up dolls.
Today you learned about how associative learning works, the essentials of behaviorist theory,
the basic components of classical and operant conditioning, including positive and negative
reinforcement, and reinforcement scheduling. Thanks for watching this, especially to all
of our Subbable subscribers, who make this whole channel possible. If you’d like to
sponsor an episode of Crash Course, or get a special Laptop Decal, or even be animated
into an upcoming episode, just go to Subbable.com. This episode was written by Kathleen Yale,
edited by Blake de Pastino, and our consultant is Dr. Ranjit Bhagwat. Our director and editor
is Nicholas Jenkins, the script supervisor is Michael Aranda, who is also our sound designer,
and the graphics team is Thought Café.