Skip to main content icon/video/no-internet

Operant conditioning (OC) is a form of learning defined by the relationship between behavior and stimuli that follow behavior. Consequences may affect behavior by increasing it (reinforcement) or by decreasing it (punishment). Thus, OC is characterized by an increase or decrease in behavior that is attributable to past consequences of that behavior. Unlike classical conditioning, in which stimuli are presented independently of behavior, OC is defined by the fact that the learner must perform a specific behavior before some stimulus change occurs.

Conceptual Overview

Edward L. Thorndike initiated the study of instrumental conditioning, a variant of OC, at the end of the 19th century. To investigate how animals learn new behaviors, Thorndike placed a cat in a locked puzzle box. If the cat performed the correct response (e.g., pulling a rope), the door opened and the cat would get access to food located outside the cage. In initial trials, it took a long time for the cat to emit the correct response, and the response seemed to occur by chance. However, as the cat gained experience, it would make the correct response more quickly; eventually, the correct response was emitted in a matter of seconds when the animal was placed in the cage. Based on these observations, Thorndike developed the principle of the law of effect. This law states that a behavior that is followed by pleasant consequences will be associated with the situation and will thus be more likely to occur in the future when the animal is again located in the same situation. The law of effect has a second part, that behaviors that are followed by unpleasant consequences will be weakened.

Thorndike's law of effect focused on the assumed pleasure or displeasure to explain how behavioral consequences affect future behavior. In the 1930s, B. F. Skinner reformulated this understanding; he also coined the term operant conditioning. Skinner pointed out that Thorndike's interpretation of the cat's learning was mentalistic in that it referred to assumed feelings on the part of the cat, and Skinner insisted that studies of OC should focus on functional relationships between behavior and its consequences. Also, in contrast to Thorndike, Skinner based his studies on responding in situations that allowed for free repetition of the required response (free-operant). For example, a rat could press a bar and receive food pellets for pressing. In this way, operant responding could be studied as a continuous process, without the intervention of the experimenter during sessions. Thus, Skinner focused on the flow of behavior in interaction with environmental events. He formulated a number of principles to describe such behaviorenvironment relationships, all of which have been supported by later research.

Critical Commentary and Future Directions

Reinforcement

Skinner measured strength of behavior in terms of frequency of responding. Rate can change in two ways; it can increase or decrease. If rate of responding increases as a function of the organism's interacting with its environment, the process is called reinforcement. Decrease of responding is called punishment or reduction. These terms are purely descriptive of observed processes, and they do not refer, as Thorndike's law of effect did, to pleasure or discomfort. It is important to note that reinforcement was defined in functional terms. An event that increases frequency of responding because it typically follows a response is called a (positive) reinforcer. This distinguishes the terms reinforcer and reward. Rewards are events that are intended to strengthen behavior, but reinforcers are events that actually strengthen behavior. Often, rewards function as reinforcers, but there may be exceptions. For example, if a student enjoys working with mathematics, rewarding that activity may actually reduce rather than strengthen that behavior (overjustification). Reinforcers include biological events such as food, drink, and sex; money and possessions; and social events such as attention and praise. Such events can strengthen many forms of behaviors, both desirable and undesirable. For example, one important determinant of aggressive behavior may be the immediate and positive consequences of acting aggressively.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading