Skip to main content icon/video/no-internet

Differential Reinforcement of Other Behavior

Description of the Strategy

Differential reinforcement of other behavior (DRO) involves the delivery of a reinforcer when a target behavior is not emitted for a specified period of time. Sometimes referred to as differential reinforcement of zero rates of behavior, reinforcement within DRO interventions is contingent on the nonoccurrence of target behavior over a specified period of time.

There are two basic types of DRO schedules: whole interval and momentary interval. In whole-interval DRO, reinforcement is delivered if the target behavior has not occurred for the entire prescribed interval. An example of a whole-interval DRO contingency would be “Eric will earn 5 minutes of computer time [i.e., a reinforcer] for each 90-minute interval during which he does not destroy property.” To earn the reinforcer, Eric will need to refrain from the target behavior (i.e., property destruction) for the entire 90 minutes.

In the second type of schedule, momentary-interval DRO, reinforcement is delivered if the target behavior is not occurring at a particular moment of observation. A contingency for this type of DRO schedule would be “Eric will earn 5 minutes of computer time if he is not destroying property at the conclusion of a 90-minute interval.” Eric's behavior is now assessed only at a single point in time, rather than continuously (similar to a momentary time-sampling procedure). He could destroy property one or more times at any point during the interval and still earn computer time, as long as he is not doing so at minute 90. In general, research suggests that whole-interval DRO is the procedure of choice during the initial phase of intervention and that momentary-interval DRO may be used for maintenance once a satisfactory reduction in the frequency of the target behavior has been achieved.

Selection of Target Behaviors

Several variables are critical to the successful implementation of DRO schedules. The first is the selection of target behaviors, especially when the student of concern has a large number of them in his or her repertoire. For example, if problem behaviors include swearing and hitting people, one option is to select the most problematic behavior for the DRO schedule (hitting people, in this case); this means that the student could swear during the interval and still receive reinforcement. A second option is to specify the most problematic behavior in the DRO schedule (e.g., hitting) and add that, at the end of an interval during which the most problematic behavior does not occur, the reinforcer will not be delivered until a specified number of minutes has passed without the less problematic behaviors occurring (e.g., swearing). A third option is to include both swearing and hitting in the DRO schedule, so that the student would be required to refrain from engaging in both behaviors during the interval to earn reinforcement. This makes sense, especially when the target behaviors all serve the same function for the student.

Interval Length

Based on a number of research studies, the length of the initial interval selected for the DRO schedule should be based on the interresponse time (IRT), the average length of time between problem behaviors. To calculate the IRT, one would (a) record the frequency of the target behaviors over several sessions, (b) calculate the average number of target behaviors across sessions, and (c) divide the total amount of time during baseline by the average number of behaviors. For example, imagine that Eric destroys property 2, 4, 4, 6, 3, and 5 times per hour over a 6-hour period (step a). The average number of target behaviors over the 6-hour period is 4 per hour (24 behaviors ÷ 6 hours; step b). This means that, across the entire 6 hours (360 minutes), property destruction occurred, on average, once every 90 minutes (360 minutes ÷ 4 behaviors per hour; step c). Thus, the IRT of 90 minutes would be the length of the initial interval typically used for the DRO schedule, such that Eric could earn a reinforcer every time 90 minutes passed without an episode of property destruction. Some researchers have recommended that, especially when the problem behavior presents a risk to the student or to others, the initial interval length be set at half of the IRT, to ensure that the student will come in contact with the reinforcer at least 50% of the time, based on the baseline rate. Based on the calculations shown, Eric would then earn a reinforcer each time 45 minutes passed without an episode of property destruction.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading