Skip to main content icon/video/no-internet

Although criterion-referenced tests (CRTs) can be developed to measure performance at the domain level (e.g., mathematics, reading), they are much more commonly used to measure mastery of short-term objectives (e.g., a unit on the Civil War). As classroom tests, CRTs have existed for many decades, if not centuries. The work of Mager and Glaser in the early 1960s started an important and continuing movement to improve the way that educators estimate students' achievement. R. F. Mager wrote a popular book that motivated educators to be precise in explicating the skills they wanted their students to learn. Robert Glaser, who is generally credited with coining the term criterion-referenced test, initiated the movement to measure the mastery of instructional objectives with reliability and validity.

What is a CRT?

A CRT is a measure designed to estimate mastery of an identified unit of a curriculum (e.g., battles of the Civil War, multidigit addition with regrouping, use of prepositions). CRTs are also referred to as curriculum-based measures and more broadly as curriculum-based assessment. CRTs are standardized instruments, which are constructed with sufficient precision that different examiners will administer, score, and interpret results in the same way. CRTs contain items designed to represent the unit of instruction adequately. Each item has a predetermined correct answer that can be scored objectively by the assessor. A CRT is used for two main purposes. First, it is used to determine whether a student is weak in a given skill and needs further instruction. Second, it is used to determine the effectiveness of instruction. Although CRTs are seldom normed nationally, it is beneficial to collect local norms for appropriate grade groups.

In contrast to norm-referenced tests, which use relative mastery criteria to interpret scores, CRTs use absolute mastery criteria. Therefore, the student's performance is not compared to that of other students but to a predetermined absolute standard of performance. Most commonly, CRTs measure performance as percentage correct. The particular measure used should be based on real-life demands. Because CRTs are usually used to measure short-term objectives, they tend to be formative rather than summative in nature. Thus, for skills at the lowest taxonomic levels (e.g., miniskills), educators may obtain mastery estimates on a weekly or even a daily basis.

Figure 1 A Partial Taxonomy for Mathematics

None

Some writers distinguish between domain-referenced tests and objective-referenced tests. Domain-referenced tests provide a measure that is interpretable in the context of a clearly defined and delimited set of objectives that collectively constitute an entire domain (e.g., mathematics, reading). Objective-referenced tests provide a measure that is interpretable only for a particular instructional objective.

Selection of Taxonomies of Objectives

It is useful for practitioners to select taxonomies of objectives for domains relevant to their practice. For instance, a mathematics teacher would benefit from a complete specification of the broad, enabling, and specific math skills needed by American citizens. Such a taxonomy represents the results of a task analysis of a domain. Although there is no such thing as a perfect taxonomy, national organizations (e.g., the National Council of Teachers of Mathematics, National Council of Teachers of English, National Research Council-Science) often publish taxonomies. A layout of a partial math taxonomy might look like that displayed in Figure 1.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading