Entry
Reader's guide
Entries A-Z
Subject index
Criterion-Referenced Testing
Definition and Features
A criterion-referenced test (CRT) provides a measure of an individual's absolute performance or behavior on a well-defined domain. The domain may include a set of learning/behavioral objectives to be mastered or a set of standards to be achieved. The history of CRTs dates back to a 1963 essay by Robert Glaser in which he introduced criterion referencing as a new type of approach to test development and interpretation. Glaser indicated that the absolute comparisons that formed the basis of CRTs were preferable to the relative comparisons made by the norm-referenced tests (NRTs) widely used at that time.
One way to gain a better understanding of exactly what a CRT entails is to compare it to an NRT. Within this section, CRTs are further defined and comparisons to NRTs are made. The next section discusses the development, use, and interpretation of CRTs in large-scale tests used for accountability purposes. It includes issues of alignment and validity, performance levels, and standard settings. The third section discusses the use of CRTs for classroom instructional purposes. The final section briefly describes guidelines available to assist test users in selecting and interpreting CRTs.
What does a CRT Measure?
Whereas NRTs compare an individual's performance to the performance of others in a comparison group, such as all students in a state or across the nation, CRTs compare an individual's performance to standards or learning objectives. Because of this distinction, interpretations of NRTs are often referred to as relative comparisons and CRTs as absolute comparisons. In a CRT, test users are concerned about knowing whether the student has achieved a standard or mastered a learning objective. They are not concerned about the position or ranking of the student relative to other students.
What is the Domain for a CRT?
CRTs consist of a well-defined domain of knowledge, skills, and/or behaviors to be measured. The domain for a CRT is narrower and more clearly delineated than the domain for an NRT, which is usually broader and covers many objectives. The level of specificity, however, varies somewhat across different types of CRTs. In some classroom situations, CRTs are called objective-referenced because they measure detailed learning outcomes (e.g., adding two three-digit numbers that require carrying). In large-scale tests that are standards-based, the criteria may be slightly less detailed (e.g., measuring whether a student can represent mathematical situations using algebraic symbols).
How Are the Items Designed?
In NRTs, items are usually developed so that the difficulty level is average and discrimination among student scores is high. Because easy items do not lend themselves to discriminating among individuals, they are not usually used on NRTs. However, for CRTs, the item difficulty or discrimination is not of utmost importance. Rather, the most critical aspect of developing a CRT is that each item must have a direct match to a learning objective, behavior, or standard within the domain. Depending on the purpose of administering the CRT, the items may be very easy or difficult. When a test is given immediately after instruction to determine whether students attained the knowledge necessary to move on to the next topic, it is likely that the items will have a low difficulty level. Also, there would be no concern about whether the test was able to discriminate among students.
...
- Classroom Achievement
- Acceleration
- Alternative Academic Assessment
- Bell Curve
- Direct Instruction
- Educational Technology
- Failure, Effects of
- Gifted and Talented Students
- Goals
- Grade Retention
- Grading
- Halo Effect
- Home Environment and Academic Intrinsic Motivation
- Homework
- Intelligence and Intellectual Development
- Intelligence Quotient (IQ)
- Intelligence Tests
- Literacy
- Media Literacy
- Parental Expectations
- Personalized System of Instruction
- Precision Teaching
- Reading Comprehension Strategies
- Rubrics
- Spelling
- Test Anxiety
- Classroom Management
- Calculator Use
- Cheating
- Contingency Contracts
- Cooperative Learning
- Curriculum Development
- Discovery Learning
- Distance Learning
- Early Intervention Programs
- Educational Technology
- Effective Teaching, Characteristics of
- Mainstreaming
- Montessori Schools
- School Design
- School Resources
- Students' Rights
- Time-Out
- Token Reinforcement Programs
- Virtual Schools
- Vocational Education
- Cognitive Development
- Cognitive Development and School Readiness
- Conservation
- Deductive Reasoning
- Egocentrism
- Equilibration
- Field Independence–Field Dependence
- Flashbulb Memories, the Nature of
- Inductive Reasoning
- Intelligence and Intellectual Development
- Literacy
- Long-Term Memory
- Measurement and Cognitive Development
- Metacognition and Learning
- Moral Development
- Motivation and Emotion
- Object Permanence
- Perceptual Development
- Piaget's Theory of Cognitive Development
- Schemas
- Short-Term Memory
- Spelling
- Vygotsky's Cultural-Historical Theory of Development
- Zone of Proximal Development
- Ethnicity, Race, and Culture
- African Americans
- American Indians and Alaska Natives
- Asian Americans
- Bilingual Education
- Bilingualism
- Communication Disorders
- Cultural Deficit Model
- Cultural Diversity
- Culture
- Diversity
- Ethnicity and Race
- Head Start
- Hispanic Americans
- Identity Development
- Immigration
- Multicultural Classrooms
- Multicultural Education
- Families
- Gender and Gender Development
- Health and Well-Being
- Abstinence Education
- Athletics
- Attention Deficit Hyperactivity Disorder
- Autism Spectrum Disorders
- Behavior Disorders
- Brain-Relevant Education
- Communication Disorders
- Conduct Disorders
- Diagnostic and Statistical Manual of Mental Disorders
- Disabilities
- Drug Abuse
- Dyslexia
- Eating Disorders
- Extracurricular Activities
- HIV/AIDS
- Learning Disabilities
- Malnutrition and Development
- Mental Health Care in Schools
- Mental Retardation
- Obesity
- School Counseling
- Sex Education
- Special Education
- Suicide
- Human Development
- Acculturation
- Aggression
- Androgyny
- Anxiety
- Aptitude
- Athletics
- Attachment
- Attachment Disorder
- Autism Spectrum Disorders
- Behavior Disorders
- Creativity
- Early Intervention Programs
- Egocentrism
- Emotion and Memory
- Emotional Development
- Empathy
- Equilibration
- Erikson's Theory of Psychosocial Development
- Extracurricular Activities
- Friendship
- Gifted and Talented Students
- Head Start
- Identity Development
- Individual Differences
- Individuals with Disabilities Education Act
- Intelligence and Intellectual Development
- Intrinsic versus Extrinsic Motivation
- Kohlberg's Stages of Moral Development
- Mainstreaming
- Maslow's Hierarchy of Basic Needs
- Maturation
- Mental Retardation
- Metacognition and Learning
- Moral Development
- Motivation
- Motivation and Emotion
- Motor Development
- Myelination
- Neuroscience
- Peer Influences
- Perceptual Development
- Physical Development
- Piaget's Theory of Cognitive Development
- Risk Factors and Development
- School Violence and Disruption
- Self-Determination
- Self-Efficacy
- Self-Esteem
- Special Education
- Test Anxiety
- Vygotsky's Cultural-Historical Theory of Development
- Intelligence and Intellectual Development
- Language Development
- Learning and Memory
- Adult Learning
- Assistive Technology
- Aversive Stimuli
- Behavior Modification
- Bloom's Taxonomy of Educational Objectives
- Brain-Relevant Education
- Classical Conditioning
- Cognitive and Cultural Styles
- Cognitive View of Learning
- Cooperative Learning
- Discovery Learning
- Discrimination
- Distance Learning
- Divergent Thinking
- Educational Technology
- Emotion and Memory
- Episodic Memory
- Explicit Memory
- Flashbulb Memories, the Nature of
- Habituation
- Intrinsic versus Extrinsic Motivation
- Learning
- Learning Communities
- Learning Disabilities
- Learning Strategies
- Learning Style
- Lifelong Learning
- Long-Term Memory
- Malnutrition and Development
- Maturation
- Memory
- Metacognition and Learning
- Mnemonics
- Motivation and Emotion
- Observational Learning
- Older Learners
- Operant Conditioning
- Peer-Assisted Learning
- Perceptual Development
- Premack Principle
- Reinforcement
- Rosenthal Effect
- Shaping
- Short-Term Memory
- Social Learning Theory
- Stimulus Control
- Working Memory
- Organizations
- Peers and Peer Influences
- Public Policy
- Abstinence Education
- Assistive Technology
- Bilingual Education
- Charter Schools
- Child Abuse
- Early Child Care and Education
- English as a Second Language
- Ethics and Research
- Gangs
- Grade Retention
- Head Start
- High-Stakes Testing
- Home Education
- Immigration
- Inclusion
- Individualized Education Program
- Individuals with Disabilities Education Act
- Institutional Review Boards
- Intelligence Tests
- Least Restrictive Placement
- Mainstreaming
- No Child Left Behind
- Poverty
- School Design
- School Violence and Disruption
- Sex Education
- Special Education
- Students' Rights
- Testing
- Tracking
- Vouchers
- Research Methods and Statistics
- T Scores
- Case Studies
- Confidence Interval
- Correlation
- Cross-Sectional Research
- Descriptive Statistics
- Ethics and Research
- Ethnography
- Experimental Design
- External Validity
- Field Experiments
- Frequency Distribution
- Generalizability Theory
- Inferential Statistics
- Internal Validity
- Longitudinal Research
- Mean
- Median
- Meta-Analysis
- Mode
- Naturalistic Observation
- Normal Curve
- Percentile Rank
- Qualitative Research Methods
- Quantitative Research Methods
- Random Sample
- Regression
- Scientific Method
- Standard Deviation and Variance
- Standard Scores
- Stanine Scores
- Statistical Significance
- Social Development
- Teaching
- Aptitude Tests
- Constructivism
- Contingency Contracts
- Criterion-Referenced Testing
- Curriculum Development
- Direct Instruction
- Educational Technology
- Effective Teaching, Characteristics of
- Emotion and Memory
- English as a Second Language
- Evaluation
- Expert Teachers
- Explicit Teaching
- Goals
- Grade Retention
- Grade-Equivalent Scores
- Grading
- Home Education
- Homework
- Instructional Objectives
- Learning Objectives
- Parent–Teacher Conferences
- Personalized System of Instruction
- PRAXIS™
- Precision Teaching
- Rubrics
- Scaffolding
- School Readiness
- Sex Education
- Students' Rights
- Teaching Strategies
- Tracking
- Testing, Measurement, and Evaluation
- Acceleration
- Alternative Academic Assessment
- Aptitude Tests
- Assessment
- Bell Curve
- Certification
- Criterion-Referenced Testing
- Essay Tests
- Evaluation
- External Validity
- Generalizability Theory
- Grade Retention
- Grade-Equivalent Scores
- Grading
- High-Stakes Testing
- Intelligence Tests
- Measurement
- Measurement of Cognitive Development
- Mental Age
- Multiple-Choice Tests
- Norm-Referenced Tests
- Percentile Rank
- Personality Tests
- Reliability
- Rubrics
- Standardized Tests
- Stanford–Binet Test
- Test Anxiety
- Testing
- Validity
- Theory
- Applied Behavior Analysis
- Behavior Modification
- Bloom's Taxonomy of Educational Objectives
- Classical Conditioning
- Cognitive Behavior Modification
- Cognitive View of Learning
- Constructivism
- Continuity and Discontinuity in Learning
- Cultural Deficit Model
- Dynamical Systems
- Erikson's Theory of Psychosocial Development
- Generalizability Theory
- Kohlberg's Stages of Moral Development
- Learned Helplessness
- Maslow's Hierarchy of Basic Needs
- Neuroscience
- Piaget's Theory of Cognitive Development
- Premack Principle
- Psychoanalytic Theory
- Psychosocial Development
- Reciprocal Determinism
- Rosenthal Effect
- Schemas
- Social Learning Theory
- Theory of Mind
- Vicarious Reinforcement
- Loading...
Get a 30 day FREE TRIAL
-
Watch videos from a variety of sources bringing classroom topics to life
-
Read modern, diverse business cases
-
Explore hundreds of books and reference titles
Sage Recommends
We found other relevant content for you on other Sage platforms.
Have you created a personal profile? Login or create a profile so that you can save clips, playlists and searches