Performance Standards: Selected Response Item Formats
Introduction
Setting performance standards means implementing a process that identifies one or more points on a score scale to create categories of observed test scores. More fully, Cizek (1993: 100) has defined standard setting as ‘the proper following of a prescribed, rational system of rules or procedures resulting in the assignment of a number to differentiate between two or more states or degrees of performance’. This differentiation can result in dichotomous classifications such as Master/Non-master or Pass/Fail. Standard setting can also result in more than two categories or achievement levels, such as Basic/Proficient/Advanced or the familiar grades of A, B, C, D, F.
In practice, setting a performance standard has become nearly synonymous with deriving one or more cutting scores. However, as Kane (1994) has pointed out, ‘it is useful to draw a distinction between the passing score, defined as a point on the score scale, and the performance standard, defined as the minimally adequate level of performance for some purpose … The performance standard is the conceptual version of the desired level of competence, and the passing score is the operational version’ (p. 426, emphasis in original).
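The operational side of this distinction, in which one or more cut scores partition the score scale into categories, can be illustrated with a minimal sketch. The function name, the cut-score values, and the convention that a score equal to a cut score falls in the higher category are all assumptions made for illustration; they are not drawn from any particular standard-setting method.

```python
from bisect import bisect_right


def classify(score, cut_scores, labels):
    """Return the category label for an observed score.

    cut_scores: cut scores in ascending order (hypothetical values).
    labels: category names, lowest first; one more label than cut scores.
    Assumed convention: a score equal to a cut score is placed in the
    higher category.
    """
    if len(labels) != len(cut_scores) + 1:
        raise ValueError("need exactly one more label than cut scores")
    if list(cut_scores) != sorted(cut_scores):
        raise ValueError("cut scores must be in ascending order")
    # bisect_right counts how many cut scores are <= score,
    # which is the index of the category the score falls into.
    return labels[bisect_right(cut_scores, score)]


# One cut score gives a dichotomous classification.
print(classify(59, [60], ["Fail", "Pass"]))
print(classify(60, [60], ["Fail", "Pass"]))

# Two cut scores give three achievement levels.
levels = ["Basic", "Proficient", "Advanced"]
print(classify(72, [70, 85], levels))
```

The sketch covers only the passing score in Kane's sense; the performance standard itself, the conceptual description of minimally adequate performance, is what the standard-setting process must supply before any such numbers can be chosen.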
Specialists in the field of educational measurement have developed numerous methods for deriving levels of performance, and a wide variety of applications for standard-setting methods exists. Standards are established for determining school readiness; for communicating student achievement in school subjects; for granting admission to institutions; for selection for special services; for suggesting diagnoses or treatments; for placement into specialized programmes; and for awarding certification or granting licensure. Overviews of the many approaches to standard setting can be found in several sources (see, e.g., Berk, 1986; Cizek, 1996a, 2001; Jaeger, 1989; Livingston & Zieky, 1982).
Although the methods for standard setting are numerous, care must be taken to match the method used to the particular characteristics of the assessment and context in which the standard setting is conducted. Linn (1994) has suggested that standard-setting procedures can be distinguished based on the four unique purposes of exhortation, exemplification, accountability, and certification of achievement. It is also common to categorize methods as reflecting absolute, relative, or compromise standards. Jaeger (1989) has grouped standard-setting methods into two categories, those which are test-centred and those which are examinee-centred.
It can also be useful to classify standard-setting methods by the response format dictated by the items or tasks comprising the assessment, i.e. either constructed-response or selected-response formats. The remainder of this entry consists of two major sections: a review of some of the most common methods for establishing performance standards on selected-response (e.g. multiple-choice) assessments; and a brief summary of professional guidelines for doing so.
Standard-Setting Methods
The following subsections describe major standard-setting methods that have traditionally found wide use on assessments composed of selected-response format items. The methods are described in order of their introduction in the psychometric literature. All but one of the methods that will be described (Nedelsky) can be, and have been, fairly easily adapted to tests consisting of other item/task formats. Conversely, many newly introduced methods have been developed specifically for use with tests consisting of constructed-response items/tasks. Two such examples are the Bookmark method (Mitzel, Lewis, Patz & Green, 2001) and the Analytic Judgment method (Plake & Hambleton, 2001). It is also important to note, however, that nearly all of the newer methods could be applied to tests consisting of a mix of item formats, or even to tests consisting exclusively of selected-response items.
...