Entry
Reader's guide
Entries A-Z
Subject index
Validity
It is not more vacation we need—it is more vocation.
According to the Standards for Educational and Psychological Testing, published in 1999 by the American Educational Research Association, American Psychological Association, and the National Council on Measurement in Education, validity is defined as the extent to which empirical evidence and theory lend support to the interpretation and inferences made about test scores for particular uses. Validity is considered the most fundamental component of developing and evaluating educational and psychological tests. This is understandable. Validity involves the accumulation of evidence and theoretical justification for claims made about the traits, abilities, or attributes (e.g., intelligence, achievement, self-esteem) individuals might possess as measured by educational and psychological tests. In the following three sections, the process of validating tests will be reviewed. In the first section, the general approach to argument-based validation will be described, including a description of how test-related aspects can undermine this argument. In the second section, the sources of validity evidence—test content, response processes, internal structure, relations to other variables, and consequences—normally collected to bolster validity claims will be outlined. In the third section, the methods for integrating sources of evidence to build a strong validity argument are explained. This entry will reflect much of the information contained in the Standards for Educational and Psychological Testing, which, in turn reflects the scientific debates, contributions, and consensus of leading educational and psychological researchers and practitioners in the field.
Argument-Based Approach to Validation
Educational and psychological tests are pervasive in society, offering measurements of many distinct attributes, such as academic aptitude, personality characteristics, and career interests. The responsible use of tests can lead to increased information about individuals so as to help them lead more productive, happier lives. However, when tests are used to make unwarranted claims about individuals, it hurts not only the persons writing the test but also the general public, who may question the development and use of tests for the good of society.
Establishing the validity of inferences made about individuals (henceforth referred to as examinees) based on their test scores is fundamental to the appropriate development and use of tests.
The first step toward establishing the validity of test-score interpretations for particular uses is a written statement or series of written statements about how the test scores are going to be interpreted. The argument-based approach to validation requires that these statements be clear and explicit and also include a rationale for the proposed interpretation and use. For example, imagine a developer of a science achievement test who intends to use the test to select high school students for a special university program in physics. The developer would begin the validation process by writing an explicit statement about how the test scores on the achievement test are going to be interpreted; for example, high scores will be interpreted to indicate competence in the following knowledge and skills: designing experiments, testing hypotheses, and deductive reasoning. The statement would also indicate the proposed uses of test-scores; for example, examinees scoring above a certain cut-off will be selected for inclusion in the university physics program. A rationale for viewing the test items, and hence test scores, as appropriate measures of the construct of interest (science achievement) and why the scores should lead to a particular use must also be outlined. For example, the rationale must provide reasons for accepting the items as appropriate measures of the knowledge and skills in science achievement and why these knowledge and skills (and not others) are the appropriate ones to use in the selection of examinees.
...
- Classroom Achievement
- Acceleration
- Alternative Academic Assessment
- Bell Curve
- Direct Instruction
- Educational Technology
- Failure, Effects of
- Gifted and Talented Students
- Goals
- Grade Retention
- Grading
- Halo Effect
- Home Environment and Academic Intrinsic Motivation
- Homework
- Intelligence and Intellectual Development
- Intelligence Quotient (IQ)
- Intelligence Tests
- Literacy
- Media Literacy
- Parental Expectations
- Personalized System of Instruction
- Precision Teaching
- Reading Comprehension Strategies
- Rubrics
- Spelling
- Test Anxiety
- Classroom Management
- Calculator Use
- Cheating
- Contingency Contracts
- Cooperative Learning
- Curriculum Development
- Discovery Learning
- Distance Learning
- Early Intervention Programs
- Educational Technology
- Effective Teaching, Characteristics of
- Mainstreaming
- Montessori Schools
- School Design
- School Resources
- Students' Rights
- Time-Out
- Token Reinforcement Programs
- Virtual Schools
- Vocational Education
- Cognitive Development
- Cognitive Development and School Readiness
- Conservation
- Deductive Reasoning
- Egocentrism
- Equilibration
- Field Independence–Field Dependence
- Flashbulb Memories, the Nature of
- Inductive Reasoning
- Intelligence and Intellectual Development
- Literacy
- Long-Term Memory
- Measurement and Cognitive Development
- Metacognition and Learning
- Moral Development
- Motivation and Emotion
- Object Permanence
- Perceptual Development
- Piaget's Theory of Cognitive Development
- Schemas
- Short-Term Memory
- Spelling
- Vygotsky's Cultural-Historical Theory of Development
- Zone of Proximal Development
- Ethnicity, Race, and Culture
- African Americans
- American Indians and Alaska Natives
- Asian Americans
- Bilingual Education
- Bilingualism
- Communication Disorders
- Cultural Deficit Model
- Cultural Diversity
- Culture
- Diversity
- Ethnicity and Race
- Head Start
- Hispanic Americans
- Identity Development
- Immigration
- Multicultural Classrooms
- Multicultural Education
- Families
- Gender and Gender Development
- Health and Well-Being
- Abstinence Education
- Athletics
- Attention Deficit Hyperactivity Disorder
- Autism Spectrum Disorders
- Behavior Disorders
- Brain-Relevant Education
- Communication Disorders
- Conduct Disorders
- Diagnostic and Statistical Manual of Mental Disorders
- Disabilities
- Drug Abuse
- Dyslexia
- Eating Disorders
- Extracurricular Activities
- HIV/AIDS
- Learning Disabilities
- Malnutrition and Development
- Mental Health Care in Schools
- Mental Retardation
- Obesity
- School Counseling
- Sex Education
- Special Education
- Suicide
- Human Development
- Acculturation
- Aggression
- Androgyny
- Anxiety
- Aptitude
- Athletics
- Attachment
- Attachment Disorder
- Autism Spectrum Disorders
- Behavior Disorders
- Creativity
- Early Intervention Programs
- Egocentrism
- Emotion and Memory
- Emotional Development
- Empathy
- Equilibration
- Erikson's Theory of Psychosocial Development
- Extracurricular Activities
- Friendship
- Gifted and Talented Students
- Head Start
- Identity Development
- Individual Differences
- Individuals with Disabilities Education Act
- Intelligence and Intellectual Development
- Intrinsic versus Extrinsic Motivation
- Kohlberg's Stages of Moral Development
- Mainstreaming
- Maslow's Hierarchy of Basic Needs
- Maturation
- Mental Retardation
- Metacognition and Learning
- Moral Development
- Motivation
- Motivation and Emotion
- Motor Development
- Myelination
- Neuroscience
- Peer Influences
- Perceptual Development
- Physical Development
- Piaget's Theory of Cognitive Development
- Risk Factors and Development
- School Violence and Disruption
- Self-Determination
- Self-Efficacy
- Self-Esteem
- Special Education
- Test Anxiety
- Vygotsky's Cultural-Historical Theory of Development
- Intelligence and Intellectual Development
- Language Development
- Learning and Memory
- Adult Learning
- Assistive Technology
- Aversive Stimuli
- Behavior Modification
- Bloom's Taxonomy of Educational Objectives
- Brain-Relevant Education
- Classical Conditioning
- Cognitive and Cultural Styles
- Cognitive View of Learning
- Cooperative Learning
- Discovery Learning
- Discrimination
- Distance Learning
- Divergent Thinking
- Educational Technology
- Emotion and Memory
- Episodic Memory
- Explicit Memory
- Flashbulb Memories, the Nature of
- Habituation
- Intrinsic versus Extrinsic Motivation
- Learning
- Learning Communities
- Learning Disabilities
- Learning Strategies
- Learning Style
- Lifelong Learning
- Long-Term Memory
- Malnutrition and Development
- Maturation
- Memory
- Metacognition and Learning
- Mnemonics
- Motivation and Emotion
- Observational Learning
- Older Learners
- Operant Conditioning
- Peer-Assisted Learning
- Perceptual Development
- Premack Principle
- Reinforcement
- Rosenthal Effect
- Shaping
- Short-Term Memory
- Social Learning Theory
- Stimulus Control
- Working Memory
- Organizations
- Peers and Peer Influences
- Public Policy
- Abstinence Education
- Assistive Technology
- Bilingual Education
- Charter Schools
- Child Abuse
- Early Child Care and Education
- English as a Second Language
- Ethics and Research
- Gangs
- Grade Retention
- Head Start
- High-Stakes Testing
- Home Education
- Immigration
- Inclusion
- Individualized Education Program
- Individuals with Disabilities Education Act
- Institutional Review Boards
- Intelligence Tests
- Least Restrictive Placement
- Mainstreaming
- No Child Left Behind
- Poverty
- School Design
- School Violence and Disruption
- Sex Education
- Special Education
- Students' Rights
- Testing
- Tracking
- Vouchers
- Research Methods and Statistics
- T Scores
- Case Studies
- Confidence Interval
- Correlation
- Cross-Sectional Research
- Descriptive Statistics
- Ethics and Research
- Ethnography
- Experimental Design
- External Validity
- Field Experiments
- Frequency Distribution
- Generalizability Theory
- Inferential Statistics
- Internal Validity
- Longitudinal Research
- Mean
- Median
- Meta-Analysis
- Mode
- Naturalistic Observation
- Normal Curve
- Percentile Rank
- Qualitative Research Methods
- Quantitative Research Methods
- Random Sample
- Regression
- Scientific Method
- Standard Deviation and Variance
- Standard Scores
- Stanine Scores
- Statistical Significance
- Social Development
- Teaching
- Aptitude Tests
- Constructivism
- Contingency Contracts
- Criterion-Referenced Testing
- Curriculum Development
- Direct Instruction
- Educational Technology
- Effective Teaching, Characteristics of
- Emotion and Memory
- English as a Second Language
- Evaluation
- Expert Teachers
- Explicit Teaching
- Goals
- Grade Retention
- Grade-Equivalent Scores
- Grading
- Home Education
- Homework
- Instructional Objectives
- Learning Objectives
- Parent–Teacher Conferences
- Personalized System of Instruction
- PRAXIS™
- Precision Teaching
- Rubrics
- Scaffolding
- School Readiness
- Sex Education
- Students' Rights
- Teaching Strategies
- Tracking
- Testing, Measurement, and Evaluation
- Acceleration
- Alternative Academic Assessment
- Aptitude Tests
- Assessment
- Bell Curve
- Certification
- Criterion-Referenced Testing
- Essay Tests
- Evaluation
- External Validity
- Generalizability Theory
- Grade Retention
- Grade-Equivalent Scores
- Grading
- High-Stakes Testing
- Intelligence Tests
- Measurement
- Measurement of Cognitive Development
- Mental Age
- Multiple-Choice Tests
- Norm-Referenced Tests
- Percentile Rank
- Personality Tests
- Reliability
- Rubrics
- Standardized Tests
- Stanford–Binet Test
- Test Anxiety
- Testing
- Validity
- Theory
- Applied Behavior Analysis
- Behavior Modification
- Bloom's Taxonomy of Educational Objectives
- Classical Conditioning
- Cognitive Behavior Modification
- Cognitive View of Learning
- Constructivism
- Continuity and Discontinuity in Learning
- Cultural Deficit Model
- Dynamical Systems
- Erikson's Theory of Psychosocial Development
- Generalizability Theory
- Kohlberg's Stages of Moral Development
- Learned Helplessness
- Maslow's Hierarchy of Basic Needs
- Neuroscience
- Piaget's Theory of Cognitive Development
- Premack Principle
- Psychoanalytic Theory
- Psychosocial Development
- Reciprocal Determinism
- Rosenthal Effect
- Schemas
- Social Learning Theory
- Theory of Mind
- Vicarious Reinforcement
- Loading...
Get a 30 day FREE TRIAL
-
Watch videos from a variety of sources bringing classroom topics to life
-
Read modern, diverse business cases
-
Explore hundreds of books and reference titles
Sage Recommends
We found other relevant content for you on other Sage platforms.
Have you created a personal profile? Login or create a profile so that you can save clips, playlists and searches