Entry
Reader's guide
Entries A-Z
Subject index
p Value
A p value expresses the probability that a given statistical result is due to chance. p Values are automatically produced for many statistical procedures by analysis packages such as SAS, SPSS, and Stata, and they are commonly cited in epidemiological and medical research as evidence that results should be considered significant. For instance, a clinical trial concerning the effect of two different drugs on a medical outcome would almost certainly cite one or more p values as evidence of the influence, or lack thereof, of the drugs on the outcome. In the example above, perhaps we would expect that if the two drugs are no different in their effect, we would expect the outcome to be the same in each group. Therefore, we would expect the difference between the two groups to be zero, and would be interested in determining whether the actual difference we found could be attributed to chance or whether it is likely to indicate true difference between the groups.
Generally speaking, the greater the difference between the expected and observed results, the smaller the p value and, therefore, the less likely it is that the difference is due to chance (random variation). As the likelihood of chance explaining the difference diminishes, so does its plausibility—giving way to an alternative explanation: that the difference is not due to chance, the difference is due to the expected value being wrong.
We will illustrate the concept of p value using the simple example of 10 tosses of a coin. Without saying more, it is reasonable to believe the coin is fair: There is an equal chance of either heads or tails on each toss. Following this line of reasoning, it is logical to expect 5 heads from 10 tosses, so the expected value of heads is 5. On 10 tosses, however, a variety of outcomes are not at all remarkable. For example, 6 heads (H) and 4 tails (T); 5H and 5T; 4H and 6T do not surprise or call into question the reasonable presumption of a fair coin. Further departures from the 5H and 5T, however, credibly cause doubt, increasing doubt with increasing departure.
Consider the result 8H and 2T. Such an outcome is not expected and causes doubt about the fairness of the coin (or process). Certainly, chance could have produced the outcome, but it is unlikely. The p value is the measure of that chance. The probability that such a result is due to chance is derived from the binomial distribution and is .0439. This calculation assumes that the coin is fair, that is, that the probability of heads on each toss is .5. Therefore, the p value calculation is a conditional probability with the condition being that the expected value is true. Such an assumption is important since we are determining the probability of the observed difference (departure) from the expected value if the expected value was correct in the first place.
In statistics, we are usually concerned with the probability not of achieving a particular result but of obtaining results at least as extreme as our result. In our example, the expected result is 5H and 5T, so we consider deviations from that expectation to be more extreme as they are less likely. So 9H and 1T is even less likely than 8H and 2T (the probability of 9H and 1T, given a fair coin, is .0098), and 10H and 0T is yet more extreme (with a probability of .0010). To calculate the probability of a result at least as extreme as 8T and 2H, we add together these probabilities: .0439 + .0098 + .0010 + .0547. A final comment to this illustration: The calculation of .0547 is based on a one-sided p value. In other words, it only considers the probability of getting 8 or more heads in 10 tosses of a fair coin and ignores the probability of getting 8 or more tails, which would be equally as extreme. If we would consider a deviation toward either more heads or more tails to be a significant result, then the p value should be a two-sided calculation incorporating all the outcomes consistent with observation. In our illustration, the set of outcomes would number six: 10H and 0T; 9H and 1T; 8H and 2T; 2H and 8T; 1H and 9T; 0H and 10T. The probability would be the sum of these six independent outcomes: .0010 + .0098 + .0439 + .0439 + .0098 + .0010 = .1094. (The fact that this is exactly two times the one-sided p value is because the binomial distribution is symmetrical when the probability of a success is .5, the expected probability of an H when assuming a fair coin.) Thus, there is approximately 11% chance (.1094) that 10 fair tosses of a fair coin would produce any one of the six ‘extreme’ values: 10H and 0T; 9H and 1T; 8H and 2T; 2H and 8T; 1H and 9T; 0H and 10T.
...
- Behavioral and Social Science
- Acculturation
- Bioterrorism
- Community Health
- Community Trial
- Community-Based Participatory Research
- Cultural Sensitivity
- Demography
- Determinants of Health Model
- Ecological Fallacy
- Epidemiology in Developing Countries
- EuroQoL EQ-5D Questionnaire
- Functional Status
- Genocide
- Geographical and Social Influences on Health
- Health Behavior
- Health Belief Model
- Health Communication
- Health Communication in Developing Countries
- Health Disparities
- Health Literacy
- Health, Definitions of
- Life Course Approach
- Locus of Control
- Medical Anthropology
- Network Analysis
- Participatory Action Research
- Poverty and Health
- Quality of Life, Quantification of
- Quality of Well-Being Scale (QWB)
- Race and Ethnicity, Measurement Issues With
- Race Bridging
- Rural Health Issues
- Self-Efficacy
- SF-36® Health Survey
- Social Capital and Health
- Social Epidemiology
- Social Hierarchy and Health
- Social Marketing
- Social-Cognitive Theory
- Socioeconomic Classification
- Spirituality and Health
- Targeting and Tailoring
- Theory of Planned Behavior
- Transtheoretical Model
- Urban Health Issues
- Urban Sprawl
- Branches of Epidemiology
- Applied Epidemiology
- Chronic Disease Epidemiology
- Clinical Epidemiology
- Descriptive and Analytic Epidemiology
- Disability Epidemiology
- Disaster Epidemiology
- Eco-Epidemiology
- Environmental and Occupational Epidemiology
- Field Epidemiology
- Genetic Epidemiology
- Injury Epidemiology
- Maternal and Child Health Epidemiology
- Molecular Epidemiology
- Neuroepidemiology
- Nutritional Epidemiology
- Pharmacoepidemiology
- Psychiatric Epidemiology
- Reproductive Epidemiology
- Social Epidemiology
- Veterinary Epidemiology
- Diseases and Conditions
- Alzheimer's Disease
- Anxiety Disorders
- Arthritis
- Asthma
- Autism
- Avian Flu
- Bipolar Disorder
- Bloodborne Diseases
- Cancer
- Cardiovascular Disease
- Diabetes
- Foodborne Diseases
- Gulf War Syndrome
- Hepatitis
- HIV/AIDS
- Hypertension
- Influenza
- Insect-Borne Disease
- Malaria
- Measles
- Oral Health
- Osteoporosis
- Parasitic Diseases
- Plague
- Polio
- Post-Traumatic Stress Disorder
- Schizophrenia
- Severe Acute Respiratory Syndrome (SARS)
- Sexually Transmitted Diseases
- Sick Building Syndrome
- Sleep Disorders
- Smallpox
- Suicide
- Toxic Shock Syndrome
- Tuberculosis
- Vector-Borne Disease
- Vehicle-Related Injuries
- Vitamin Deficiency Diseases
- Waterborne Diseases
- Yellow Fever
- Zoonotic Disease
- Epidemiological Concepts
- Attack Rate
- Attributable Fractions
- Biomarkers
- Birth Cohort Analysis
- Birth Defects
- Case Definition
- Case Reports and Case Series
- Case-Cohort Studies
- Case-Fatality Rate
- Cohort Effects
- Community Trial
- Competencies in Applied Epidemiology for Public Health Agencies
- Cumulative Incidence
- Direct Standardization
- Disease Eradication
- Effect Modification and Interaction
- Effectiveness
- Efficacy
- Emerging Infections
- Epidemic
- Etiology of Disease
- Exposure Assessment
- Fertility, Measures of
- Fetal Death, Measures of
- Gestational Age
- Health, Definitions of
- Herd Immunity
- Hill's Considerations for Causal Inference
- Incidence
- Indirect Standardization
- Koch's Postulates
- Life Course Approach
- Life Expectancy
- Life Tables
- Malnutrition, Measurement of
- Mediating Variable
- Migrant Studies
- Mortality Rates
- Natural Experiment
- Notifiable Disease
- Outbreak Investigation
- Population Pyramid
- Preclinical Phase of Disease
- Preterm Birth
- Prevalence
- Prevention: Primary, Secondary, and Tertiary
- Public Health Surveillance
- Qualitative Methods in Epidemiology
- Quarantine and Isolation
- Screening
- Sensitivity and Specificity
- Sentinel Health Event
- Syndemics
- Epidemiologic Data
- Administrative Data
- American Cancer Society Cohort Studies
- Behavioral Risk Factor Surveillance System
- Biomedical Informatics
- Birth Certificate
- Cancer Registries
- Death Certificate
- Framingham Heart Study
- Global Burden of Disease Project
- Harvard Six Cities Study
- Health Plan Employer Data and Information Set
- Healthcare Cost and Utilization Project
- Healthy People 2010
- Honolulu Heart Program
- Illicit Drug Use, Acquiring Information on
- Medical Expenditure Panel Survey
- Monitoring the Future Survey
- National Ambulatory Medical Care Survey
- National Death Index
- National Health and Nutrition Examination Survey
- National Health Care Survey
- National Health Interview Survey
- National Immunization Survey
- National Maternal and Infant Health Survey
- National Mortality Followback Survey
- National Survey of Family Growth
- Physicians' Health Study
- Pregnancy Risk Assessment and Monitoring System
- Relational Database
- Rochester Epidemiology Project
- Sampling Techniques
- Secondary Data
- Spreadsheet
- Youth Risk Behavior Surveillance System
- Ethics
- Genetics
- Association, Genetic
- Chromosome
- Epigenetics
- Family Studies in Genetics
- Gene
- Gene-Environment Interaction
- Genetic Counseling
- Genetic Disorders
- Genetic Epidemiology
- Genetic Markers
- Genomics
- Genotype
- Hardy-Weinberg Law
- Heritability
- Human Genome Project
- Icelandic Genetics Database
- Linkage Analysis
- Molecular Epidemiology
- Multifactorial Inheritance
- Mutation
- Newborn Screening Programs
- Phenotype
- Teratogen
- Twin Studies
- Health Care Economics and Management
- Biomedical Informatics
- EuroQoL EQ-5D Questionnaire
- Evidence-Based Medicine
- Formulary, Drug
- Functional Status
- Health Care Delivery
- Health Care Services Utilization
- Health Economics
- International Classification of Diseases
- International Classification of Functioning, Disability, and Health
- Managed Care
- Medicaid
- Medicare
- Partner Notification
- Quality of Life, Quantification of
- Quality of Well-Being Scale (QWB)
- SF-36® Health Survey
- Health Risks and Health Behaviors
- Agent Orange
- Alcohol Use
- Allergen
- Asbestos
- Bioterrorism
- Child Abuse
- Cholesterol
- Circumcision, Male
- Diabetes
- Drug Abuse and Dependence, Epidemiology of
- Eating Disorders
- Emerging Infections
- Firearms
- Foodborne Diseases
- Harm Reduction
- Hormone Replacement Therapy
- Intimate Partner Violence
- Lead
- Love Canal
- Malnutrition, Measurement of
- Mercury
- Obesity
- Oral Contraceptives
- Pain
- Physical Activity and Health
- Pollution
- Poverty and Health
- Radiation
- Sexual Risk Behavior
- Sick Building Syndrome
- Social Capital and Health
- Social Hierarchy and Health
- Socioeconomic Classification
- Spirituality and Health
- Stress
- Teratogen
- Thalidomide
- Tobacco
- Urban Health Issues
- Urban Sprawl
- Vehicle-Related Injuries
- Violence as a Public Health Issue
- Vitamin Deficiency Diseases
- War
- Waterborne Diseases
- Zoonotic Disease
- History and Biography
- Budd, William
- Doll, Richard
- Ehrlich, Paul
- Epidemiology, History of
- Eugenics
- Farr, William
- Frost, Wade Hampton
- Genocide
- Goldberger, Joseph
- Graunt, John
- Hamilton, Alice
- Hill, Austin Bradford
- Jenner, Edward
- Keys, Ancel
- Koch, Robert
- Lind, James
- Lister, Joseph
- Nightingale, Florence
- Pasteur, Louis
- Public Health, History of
- Reed, Walter
- Ricketts, Howard
- Rush, Benjamin
- Snow, John
- Tukey, John
- Tuskegee Study
- Infrastructure of Epidemiology and Public Health
- American College of Epidemiology
- American Public Health Association
- Association of Schools of Public Health
- Centers for Disease Control and Prevention
- Council of State and Territorial Epidemiologists
- European Public Health Alliance
- European Union Public Health Programs
- Food and Drug Administration
- Governmental Role in Public Health
- Healthy People 2010
- Institutional Review Board
- Journals, Epidemiological
- Journals, Public Health
- National Center for Health Statistics
- National Institutes of Health
- Pan American Health Organization
- Peer Review Process
- Public Health Agency of Canada
- Publication Bias
- Society for Epidemiologic Research
- Surgeon General, U.S.
- U.S. Public Health Service
- United Nations Children's Fund
- World Health Organization
- Medical Care and Research
- Allergen
- Apgar Score
- Barker Hypothesis
- Birth Defects
- Body Mass Index (BMI)
- Carcinogen
- Case Reports and Case Series
- Clinical Epidemiology
- Clinical Trials
- Community Health
- Community Trial
- Comorbidity
- Complementary and Alternative Medicine
- Effectiveness
- Efficacy
- Emerging Infections
- Etiology of Disease
- Evidence-Based Medicine
- Gestational Age
- Intent-to-Treat Analysis
- International Classification of Diseases
- International Classification of Functioning, Disability, and Health
- Latency and Incubation Periods
- Life Course Approach
- Malnutrition, Measurement of
- Medical Anthropology
- Organ Donation
- Pain
- Placebo Effect
- Preclinical Phase of Disease
- Preterm Birth
- Public Health Nursing
- Quarantine and Isolation
- Screening
- Vaccination
- Specific Populations
- African American Health Issues
- Aging, Epidemiology of
- American Indian Health Issues
- Asian American/Pacific Islander Health Issues
- Breastfeeding
- Child and Adolescent Health
- Epidemiology in Developing Countries
- Hormone Replacement Therapy
- Immigrant and Refugee Health Issues
- Latino Health Issues
- Maternal and Child Health Epidemiology
- Men's Health Issues
- Oral Contraceptives
- Race and Ethnicity, Measurement Issues With
- Race Bridging
- Rural Health Issues
- Sexual Minorities, Health Issues of
- Urban Health Issues
- Women's Health Issues
- Statistics and Research Methods
- F Test
- p Value
- Additive and Multiplicative Models
- Analysis of Covariance
- Analysis of Variance
- Bar Chart
- Bayes's Theorem
- Bayesian Approach to Statistics
- Bias
- Binomial Variable
- Birth Cohort Analysis
- Box-and-Whisker Plot
- Capture-Recapture Method
- Categorical Data, Analysis of
- Causal Diagrams
- Causation and Causal Inference
- Censored Data
- Central Limit Theorem
- Chi-Square Test
- Classification and Regression Tree Models
- Cluster Analysis
- Coefficient of Determination
- Cohort Effects
- Collinearity
- Community Trial
- Community-Based Participatory Research
- Confidence Interval
- Confounding
- Control Group
- Control Variable
- Convenience Sample
- Cox Model
- Critical Value
- Cumulative Incidence
- Data Management
- Data Transformations
- Decision Analysis
- Degrees of Freedom
- Dependent and Independent Variables
- Diffusion of Innovations
- Discriminant Analysis
- Dose-Response Relationship
- Doubling Time
- Dummy Coding
- Dummy Variable
- Ecological Fallacy
- Economic Evaluation
- Effect Modification and Interaction
- Factor Analysis
- Fisher's Exact Test
- Geographical and Spatial Analysis
- Graphical Presentation of Data
- Halo Effect
- Hawthorne Effect
- Hazard Rate
- Healthy Worker Effect
- Hill's Considerations for Causal Inference
- Histogram
- Hypothesis Testing
- Inferential and Descriptive Statistics
- Intent-to-Treat Analysis
- Internet Data Collection
- Interquartile Range
- Interrater Reliability
- Intervention Studies
- Interview Techniques
- Kaplan-Meier Method
- Kappa
- Kurtosis
- Latent Class Models
- Life Tables
- Likelihood Ratio
- Likert Scale
- Log-Rank Test
- Logistic Regression
- Longitudinal Research Design
- Matching
- Measurement
- Measures of Association
- Measures of Central Tendency
- Measures of Variability
- Meta-Analysis
- Missing Data Methods
- Multilevel Modeling
- Multiple Comparison Procedures
- Multivariate Analysis of Variance
- Natural Experiment
- Network Analysis
- Nonparametric Statistics
- Normal Distribution
- Null and Alternative Hypotheses
- Observational Studies
- Overmatching
- Panel Data
- Participatory Action Research
- Pearson Correlation Coefficient
- Percentiles
- Person-Time Units
- Pie Chart
- Placebo Effect
- Point Estimate
- Probability Sample
- Program Evaluation
- Propensity Score
- Proportion
- Qualitative Methods in Epidemiology
- Quasi Experiments
- Questionnaire Design
- Race Bridging
- Random Variable
- Random-Digit Dialing
- Randomization
- Rate
- Ratio
- Receiver Operating Characteristic (ROC) Curve
- Regression
- Relational Database
- Reliability
- Response Rate
- Robust Statistics
- Sample Size Calculations and Statistical Power
- Sampling Distribution
- Sampling Techniques
- Scatterplot
- Secondary Data
- Sensitivity and Specificity
- Sequential Analysis
- Simpson's Paradox
- Skewness
- Spreadsheet
- Stem-and-Leaf Plot
- Stratified Methods
- Structural Equation Modeling
- Study Design
- Survival Analysis
- Target Population
- Time Series
- Type I and Type II Errors
- Unit of Analysis
- Validity
- Volunteer Effect
- Loading...
Get a 30 day FREE TRIAL
-
Watch videos from a variety of sources bringing classroom topics to life
-
Read modern, diverse business cases
-
Explore hundreds of books and reference titles
Sage Recommends
We found other relevant content for you on other Sage platforms.
Have you created a personal profile? Login or create a profile so that you can save clips, playlists and searches