Entry
Reader's guide
Entries A-Z
Subject index
Statistical Testing: Overview
Statistical testing is common in clinical settings as a method for drawing inferences for an unknown population value based on a sample of subjects. A clinician may wish, for instance, to test the hypothesis that a new surgical technique reduces the probability of adverse postoperative outcome or that, on average, a new drug reduces blood glucose in diabetics. Many types of statistical tests exist, and selection of an appropriate test is guided by the type of data collected, the statistics for which inferences are desired, the number of groups under study, and the sample size.
General Testing Procedure
Implementation of a statistical test begins with the specification of distinct hypotheses; a null hypothesis (denoted by H0) is assumed to be true, and the test is performed to evaluate the evidence against H0 in favor of an alternative hypothesis (denoted by HA). Typically, the hypothesis that the researcher may wish to show is specified as HA. For example, H0 for the study on the new surgical technique could be stated as “The probabilities of adverse outcome for patients assigned to the new treatment and patients assigned to the standard of care are equal.” HA would correspondingly be stated as “Adverse outcome probabilities are unequal for the two treatment techniques.”
The investigator collects a sample of data from the population of interest, and a test statistic is derived using the sample responses. The test statistic is defined as a quantity that summarizes the sample in such a way that a decision to accept H0 (in preference to HA) or to reject H0 (in favor of HA) can be made based on all possible values of the quantity. The set of values corresponding to the decision to accept H0 is called the acceptance region, while the set of values corresponding to the decision to reject H0 is called the rejection region.
The test statistic has a certain probability distribution—the sampling distribution—under the assumption that H0 is true (i.e., the probability distribution of test statistics arising from many repeated samples). The shape of the sampling distribution depends on the type of test being implemented.
As the relevant population quantity is unknown (thereby necessitating testing), the potential exists for the test to produce an incorrect conclusion. The conclusion to reject H0 when indeed H0 is true is known as a Type I error or false positive. On the other hand, accepting H0 when HA is true is called a Type II error or false negative. The probability of committing a Type I error (often referred to as the significance level) is commonly denoted by a, while the probability of committing a Type II error is commonly denoted by β. The significance level corresponds to the range of the rejection region and is specified by the experimenter prior to testing. Many journals require α = .05 for testing.
Statistical tests can also be implemented in terms of p values, defined as the probability of observing a test statistic as extreme or more extreme than that which would be observed if the experiment were to be repeated many times—or, in other words, the observed significance level of the test. The decision to reject H0 is made if the p value is less than a, and the proximity of the p value to 0 is a measure of the strength of the evidence against H0.
...
- Basis for Making the Decision
- Acceptability Curves and Confidence Ellipses
- Beneficence
- Bioethics
- Choice Theories
- Construction of Values
- Cost-Benefit Analysis
- Cost-Comparison Analysis
- Cost-Consequence Analysis
- Cost-Effectiveness Analysis
- Cost-Minimization Analysis
- Cost-Utility Analysis
- Decision Quality
- Distributive Justice
- Dominance
- Equity
- Evaluating Consequences
- Expected Utility Theory
- Expected Value of Perfect Information
- Extended Dominance
- Health Production Function
- League Tables for Incremental Cost-Effectivenes: Ratios
- Marginal or Incremental Analysis, Cost-Effectiveness Ratio
- Monetary Value
- Moral Choice and Public Policy
- Net Benefit Regression
- Net Monetary Benefit
- Nonexpected Utility Theories
- Pharmacoeconomics
- Protected Values
- Rank-Dependent Utility Theory
- Return on Investment
- Risk-Benefit Trade-Off
- Subjective Expected Utility Theory
- Toss-Ups and Close Calls
- Value-Based Insurance Design
- Welfare, Welfarism, and Extrawelfarism
- Biostatistics and Clinical Epidemiology
- Analysis of Covariance (ANCOVA)
- Analysis of Variance (ANOVA)
- Attributable Risk
- Basic Common Statistical Tests: Chi-Square Test, t Test, Nonparametric Test
- Bayes's Theorem
- Bayesian Analysis
- Bayesian Evidence Synthesis
- Bayesian Networks
- Bias
- Bias in Scientific Studies
- Brier Scores
- Calibration
- Case Control
- Causal Inference and Diagrams
- Causal Inference in Medical Decision Making
- Conditional Independence
- Conditional Probability
- Confidence Intervals
- Confounding and Effect Modulation
- Cox Proportional Hazards Regression
- Decision Rules
- Diagnostic Tests
- Discrimination
- Distributions: Overview
- Dynamic Treatment Regimens
- Effect Size
- Equivalence Testing
- Experimental Designs
- Factor Analysis and Principal Components Analysis
- Fixed Versus Random Effects
- Frequentist Approach
- Hazard Ratio
- Hypothesis Testing
- Index Test
- Intraclass Correlation Coefficient
- Likelihood Ratio
- Log-Rank Test
- Logic Regression
- Logistic Regression
- Maximum Likelihood Estimation Methods
- Measures of Central Tendency
- Measures of Frequency and Summary
- Measures of Variability
- Meta-Analysis and Literature Review
- Mixed and Indirect Comparisons
- Multivariate Analysis of Variance (MANOVA)
- Nomograms
- Number Needed to Treat
- Odds and Odds Ratio, Risk Ratio
- Ordinary Least Squares Regression
- Parametric Survival Analysis
- Poisson and Negative Binomial Regression
- Positivity Criterion and Cutoff Values
- Prediction Rules and Modeling
- Probability
- Propensity Scores
- Randomized Clinical Trials
- Receiver Operating Characteristic (ROC) Curve
- Recurrent Events
- Recursive Partitioning
- Regression to the Mean
- Sample Size and Power
- Screening Programs
- Statistical Notations
- Statistical Testing: Overview
- Subjective Probability
- Subset Analysis: Insights and Pitfalls
- Survival Analysis
- Tables, Two-by-Two and Contingency
- Variance and Covariance
- Violations of Probability Theory
- Weighted Least Squares
- Decision Analysis and Related Mathematical Models
- Applied Decision Analysis
- Boolean Algebra and Nodes
- Decision Analyses, Common Errors Made in Conducting
- Decision Curve Analysis
- Decision Tree: Introduction
- Decision Trees, Advanced Techniques in Constructing
- Decision Trees, Construction
- Decision Trees, Evaluation
- Decision Trees, Evaluation With Monte Carlo
- Decision Trees: Sensitivity Analysis, Basic and Probabilistic
- Decision Trees: Sensitivity Analysis, Deterministic
- Declining Exponential Approximation of Life Expectancy
- Deterministic Analysis
- Discrete-Event Simulation
- Disease Management Simulation Modeling
- Expected Value of Sample Information, Net Benefit of Sampling
- Influence Diagrams
- Markov Models
- Markov Models, Applications to Medical Decision Making
- Markov Models, Cycles
- Markov Processes
- Reference Case
- Steady-State Models
- Stochastic Medical Informatics
- Subtrees, Use in Constructing Decision Trees
- Test-Treatment Threshold
- Time Horizon
- Tornado Diagram
- Tree Structure, Advanced Techniques
- Health Outcomes and Measurement
- Complications or Adverse Effects of Treatment
- Cost-Identification Analysis
- Costs, Direct Versus Indirect
- Costs, Fixed Versus Variable
- Costs, Opportunity
- Costs, Out-of-Pocket
- Costs, Semifixed Versus Semivariable
- Costs, Spillover
- Economics, Health Economics
- Efficacy Versus Effectiveness
- Efficient Frontier
- Health Outcomes Assessment
- Health Status Measurement Standards
- Health Status Measurement, Assessing Meaningful Change
- Health Status Measurement, Construct Validity
- Health Status Measurement, Face and Content Validity
- Health Status Measurement, Floor and Ceiling Effects
- Health Status Measurement, Generic Versus Condition-Specific Measures
- Health Status Measurement, Minimal Clinically Significant Differences, and Anchor Versus Distribution Methods
- Health Status Measurement, Reliability and Internal Consistency
- Health Status Measurement, Responsiveness and Sensitivity to Change
- Human Capital Approach
- Life Expectancy
- Morbidity
- Mortality
- Oncology Health-Related Quality of Life Assessment
- Outcomes Research
- Patient Satisfaction
- Regret
- Report Cards, Hospitals and Physicians
- Risk Adjustment of Outcomes
- SF-36 and SF-12 Health Surveys
- SF-6D
- Sickness Impact Profile
- Sunk Costs
- Impact or Weight or Utility of the Possible Outcomes
- Certainty Equivalent
- Chained Gamble
- Conjoint Analysis
- Contingent Valuation
- Cost Measurement Methods
- Decomposed Measurement
- Disability-Adjusted Life Years (DALYs)
- Discounting
- Discrete Choice
- Disutility
- EuroQol (EQ-5D)
- Health Utilities Index Mark 2 and 3 (HUI2, HUI3)
- Healthy Years Equivalents
- Holistic Measurement
- Multi-Attribute Utility Theory
- Person Trade-Off
- Quality of Well-Being Scale
- Quality-Adjusted Life Years (QALYs)
- Quality-Adjusted Time Without Symptoms or Toxicity (Q-TWiST)
- SMARTS and SMARTER
- Split Choice
- Utilities for Joint Health States
- Utility Assessment Techniques
- Willingness to Pay
- Other Techniques, Theories, and Tools
- Artificial Neural Networks
- Bayesian Networks
- Bioinformatics
- Chaos Theory
- Clinical Algorithms and Practice Guidelines
- Complexity
- Computer-Assisted Decision Making
- Constraint Theory
- Decision Board
- Decisional Conflict
- Error and Human Factors Analyses
- Ethnographic Methods
- Expert Systems
- Patient Decision Aids
- Qualitative Methods
- Story-Based Decision Making
- Support Vector Machines
- Team Dynamics and Group Decision Making
- Threshold Technique
- Perspective of the Decision Maker
- Advance Directives and End-of-Life Decision Making
- Consumer-Directed Health Plans
- Cultural Issues
- Data Quality
- Decision Making in Advanced Disease
- Decisions Faced by Hospital Ethics Committees
- Decisions Faced by Institutional Review Boards
- Decisions Faced by Nongovernment Payers of Healthcare: Managed Care
- Decisions Faced by Patients: Primary Care
- Decisions Faced by Surrogates or Proxies for the Patient, Durable Power of Attorney
- Diagnostic Process, Making a Diagnosis
- Differential Diagnosis
- Evaluating and Integrating Research Into Clinical Practice
- Evidence Synthesis
- Evidence-Based Medicine
- Expert Opinion
- Genetic Testing
- Government Perspective, General Healthcare
- Government Perspective, Informed Policy Choice
- Government Perspective, Public Health Issues
- Health Insurance Portability and Accountability Act Privacy Rule
- Health Risk Management
- Informed Consent
- Informed Decision Making
- International Differences in Healthcare Systems
- Law and Court Decision Making
- Medicaid
- Medical Decisions and Ethics in the Military Context
- Medical Errors and Errors in Healthcare Delivery
- Medicare
- Models of Physician–Patient Relationship
- Patient Rights
- Physician Estimates of Prognosis
- Rationing
- Religious Factors
- Shared Decision Making
- Surrogate Decision Making
- Teaching Diagnostic Clinical Reasoning
- Technology Assessments
- Terminating Treatment, Physician Perspective
- Treatment Choices
- Trust in Healthcare
- The Psychology Underlying Decision Making
- Accountability
- Allais Paradox
- Associative Thinking
- Attention Limits
- Attraction Effect
- Automatic Thinking
- Axioms
- Biases in Human Prediction
- Bounded Rationality and Emotions
- Certainty Effect
- Cognitive Psychology and Processes
- Coincidence
- Computational Limitations
- Confirmation Bias
- Conflicts of Interest and Evidence-Based Clinical Medicine
- Conjunction Probability Error
- Context Effects
- Contextual Error
- Counterfactual Thinking
- Cues
- Decision Making and Affect
- Decision Modes
- Decision Psychology
- Decision Weights
- Decision-Making Competence, Aging and Mental Status
- Deliberation and Choice Processes
- Developmental Theories
- Dual-Process Theory
- Dynamic Decision Making
- Editing, Segregation of Prospects
- Emotion and Choice
- Errors in Clinical Reasoning
- Experience and Evaluations
- Fear
- Frequency Estimation
- Fuzzy-Trace Theory
- Gain/Loss Framing Effects
- Gambles
- Hedonic Prediction and Relativism
- Heuristics
- Human Cognitive Systems
- Information Integration Theory
- Intuition Versus Analysis
- Irrational Persistence in Belief
- Judgment
- Judgment Modes
- Learning and Memory in Medical Training
- Lens Model
- Lottery
- Managing Variability and Uncertainty
- Memory Reconstruction
- Mental Accounting
- Minerva-DM
- Mood Effects
- Moral Factors
- Motivation
- Numeracy
- Overinclusive Thinking
- Pain
- Pattern Recognition
- Personality, Choices
- Preference Reversals
- Probability Errors
- Probability, Verbal Expressions of
- Problem Solving
- Procedural Invariance and Its Violations
- Prospect Theory
- Range-Frequency Theory
- Risk Attitude
- Risk Aversion
- Risk Communication
- Risk Perception
- Scaling
- Social Factors
- Social Judgment Theory
- Stigma Susceptibility
- Support Theory
- Uncertainty in Medical Decisions
- Unreliability of Memory
- Value Functions in Domains of Gains and Losses
- Worldviews
- Loading...
Get a 30 day FREE TRIAL
-
Watch videos from a variety of sources bringing classroom topics to life
-
Read modern, diverse business cases
-
Explore hundreds of books and reference titles
Sage Recommends
We found other relevant content for you on other Sage platforms.
Have you created a personal profile? Login or create a profile so that you can save clips, playlists and searches