Skip to main content icon/video/no-internet

Inference is a process whereby a conclusion is drawn without complete certainty, but with some degree of probability relative to the evidence on which it is based.

Survey data may be used for description or for analysis. Descriptive uses include making estimates of population totals, averages, and proportions. Analytical uses include testing hypotheses about the population. The analytical uses involve making statistical inferences. For example, a descriptive use of survey data would be to supply an estimate of the number of male and female engineers. An analytical use would be to infer (based on valid statistical procedures) that there are significantly fewer female engineers than male engineers. Another descriptive use of survey data would be to supply the average salary of teachers. An inferential use would be to conclude that, even after controlling for education and experience, teachers of one racial-ethnic category tend to have a higher average salary than those in another racial-ethnic category.

Design-Based and Model-Based Inferences

There are two approaches to making inferences from survey data. First, in the design-based approach, inferences are made by looking at how statistics vary as samples are repeatedly drawn using the same sampling procedures as were employed in the actual sampling.

Second, in the model-based approach, inferences are made by looking at how statistics vary as the population, as described by a probability model, is allowed to vary without changing the sample. The model-based approach is also called the prediction approach because the model is used to predict the population units not in the sample. It is called the superpopulation approach as well because the population can be regarded as selected from a still larger population according to the probability model.

Inference procedures (e.g. hypothesis testing or estimating confidence intervals) can be carried out under either the design-based or the model-based approach. The design-based approach is more traditional in survey sampling. The model-based approach, on the other hand, is more consistent with statistical approaches used outside of survey sampling.

Confidence Intervals

Confidence intervals allow one to infer with a high degree of confidence that a quantity being estimated lies within an interval computed by a specified procedure. The precise meaning of “confidence” depends on whether one is adopting the design-based or model-based approach. Clearly a confidence interval is more informative than a numerical estimate of a population quantity (called a point estimate) in that the confidence interval conveys information about how precise the point estimate is.

Hypothesis Testing

The purpose of hypothesis testing is to ascertain whether an observed difference in the sample is statistically significant or whether it can instead be adequately explained by chance alone. Hypothesis tests are designed so that, if there is in fact no difference, the probability of (erroneously) rejecting the hypothesis that there is no difference (i.e. the null hypothesis) is kept to a specified low level; often this probability, called the Type I error, is set to .05. A well-designed hypothesis test will also minimize the other potential error on inference, namely, not rejecting the hypothesis of no difference when a difference actually exists (i.e. Type II error). In survey sampling, it is often the case that two sample averages are independent and approximately normally distributed so the hypothesis that their difference is zero can be tested using properties of the normal distribution (this is called a t-tesi).

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading