Skip to main content icon/video/no-internet

Item Analysis

Item analysis is the set of qualitative and quantitative techniques and procedures used to evaluate the characteristics of items of the test before and after the test development and construction. An item is a basic building block of a test, and its analysis provides information about its performance. Item analysis allows selecting or omitting items from the test, but more important, item analysis is a tool to help the item writer improve an item. Anthony Nitko suggests that some classroom uses of an item analysis might be estimating whether an item functions as intended, providing feedback to students about their performance, providing feedback to the teacher about student difficulties and ideas of curriculum improvement, revising assessment tasks, and improving item-writing skills.

Item analysis can be used both for dichotomously scored (correct or incorrect) items and polytomously scored (with more than two score categories) items. The main purpose of item analysis is to improve internal consistency or internal structure validity, focused on confirming a single-factor or one-trait test. If the trait is not one factor, then the use of item analysis might tend to lower validity. If a test has two factors (or content divisions) or is multifactored (more than two content divisions), the calculation of item statistics for each item (or option) should be focused on the subtotal for the relevant set of items rather than on the total test score. Item analysis in this case is used to improve the internal consistency of each subset of items with no intention to change the dimensionality of the entire set. In these cases, an overall reliability index would be stratified alpha (or battery alpha) rather than the regular coefficient alpha as used for a one-dimensional test.

A test that is composed of items selected based on item analysis statistics tends to be more reliable than one composed of an equal number of unanalyzed items. Even though the process of item analysis looks sophisticated, it is not as challenging as it seems. Item analysis software programs are user friendly and make the task much simpler. The most time-consuming part of item analysis might be the process of tabulating the data to be analyzed by the software program. Statistical programs such as SPSS, an IBM product, and SAS can be used for item analysis as well. The information needed to start an item analysis procedure after the administration is the work of the examinees. In the subsequent sections, several aspects of the item analysis process are discussed.

Norm-Referenced versus Criterion-Referenced Interpretations

Norm-referenced and criterion-referenced interpretations are two different types of score interpretations. Norm-referenced interpretations help to locate an examinee's position within a well-defined group, and criterion-referenced interpretations are used in describing a degree of proficiency in a specified content domain. Use of college admission tests is an example of norm-referenced interpretations, and most tests and quizzes written by teachers are examples of criterion-referenced interpretations. A criterion might be the lesson curriculum or the state standards. Some tests are intended for both types of interpretations; for example, some states use standards-based tests for both purposes. With standards-based tests, the criterion-referenced interpretation is intended to give information about how proficient the students are in the curriculum defined by the state standards. The norm-referenced interpretation provides a measure of how each student compares with peers.

...

  • Loading...
locked icon

Sign in to access this content

Get a 30 day FREE TRIAL

  • Watch videos from a variety of sources bringing classroom topics to life
  • Read modern, diverse business cases
  • Explore hundreds of books and reference titles

Sage Recommends

We found other relevant content for you on other Sage platforms.

Loading