Entry
Entries A-Z
Subject index
Standardized Testing
A standardized test uses certain procedures to reduce errors of measurement. It is usually developed by a group of content experts and psychometricians. The former's responsibility is to ensure that the test items adequately sample important content areas of the subject tested. The latter's work is to select items with the appropriate difficulty and discrimination levels. The experts and psychometricians also work together to avoid item bias or test bias. A standardized test also uses standardization to reduce errors during the test administration and scoring processes. Standardization of test administration includes the use of uniform testing time, procedures, and testing environment. People responsible for test administration usually go through special training to ensure the uniformity of the administration process, a prerequisite for the fair comparison of test scores. Standardized tests also use machines for scoring objective items such as multiple-choice questions. For subjective items such as essay questions, special training is used to ensure an adequate degree of agreement between different raters rating the same items. The training also helps a scorer to apply the scoring rubrics in a consistent way during the scoring period.
Types of Standardized Tests
A standardized test is often norm referenced. In other words, the purpose of a standardized test is to compare student test performance to that of a representative student group, or the norm. Students in the norm group are selected based on their demographic background and academic status. Every few years, the norm group is updated so that the results of the representative group are kept current and meaningful. Comparison of student test results against the norm is usually accomplished through the use of standardized test scores. One norm-referenced test score is the percentile rank, which stands for the percentage of students a student outperforms in the norm group. Standardization is considered imperative when there is a need to make comparison of student performance against the norm. The counterpart of a norm-referenced test is a criterion-referenced test. Instead of comparing student performance against other students, a criterion-referenced test uses external criteria or standards as the point of reference, usually local or national performance standards. A more recent term for this type of test is standards-based assessment. To facilitate the understanding of test scores, different performance levels are designed that correspond to different ranges of scale scores. For instance, students who answer most of the items on a subject test correctly will receive very high scale scores; they also receive an “Excellent” score in terms of their performance level. On the other hand, students who answer a minimal yet an acceptable number of items may receive a barely passing scale score, hence a “Basic” score in terms of performance level. It is possible for a test to be both norm referenced and criterion referenced. For instance, most statewide tests that are being used serve both the purpose of comparing students against each other and comparing their performance against state performance standards.
Standardized tests may also be divided into achievement tests and aptitude tests. The former is an assessment of what students have learned or achieved over a period of time, while the latter is a measure of students' potential for future success in academic achievement. It is possible, however, for an achievement test to be used as a measure to predict student future performance. For instance, the subject tests of the Scholastic Achievement Tests (SAT) measure what students have learned in a number of key subject areas. The tests are used to predict if students will succeed in future studies in those areas. On the other hand, the SAT Reasoning Test is an aptitude test that is used to measure student future success in college. Related to aptitude test is the intelligence test, a test used in and outside of education to measure a person's mental ability in general as well as specific areas. One example is the Wechsler Intelligence Scale for Children (WISC).
...
- Loading...
Get a 30 day FREE TRIAL
-
Watch videos from a variety of sources bringing classroom topics to life
-
Read modern, diverse business cases
-
Explore hundreds of books and reference titles
Sage Recommends
We found other relevant content for you on other Sage platforms.
Have you created a personal profile? Login or create a profile so that you can save clips, playlists and searches