| Sign In to gain access to subscriptions and/or personal tools. |
An Interesting Problem in the Estimation of Scoring ReliabilityEducational Testing Service
A performance assessment consisting of 10 separate exercises was scored with a randomized scoring procedure. All responses to each exercise were rated once; in addition, a randomly selected subset of the responses to each exercise received an independent second rating. Each second rating was averaged with the corresponding first rating before the scores were computed. This article presents a method for estimating the scoring reliability (interrater reliability) coefficient and the standard error of scoring for the resulting scores. The report concludes with some numerical examples showing how the reliability estimation procedure can be used to estimate the effect of varying the proportions of responses that are double-scored.
Key Words: interrater reliability performance assessment scoring reliability
Journal of Educational and Behavioral Statistics, Vol. 29, No. 3,
333-341 (2004) |
||||





