Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here to sign up for SAGE Journal Email Alerts today!

Sign In to gain access to subscriptions and/or personal tools.
Journal of Educational and Behavioral Statistics
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Right arrow Citation Map
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via HighWire
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Patz, R. J.
Right arrow Articles by Mariano, L. T.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Articles

The Hierarchical Rater Model for Rated Test Items and its Application to Large-Scale Educational Assessment Data

Richard J. Patz

CTB/McGraw-Hill

Brian W. Junker

Carnegie Mellon University

Matthew S. Johnson

Baruch College

Louis T. Mariano

RAND

Open-ended or "constructed" student responses to test items have become a stock component of standardized educational assessments. Digital imaging of examinee work now enables a distributed rating process to be flexibly managed, and allocation designs that involve as many as six or more ratings for a subset of responses are now feasible. In this article we develop Patz’s (1996) hierarchical rater model (HRM) for polytomous item response data scored by multiple raters, and show how it can be used to scale examinees and items, to model aspects of consensus among raters, and to model individual rater severity and consistency effects. The HRM treats examinee responses to open-ended items as unobsered discrete varibles, and it explicitly models the "proficiency" of raters in assigning accurate scores as well as the proficiency of examinees in providing correct responses. We show how the HRM "fits in" to the generalizability theory framework that has been the traditional tool of analysis for rated item response data, and give some relationships between the HRM, the design effects correction of Bock, Brennan and Muraki (1999), and the rater bundle model of Wilson and Hoskens (2002). Using simulated and real data, we compare the HRM to the conventional IRT Facets model for rating data (e.g., Linacre, 1989; Engelhard, 1994, 1996), and we explore ways that information from HRM analyses may improved the quality of the rating process.

Key Words: generalizability • hierarchical Bayes modeling • item response theory • latent response model • Markov chain Monte Carlo • Multiple ratings • rater consensus • rater consistency • rater severity

Journal of Educational and Behavioral Statistics, Vol. 27, No. 4, 341-384 (2002)
DOI: 10.3102/10769986027004341


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?


This article has been cited by other articles:


Home page
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICSHome page
L. T. Mariano and B. W. Junker
Covariates of the Rating Process in Hierarchical Models for Multiple Ratings of Test Items
Journal of Educational and Behavioral Statistics, September 1, 2007; 32(3): 287 - 314.
[Abstract] [Full Text] [PDF]


Home page
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICSHome page
R. J. Mislevy
Can There Be Reliability without "Reliability?"
Journal of Educational and Behavioral Statistics, January 1, 2004; 29(2): 241 - 244.
[PDF]


Home page
JOURNAL OF EDUCATIONAL AND BEHAVIORAL STATISTICSHome page
M. S. Johnson and B. W. Junker
Using Data Augmentation and Markov Chain Monte Carlo for the Estimation of Unfolding Response Models
Journal of Educational and Behavioral Statistics, January 1, 2003; 28(3): 195 - 230.
[Abstract] [PDF]



AER home page RER home page JEB home page EPA home page RRE home page