Advanced Search

Journal Navigation

Journal Home

Subscriptions

Archive

Contact Us

Table of Contents

Click here to sign up for SAGE Journal Email Alerts today!

Sign In to gain access to subscriptions and/or personal tools.
Journal of Educational and Behavioral Statistics
This Article
Right arrow Full Text (PDF)
Right arrow References
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Alert me to new issues of the journal
Right arrow Add to Saved Citations
Right arrow Download to citation manager
Right arrowRequest Permissions
Right arrow Request Reprints
Right arrow Add to My Marked Citations
Citing Articles
Right arrow Citing Articles via Google Scholar
Right arrow Citing Articles via Scopus
Google Scholar
Right arrow Articles by Raghunathan, T. E.
Right arrow Articles by Cheadle, A. D.
Right arrow Search for Related Content
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati   Add to Twitter  
What's this?

Articles

Combining Aggregate and Individual Level Data to Estimate an Individual Level Correlation Coefficient

Trivellore E. Raghunathan

University of Michigan

Paula K. Diehr

University of Washington

Allen D. Cheadle

University of Washington

Methods are developed that use aggregate data, possibly based on a large number of individuals, and individual level data, from a small fraction of individuals from the same or similar population, to eliminate ecological bias inherent in the analysis of aggregate data. The primary focus is on estimating the individual level correlation coefficient but the proposed methodology can be extended to estimate regression coefficients. Two approaches, the method of moments and the maximum likelihood, are developed for a bivariate distribution, but can be extended to a multivariate distribution. The method of moments develops a corrected estimate of the within-group covariance matrix, which is then used to estimate the individual level correlation and regression coefficients. The second method assumes bivariate normality and maximizes the combined likelihood function based on the two data sets. The maximum likelihood estimates are obtained using the EM-algorithm. A simulation study investigates the repeated sampling properties of these procedures in terms of bias and the mean square error of the point estimates and the actual coverage of the confidence intervals. The maximum likelihood estimates are almost unbiased and the confidence intervals are well calibrated for simulation conditions considered. The method of moments estimates have the same desirable properties for some simulation conditions. Under all conditions, the correlation coefficient between aggregate variables is severely biased as an estimate of the individual level correlation coefficient.

Key Words: ecological fallacy • EM-algorithm • maximum likelihood • method of moments • random effects

Journal of Educational and Behavioral Statistics, Vol. 28, No. 1, 1-19 (2003)
DOI: 10.3102/10769986028001001


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati   Add to Twitter Twitter    What's this?




AER home page RER home page JEB home page EPA home page RRE home page