ADVERTISEMENT

If you are seeing this message, you may be experiencing temporary network problems. Please wait a few minutes and refresh the page. If the problem persists, you may wish to report it to your local Network Manager.

It is also possible that your web browser is not configured or not able to display style sheets. In this case, although the visual presentation will be degraded, the site should continue to be functional. We recommend using the latest version of Microsoft or Mozilla web browser to help minimise these problems.

Wiley InterScience

Next Abstract >

Save Article to My Profile      Download Citation      Request Permissions

Abstract |  References  |  Full Text: HTML, PDF (Size: 143K)  | Related Articles | Citation Tracking

A Comparison of the Common-Item and Random-Groups Equating Designs Using Empirical Data
Dong-In Kim * , Seung W. Choi ** , Guemin Lee *** and Kooghyang R. Um ****
  * CTB/McGraw-Hill, USA
  ** Northwestern University, USA
  *** Department of Education, Yonsei University, 134 Shinchon-dong Seodaemun-ku, Seoul, Korea.
guemin@yonsei.ac.kr

  **** Pearson Educational Measurement, USA
Copyright Journal compilation © 2008 Blackwell Publishing Ltd

ABSTRACT

We designed this study to evaluate several data collection and equating designs in the context of item response theory (IRT) equating. The random-groups design and the common-item design have been widely used for collecting data for IRT equating. In this study, we investigated four equating methods based upon these two data collection designs, using empirical data from a number of different testing programs. When the randomly equivalent group assumption was reasonably met, the four equating methods tended to produce highly comparable results. On the other hand, equating methods based upon either of the equating designs produced dissimilar results. Sample size can have differential effects on the equating results produced by the different equating methods. In practice, a common-item equivalent-groups design often produces unacceptably large differences in the group mean due to various anomalies such as context effects, poor quality of common items, or a very small number of common items. In such cases, a random-groups design would produce more stable equating results.


DIGITAL OBJECT IDENTIFIER (DOI)
10.1111/j.1468-2389.2008.00413.x About DOI

Related Articles

  • Find other articles like this in Wiley InterScience
  • Find articles in Wiley InterScience written by any of the authors

Wiley InterScience is a member of CrossRef.

Cross Ref Member



FREE ONLINE ACCESS

Guidelines and Ethical Considerations for Assessment Center Operations
International Task Force on Assessment Center Guidelines

Assessment Center Dimensions: Individual differences correlates and meta-analytic incremental validity
Stephan Dilchert, Deniz S. Ones

Special Issue
Go to journal homepage

International Journal of Selection and Assessment
Volume 17
Issue 4

Applicant Perspectives in Selection: Going beyond Preferences in Reactions
Guest Edited Ute R. Hülsheger & Neil Anderson

Free access to Guest Editorial:
Applicant Perspectives in Selection: Going beyond preference reactions

Business & Management