1 / 17

Item pocket method to allow response review and change in CAT

Item pocket method to allow response review and change in CAT. Kyung T. Han http://www.linkedin.com/in/hantest. Response review. Aim to reduce examinee’s anxiety during high stakes test. But make CAT less efficient and biased score estimates. Examinee’s test-taking strategies

oralee
Download Presentation

Item pocket method to allow response review and change in CAT

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Item pocket method to allow response review and change in CAT Kyung T. Han http://www.linkedin.com/in/hantest

  2. Response review • Aim to reduce examinee’s anxiety during high stakes test. • But make CAT less efficient and biased score estimates. • Examinee’s test-taking strategies • Wainer strategy • Kingsbury strategy • Generalized Kingsbury (GK) strategy

  3. Wainer strategy • Answered all items incorrectly in round 1, then tried to answer all items correctly in round 2. • Results in positive bias on theta • Maybe happens for high-ability person

  4. Kingsbury strategy • Examinee could distinguish between current and previous item difficulties. • Examinee went back to change response if current item difficulty is easier than previous one. • Assumption: • (a) θ-δ <= -1, then make guess on current response • (b) θ-δ> 0.5, then go back to change response • Low-ability examinee is likely to get the benefit

  5. Generalized Kingsbury strategy • Speculate on the difficulty level of the next item not only for items with guessed responses but also for all previous items. • Strategy offered no meaningful improvement in score estimates in most situations. • Only 61% successful in distinguishing the difficulty difference.

  6. CAT with restricted revision options • Stocking (1997): reduce Wainer effect • Model 1: change response at the end of test with limited number of item • Failed to control if allowable items were larger than 2 • Model 2: multiple separately timed sections and allowed to change responses within section • Model 3: allowed to revise responses only within each item set (common stimulus) • May feel anxiety when make decision to go • Cannot skip items • May use Kingsbury or GK to find clue

  7. Item pocket method • Must answer in the end of test or be scored as incorrect • Advantages: • Reduce anxiety • Items can be skipped and put in the pocket one time • Items in pocket do not affect the interim score and item selection (in turn, make Kingsbryand GK strategies ineffective) • Need no section

  8. Simulation 1 • If robust to Wainer-like strategy • Settings: • 500 items • fixed-length CAT 40 items • MLE • Sympson & Hetter (Rmax = 0.2); or not • Maximum number of items in IP: 0, 2, 4, 6 • Mean absolute error (MAE) and bias • Replications: 25

  9. Simulation 1 • Assume examinees use Wainer-like strategy • Only IP items can be revised (preserve as many easiest items as possible, because examinees think put them in pocket will be scored as wrong). • Answer other non-IP items in normal way • IP size is limited. • Impact on the final score estimates • Not often happen in practical

  10. Results for simulation 1

  11. Simulation 2 • Assumed examinees evaluated the relative difficulty of each item against their proficiency. • 50% finding out a challenging item and put it in pocket if |θ-δ| < 0.5, otherwise 70%. (preserve challenging items) • If IP is full, examinee compare the easiest item in pocket with current challenging item. • If the “easiest” item is easier than challenging item, answer it and put challenging item in pocket. (using 50%&70% rule) • No time limit, no fatigue

  12. Results for simulation 2 • MAE increased by .069, .084, .087 for 2, 4, 6 IP size. • Increase in average bias were .057, .075, .080

  13. Low-ability examinees were likely to see more difficulty items ( due to simulation settings), but not for high-ability examinees.

  14. Discussion • Time limit should be considered • For low-ability examinee, most items put in IP were those initial items due to item selection algorithm selecting an item was based on initial estimate (abound 0).

  15. Conclusion • IP • may reduce anxiety • Minimized the effect of Wainer-like strategy • Immune to Kinsbury and GK • IP size: • Too small or too large

  16. Questions • Why the mean bias is not close to zero when IP size is zero? • I'm curious that why no difference was found between the no exposure control condition and SH method condition?

  17. Future study • Fixed-precision CAT • Everyone has different ability (probability) to tell item difficulty. • Elapsed time of skipping an item • Multiple choice item • Possible to trick IP method? • Utilizing information of IP item (MNAR)

More Related