1 / 56

Teaching Tactics and Simulating Human Tutor Dialog Moves in Auto Tutor

Teaching Tactics and Simulating Human Tutor Dialog Moves in Auto Tutor. Rupali Mukherjee and Marvin Schiller. 8. Juli 2003. Intelligent Tutoring Systems, Dr. Ivana Kruijff-Kurbayová, Dr. Helmut Horacek, SS 2003. Overview. Introduction. Architecture. What is AutoTutor?

Download Presentation

Teaching Tactics and Simulating Human Tutor Dialog Moves in Auto Tutor

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Teaching Tactics and Simulating Human Tutor Dialog Moves in Auto Tutor Rupali Mukherjee and Marvin Schiller 8. Juli 2003 Intelligent Tutoring Systems, Dr. Ivana Kruijff-Kurbayová, Dr. Helmut Horacek, SS 2003

  2. Overview Introduction Architecture • What is AutoTutor? • The need for AutoTutor • Teaching Tactics • Analysis of unaccomplished tutoring Evaluation • Curriculum script • Dialog Move Generation • Dialog Management • Language Analysis • „Virtual Students“ • Human Students • Conclusion/Discussion 2

  3. An Introduction to AutoTutor • Simulates discourse patterns & pedagogical strategies of human tutors • Developed by the „Tutorial Research Group“ (University of Memphis) • Domain: computer literacy (college) • Aim: conversational partner 3

  4. A Snapshot of AutoTutor 4

  5. Sample Dialog AutoTutor: When you turn on the computer, how is the operating system first activated and loaded into RAM? Student: When the computer is turned on a file is automatically booted up. AutoTutor: Anything else? Student: This file contains the operating system. AutoTutor: What else? Student: The operating system is then read into RAM and activated. AutoTutor: Right. The CPU exectues instructions in ROM, which helps the operating system to start. Student: After being read from ROM the OS is activated. AutoTutor: Right. Let's review what we've just gone over. The operating system [...] 5

  6. Overview Introduction Architecture • What is AutoTutor? • The need for AutoTutor • Teaching Tactics • Analysis of unaccomplished tutoring Evaluation • Curriculum script • Dialog Move Generation • Dialog Management • Language Analysis • „Virtual Students“ • Human Students • Conclusion/Discussion 6

  7. The need for AutoTutor One-on-one Tutoring Classroom Teaching • Construction of knowledge via interaction (constructivism) • Deep comprehension • Information delivery • Acquisition of shallow knowledge AutoTutor 7

  8. Overview Introduction Architecture • What is AutoTutor? • The need for AutoTutor • Teaching Tactics • Analysis of unaccomplished tutoring Evaluation • Curriculum script • Dialog Move Generation • Dialog Management • Language Analysis • „Virtual Students“ • Human Students • Conclusion/Discussion 8

  9. Teaching Tactics in Auto Tutor Constructivism: student actively constructs knowledge • each person forms their own representation of knowledge • learning: matching own current representations with own experience • interaction necessary for learning process Auto Tutor 1: models unaccomplished tutors Auto Tutor 2: sophisticated tutoring 9

  10. Overview Introduction Architecture • What is AutoTutor? • The need for AutoTutor • Teaching Tactics • Analysis of unaccomplished tutoring Evaluation • Curriculum script • Dialog Move Generation • Dialog Management • Language Analysis • „Virtual Students“ • Human Students • Conclusion/Discussion 10

  11. An anatomy of unskilled one-on-one Tutoring • One-on-one unskilled tutoring is effective • (effect size 0.5-2.3 sdu. over classroom teaching) (Bloom, 1984; Cohen, Kulik &Kulik 1982) (1 sdu. ~ 1 letter grade) • But: • usually no expert domain knowledge • no sophisticated tutoring strategies 11

  12. Analysis of unaccomplished Tutoring - The Setting Analysis of 100 hrs of naturalistic one-on-one tutoring • grad. students teaching undergrad. students basic research methods • middle school students teaching younger students basic algebra Result: rarely use sophisticated strategies. But 2 methods: a 5-step dialog frame, tutor-initiated dialog moves 12

  13. 5 Step Dialog Frame in one-on-one Tutoring 5 Step Dialog Frame Step 1: Tutor asks question (or presents problem) Step 2: Learner answers question Step 3: Tutor gives short immediate feedback Step 4: Tutor and Learner collaboratively improve the answer Step 5: Tutor assesses learner's understanding 13

  14. 3 Step Dialog Frame in Classroom Teaching Classroom Dialog Pattern Initiation Step 1: Tutor asks question Step 2: Learner answers question Step 3: Tutor gives short immediate feedback Step 4: Tutor and Learner collaboratively improve the answer Step 5: Tutor assesses learner's understanding Response Evaluation Step 4 makes the difference! 14

  15. Dialog Move Categories Dialog Moves are sensible to quality and quantity of the preceding contribution by the student. 1. Positive Immediate Feedback - „That's right“ „Yeah“ 2. Neutral Immediate Feedback - „Okay“ „Uh-huh“ 3. Negative Immediate Feedback - „Not quite“ „No“ 4. Prompting for more information - „Uh-huh“ „What else“ 5. Prompting (for specific information) - „If you add RAM, the CPU can store more data and larger ______?“ 6. Hinting - „What about the size of programs you need to run?“ 7. Elaboration - „With additional RAM, you can handle larger programs“ 8. Splicing in/correcting content after a student error - „Storing the program on a floppy disk will not help you to run the program.“ 9. Summarizing - „So to recap,...“ 15

  16. Overview Introduction Architecture • What is AutoTutor? • The need for AutoTutor • Teaching Tactics • Analysis of unaccomplished tutoring Evaluation • Curriculum script • Dialog Move Generation • Dialog Management • Language Analysis • „Virtual Students“ • Human Students • Conclusion/Discussion 16

  17. Curriculum Script Loosely structured lesson plans (organise topics & content) 3 Macrotopics hardware operating systems internet 12 Topics each Topic: basic concepts focal question ideal answers, answer aspects hints, prompts anticipated bad answers corrections for bad answers a summary 17

  18. Curriculum Script - Example Topic \info-8 Large, multi-user computers often work on several jobs simultaneously. This is known as concurrent processing. (...) So here's your question. \question-8 How does the operating system of a typical computer process several jobs with one CPU? basic concepts focal question 18

  19. Curriculum Script - Example Topic (II) good answer aspect (GAA) \pgood-8-1 The OS helps the computer to work on several jobs simultaneously by rapidly switching back and forth between jobs. \phint-8-1-1 How can the OS take advantage of idle time on the job? \phintc-8-1-1 The operating system switches between jobs. hint 19

  20. Curriculum Script - Example Topic (III) \ppromt-8-1-1 The operating system switches rapidly between _ \ppromptk-8-1-1 jobs \bad-8-1 The operating system completes one job at a time and then works on another. \splice-8-1 The operating system can work on several jobs at once. prompt bad answer correction 20

  21. Overview Introduction Architecture • What is AutoTutor? • The need for AutoTutor • Teaching Tactics • Analysis of unaccomplished tutoring Evaluation • Curriculum script • Dialog Move Generation • Dialog Management • Language Analysis • „Virtual Students“ • Human Students • Conclusion/Discussion 21

  22. The Dialog Advancer Network (DAN) • Mechanism for enhancing AutoTutor's conversational skills • Enables AutoTutor to: • adapt each dialog move to learner's • previous turn • indicate when the learner has the floor for • contributions 22

  23. Role of the DAN - Turn-adaption Coherence emerges in human conversations Reason: participants generally adapt their turns so that they are relevant to preceding turn • adapt each dialog move to learner's previous turn „Turn-adaption“ problematic: content of dialog moves is predetermined DAN: make quasi-adapted dialog moves relevant to learner's previous turn. 23

  24. Role of the DAN - Turn-taking • indicate when the learner has the floor for contributions • Turn-taking: integral feature of of conversational process • Speakers signal to listeners that they are relinquishing the floor (facilitates turn-taking in human-to-human conversation) • If AutoTutor lacks this, users often do not know when or if to respond (in early versions, often confusion after Hints, Elaboration and Prompt Response dialog moves) • Current versions: use of linguistic discourse markers to disambiguate conversation • Next versions of AutoTutor: also gestures and paralinguistic signals (e.g. eye gaze) 24

  25. DAN Repeat Advancer State Select Discourse Markter „Once Again“ + Prev. Turn D.Move. Comprehension Advancer State Select Discourse Markter „Well“ or „I see“ + Pump or Hint Classifies Frozen Expression Select Pump Select Hint Select Short Feedback Student Turn N+1 Tutor Selects Dialog Move Select Discourse Marker „Okay“ or „Moving on“ Tutor Adapts Select Elaboration Answers WH or Yes/No question Student Turn N Advancer State Select Discourse Marker „Okay“ Select Summary Tutor Asks next Topic Question Advancer State Advancer State 25

  26. DAN - example pathway AutoTutor: Well, where is most of the information you type in temporarily stored? Student Turn N Adaption Select Short Feedback Student: RAM Tutor selects Dialog Move AutoTutor: Right! In RAM. select summary AutoTutor: Let's review, after you enter information, it is sent to the CPU. The CPU carries out the instructions on the data Advancer State asks next tutor topic question AutoTutor: Okay. AutoTutor: How does the OS of a typical computer process several jobs simultaneously with only one CPU?“ Student Turn N + 1 26

  27. Effect of the DAN • Development of the DAN: interaction with students improved considerably • Numerous pathways: refine micro-adaption skills • Eradication of turn-taking confusion by Advancer States • Enhances overall effectiveness as tutor and conversational partner 27

  28. Analysis of DAN Pathway Frequency Distribution • 64 computer literacy students interacted with AutoTutor (for course credits) • 24 topics covered in each tutoring session • written transcripts generated for each session • 3 of the 24 topics were randomly selected -> analysis of 192 mini-conversations 28

  29. Analysis of DAN Pathway Frequency Distribution - Results Result: most frequently travelled pathways: 35% of all paths } Prompt Response - Advancer - Prompt Positive Feedback - Prompt Response - Advancer - Prompt Conclusion: Too many prompts! Leads to short answers (but goal of AutoTutor: longer, conversational contributions) Remedy: modification of triggering conditions for prompts 29

  30. Dialog Move Selection Repeat Advancer State Select Discourse Markter „Once Again“ + Prev. Turn D.Move. Comprehension Advancer State Select Discourse Markter „Well“ or „I see“ + Pump or Hint Classifies Frozen Expression Select Pump Select Hint Select Short Feedback Student Turn N+1 Tutor Selects Dialog Move Select Discourse Marker „Okay“ or „Moving on“ Tutor Adapts Select Elaboration Answers WH or Yes/No question Student Turn N Advancer State Select Discourse Marker „Okay“ Select Summary Tutor Asks next Topic Question Advancer State Advancer State 30

  31. Student's contribution Language Analysis Word Segmenter Syntactic Class Identifier Speech Act Classification • Assertion • WH-question • Yes-/No- question • Directive • Short Response Latent Semantic Analysis 31

  32. Language Analysis Latent Semantic Analysis • Computation of a relatedness score between two sets of words • Compression of a corpus of texts (here: curriculum script, textbooks, articles) into a k-dimensional LSA-space • Purely statistical method (no deep understanding) 32

  33. Dialog Move Selection via 15 Production Rules sensitive to • assertion quality of preceding turn • dialog history (global variables: ability, verbosity, initiative of learner) • extent of coverage of GAA's Examples: IF [student Assertion match with GAA = HIGH or VERY HIGH] THEN [select POSITIVE FEEDBACK] IF[student ability = MEDIUM or HIGH & Assertion match with good answer aspect = LOW THEN [select HINT] 33

  34. Dialog Move Selection - Selection of next Good Answer Aspect focal question A1 A2 A3 ..... An good answer aspects all need to be covered • each Ai has coverage metric between 0 and 1 (computed by LSA, updated with each assertion) • each Ai covered if coverage metric above a threshold 34

  35. Dialog Move Selection - Selection of next Good Answer Aspect (II) A2 is covered (above threshold) coverage values Threshold A1 A5 A2 A4 A3 A5 has highest subthreshold value - selected as next GAA to be covered • AutoTutor-1: all contributions count • AutoTutor-2: only student contributions are considered 35

  36. Overview Introduction Architecture • What is AutoTutor? • The need for AutoTutor • Teaching Tactics • Analysis of unaccomplished tutoring Evaluation • Curriculum script • Dialog Move Generation • Dialog Management • Language Analysis • „Virtual Students“ • Human Students • Conclusion/Discussion 36

  37. Evaluation with Virtual Students • Creation of virtual students • Tutoring sessions with virtual students • Evaluation by experts in language and pedagogy • (ratings between 1 [very poor] and 6 [very good]) • Revision and adjustment of AutoTutor Evaluation criteria: • discrimination of learner ability • choice of appropriate dialog moves 2 judges 2 judges Pedagogical effectiveness -pedagogical aspects - dialog reasonable for normal human tutor? Conversational appropriateness - politeness norms - quality, quantity, relevance, manner (Gricean maxims) 37

  38. Creation of Virtual Students • 36 topics in the curriculum script answered by ~100 human computer literacy students • Quality of each answer rated by judges • Creation of 7 virtual student „prototypes“ • contributions taken from „good“ answer samples • 2-3 assertions each turn Good verbose student: Good succinct student: • contributions taken from „good“ answer samples • 1 assertion each turn Vague student: • contributions contain „vague“ assertions Erroneous student: • contributions contain assertions with misconceptions 38

  39. Creation of Virtual Students (II) • 36 topics in the curriculum script answered by ~100 human computer literacy students • Quality of each answer rated by judges • Creation of 7 virtual student „prototypes“ Mute student: • contributions „semantically depleted“: „Well“, „Okay“, ... • first 5 turns contain 1 good assertion • contributions from same human student Good coherent student: • all classes of assertions Monte Carlo Student: 39

  40. Pedagogical Effectiveness (1. and 2. evaluation cycle) r • 2 judges gave scores between 1 and 6 • PA score for good verbose, good succinct student lower than average 40

  41. Conversational Appropriateness (1. and 2. evaluation cycle) • 2 judges gave scores between 1 and 6 • asymmetry in scores for good and bad students 41

  42. Consequences of the Evaluation Results Measures taken: • Revision of curriculum script (shorter, more conversational sentences) • Dialog moves were given discourse markers • Changes to production rules • Adjustments to LSA values 42

  43. Evaluation Results (before/after revisions) (I) 43

  44. Evaluation Results (before/after revisions) (II) Outcome: the asymmetry has disappeared! 44

  45. Evaluation Results • Some results are „promising“ • Major problem not AutoTutor, but virtual students: • redundancies • incoherence 45

  46. Overview Introduction Architecture • What is AutoTutor? • The need for AutoTutor • Teaching Tactics • Analysis of unaccomplished tutoring Evaluation • Curriculum script • Dialog Move Generation • Dialog Management • Language Analysis • „Virtual Students“ • Human Students • Conclusion/Discussion 46

  47. Effect of AutoTutor on Learning Gains • Assessment of learning gains - 3 conditions • Significant differences in the students’ scores among • the 3 conditions, with means • - AutoTutor 0.43 • - Reread 0.38 • - Control 0.36 • Gains in learning and memory • - size increment of .5 to .6 SD units over control condition. AutoTutor Reread Control 47

  48. „Bystander“ Turing Test 144 Tutor Moves from Dialogs between Students and AutoTutor-1 Transcripts of AutoTutor-1's dialog moves 6 human tutors were asked what they would say at these 144 points ? 36 computer literacy students discriminated: AutoTutor or Human Tutor? 48

  49. „Bystander“ Turing Test 36 computer literacy students discriminated: AutoTutor or Human Tutor? Outcome: discrimination score of -.08 Students are unable to discriminate whether particular dialogue had been generated by a computer vs. a human ! 49

  50. The TRG’s View on the Results • “Impressive” outcome supported claim that AutoTutor • is a good simulation of human tutors. • Attempts to comprehend the student input. • „Almost as good as an expert in computer literacy .“ 50

More Related