1 / 55

Interactive Machine Learning: Leveraging Human Intelligence

Interactive Machine Learning: Leveraging Human Intelligence. Dan R. Olsen Jr. Brigham Young University Dept. of Computer Science. Interactive Machine Learning (IML). More questions than answers. Why is IML of Interest?. Why is IML of Interest?. Exponential Growth Computer processor speed

benard
Download Presentation

Interactive Machine Learning: Leveraging Human Intelligence

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Interactive Machine Learning: Leveraging Human Intelligence Dan R. Olsen Jr. Brigham Young University Dept. of Computer Science

  2. Interactive Machine Learning (IML) More questions than answers

  3. Why is IML of Interest?

  4. Why is IML of Interest? • Exponential Growth • Computer processor speed • Memory size • Available data on the Internet • Static Humans • Fixed USABLE screen size Fixed Cost

  5. Direct manipulation will not scale • Machine Learning can leverage human expression Gigabytes of data

  6. Why is IML of Interest? • Exponential drop in cost • Computing in more domains • Computing freed from the desktop • Interaction • Many new input sensors • Many new situations • No design tools Fixed Power Varying cost

  7. IML Examples • Characteristics of IML • Feedback Effect • When are we done? • Future

  8. Laser-Spot Detection

  9. People Tracking – (Join/Capture)

  10. Skin Tracking – (Light Widgets)

  11. Image Processing with Crayons • Design in minutes not months • Use image painting as the design metaphor • Base the learning on selection from hundreds of features rather than combination of a few

  12. Safe/Unsafe Driving • Problem: Steering robots • Increase operator neglect time so that attention can be used elsewhere • Reduce collisions with unseen objects • Solution: Teach the robot what is safe and what is not

  13. IML Examples • Characteristics of IML • Generalize to artifacts, not just images • Feedback Effect • When are we done? • Future

  14. Unlabeled Artifacts Trained Function Training Algorithm IML User Interface Feedback Labeling Feature Generator Labeled Artifacts Analysis Math Program Analysis Math Program

  15. Unlabeled Artifacts Trained Function Training Algorithm IML User Interface Feedback Labeling Feature Generator ? ? Labeled Artifacts Analysis Math Program Analysis Math Program

  16. Shape and texture features needed to separate grass from trees

  17. Unlabeled Artifacts Seconds not days Trained Function Training Algorithm IML User Interface Feedback Labeling Feature Generator Labeled Artifacts Analysis Math Program Analysis Math Program

  18. Unlabeled Artifacts Trained Function Training Algorithm IML User Interface Feedback Labeling Feature Generator Labeling time dominates the process Labeled Artifacts Analysis Math Program Analysis Math Program

  19. Unlabeled Artifacts Trained Function Training Algorithm IML User Interface Feedback Labeling Feature Generator Rapid discovery of wrong solutions Labeled Artifacts Analysis Math Program Analysis Math Program

  20. Unlabeled Artifacts Trained Function Training Algorithm IML User Interface Feedback Labeling Feature Generator Exploration of a space of problems Labeled Artifacts Analysis Math Program Analysis Math Program

  21. IML Examples • Characteristics of IML • Feedback Effect • When are we done? • Future

  22. Unlabeled Artifacts Does Feedback Matter? Trained Function Training Algorithm ? IML User Interface Feedback Labeling Feature Generator Labeled Artifacts Analysis Math Program Analysis Math Program

  23. Labeling to correct

  24. Decision Surface

  25. Learned Decision Surface

  26. Data->Learn->Feedbackimplicit focus on decision surface(like boosting)

  27. Does feedback/correction reduce effort? • Simulation • Artificial oracles • 1) Train with random selection • 2) Train by selecting only corrections of the decision • 15% less training examples  • Margin-based rather than area-based classifiers

  28. Does feedback/correction reduce effort? • User Studies • Fully annotate 20 pictures to create a standard • 1) Have users “paint” classifications without feedback • 2) Have users “paint” with feedback • No significant difference • Why?

  29. IML Examples • Characteristics of IML • Feedback Effect • Are we done yet? • Future

  30. How does the user find out that the current feature set cannot separate grass from trees?

  31. Estimating Accuracy • Strategies from Machine Learning • Hold-out set • K-fold cross validation

  32. Compare to Real Accuracy

  33. Incremental Difference Estimate • Create a classifier C(i) for training set I • Create a classifier C(i+500) for training set (I+500) • Compare how C(i) and C(i+500) classify unlabeled data. • Percent difference is the error estimate

  34. Incremental Difference Estimate

  35. Why doesn’t cross-validation work?

  36. C(i+1) Area of disagreement Training Myopia All Data Labeled Data C(i)

  37. Use incremental classifier distance to suggest regions to label

  38. Unlabeled Artifacts Does Feedback Matter? Trained Function Training Algorithm ? IML User Interface Feedback Labeling Feature Generator Labeled Artifacts Maybe  Analysis Math Program Analysis Math Program

  39. IML Examples • Characteristics of IML • Observations of User Behavior • Observations of Algorithm Behavior • Future

  40. Other artifacts • Text • Classification • Extraction • Video • Are there multiple people in the room? • Audio • Is someone talking in the room? • Sensor streams • Object being shaken • Objects bumped together • Brain sensing

  41. Unlabeled Artifacts Trained Function Training Algorithm IML User Interface Feedback Labeling Feature Generator Labeled Artifacts Different artifact types have differing labeling effort Analysis Math Program Analysis Math Program

  42. Alternative interfaces • Selection  Classification Select all frames of a football game where the ball is actually In play

  43. Alternative interfaces • Copy and Paste is a learnable data transformation

  44. Alternative interfaces • Similarity metrics • Sesame Street Learning – One of these things is not like the other A Niched Pareto Genetic Algorithm for Multiobjective Optimization Principles And Implementation Of Deductive Parsing Grammatical Trigrams: A Probabilistic Model of Link Grammar

  45. Alternative interfaces • Placing artifacts in a folder structure is classification

  46. Alternative interfaces • Hints • Query by Critique Price too high Wrong color

More Related