Building And Interpreting

Building and Interpreting Decision Trees in Enterprise Miner. Getting up to speed: open the HMEQ project you worked on last class. You should drop three nodes in EM: Input, Insight, and Partition (to separate random training and validation sets). K:/(common)/tsupra/MARK2042/.


Presentation Transcript


  1. Building And Interpreting Decision Trees in Enterprise Miner

  2. Getting Up to Speed • Open the HMEQ project you worked on last class. You should drop three nodes in EM: Input, Insight, and Partition (to separate random training and validation sets). • K:/(common)/tsupra/MARK2042/

  3. Building Decision Trees • Add a Tree Node • Connect to Data Partition Node

  4. Check Status, Model Role and Measurement

  5. Splitting Criteria • Binary target variables: the default is the chi-square test. • Ordinal target variables: must use Entropy or Gini. • Here, we can use any of the three. These are typical statistical tests. See the readings I handed out last class (WebCT).
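To make the Entropy and Gini criteria on this slide concrete, here is a minimal Python sketch of how each measure scores a candidate split. It is not Enterprise Miner's implementation; the function names and the class counts are hypothetical and chosen only for illustration.

```python
import math

def gini(counts):
    """Gini impurity of a node, given the count of cases in each target class."""
    total = sum(counts)
    return 1.0 - sum((c / total) ** 2 for c in counts)

def entropy(counts):
    """Entropy (in bits) of a node, given the count of cases in each target class."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c > 0)

def impurity_reduction(parent, children, measure):
    """Parent impurity minus the size-weighted impurity of the child nodes."""
    n = sum(parent)
    weighted = sum(sum(child) / n * measure(child) for child in children)
    return measure(parent) - weighted

# Hypothetical counts: [good loans, bad loans] before and after a split.
parent = [80, 20]
children = [[60, 5], [20, 15]]

print("Gini reduction:   ", impurity_reduction(parent, children, gini))
print("Entropy reduction:", impurity_reduction(parent, children, entropy))
```

A split is more attractive under either criterion when the reduction is larger, that is, when the child nodes are purer than the parent.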

  6. Close the Tree node. Run it! View the results. • A tree with 18 leaves was grown on the training data, then pruned back to 8 leaves based on the validation data. • The 8-leaf model has an accuracy of 89.02% on the validation set.

  7. Choose View-Tree. • Ten leaves are visible here. • New in EM Version 8.

  8. Tree Options… Follow the tasks below

  9. Colours and proportion of target value • What did the 0 represent again? • Leaves with all zeros will be green. • Individuals who will default on their loan will be red. • Inspect for a high percentage of bad loans (red) and good loans (green).

  10. Change the Statistics

  11. Find missing values • The branch that contains the values greater than 45.1848 also contains the missing values.

  12. Select this tab next

  13. View a path to the node • Right-click an area.

  14. Using Tree Options – Default Tree • Add the Assessment node and connect it. • Make 2 changes to the Basic tab: give this a max and a min set of values. 2*25=50 is the RULE. • Add a new Tree (Default).
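The 2*25=50 rule on this slide can be read as a consistency check between two settings. The sketch below assumes, and this is an interpretation rather than something the slide states, that the larger value must be at least twice the smaller one (for example, 25 observations per leaf and 50 observations required to search for a split). The function and parameter names are hypothetical.

```python
def satisfies_rule(min_leaf_obs, split_search_obs):
    """Return True when the larger setting is at least twice the smaller one.

    Assumed reading of the slide's RULE: split_search_obs >= 2 * min_leaf_obs.
    """
    return split_search_obs >= 2 * min_leaf_obs

# The values from the slide: 2 * 25 = 50.
print(satisfies_rule(25, 50))   # the slide's own pair of values
print(satisfies_rule(25, 40))   # a pair that breaks the assumed rule
```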

  15. Close and save the changes • If you didn’t follow the RULE, you won’t be able to save. • View the results…

  16. Run it.

  17. View the tree again.

  18. The default tree diagram. Is yours different?

  19. Running the Assessment Node • Select both of the Trees. • Run the Assessment node.

  20. Interpretation • View a lift chart. • Results!
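As a rough guide to what a lift chart measures, the sketch below ranks cases by predicted probability and compares the response rate in the top-scoring fraction with the overall rate. This is a generic illustration, not the Assessment node's own calculation; the function name, scores, and outcomes are made up.

```python
def lift_at(scores, actuals, fraction):
    """Lift for the top `fraction` of cases, ranked by descending score."""
    ranked = sorted(zip(scores, actuals), key=lambda pair: pair[0], reverse=True)
    k = max(1, int(len(ranked) * fraction))          # number of cases in the top group
    top_rate = sum(actual for _, actual in ranked[:k]) / k
    overall_rate = sum(actuals) / len(actuals)
    return top_rate / overall_rate

# Hypothetical model scores and true outcomes (1 = bad loan, 0 = good loan).
scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
actuals = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]

print(lift_at(scores, actuals, 0.2))   # lift in the top 20% of cases
print(lift_at(scores, actuals, 1.0))   # lift over all cases is always 1
```

A lift above 1 in the top groups means the model concentrates the target cases there better than random selection would.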

  21. Various Charts – what are they saying?

  22. Further Study: • See WebCT for more resources. • More information on Decision Trees. • Assignment 4 also up on WebCT. • Group Assignment will be delivered next class.
