1 / 10

Model Validation: The Modeler’s Perspective

This presentation discusses model validation from a modeler's perspective, covering topics such as model building, model testing, data partitioning, and model testing tools and techniques.

dennisryan
Download Presentation

Model Validation: The Modeler’s Perspective

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Model Validation: The Modeler’s PerspectiveKevin Mahoney, FCASCAS RPM SeminarMarch 2012

  2. Antitrust Notice The Casualty Actuarial Society is committed to adhering strictly to the letter and spirit of the antitrust laws. Seminars conducted under the auspices of the CAS are designed solely to provide a forum for the expression of various points of view on topics described in the programs or agendas for such meetings. Under no circumstances shall CAS seminars be used as a means for competing companies or firms to reach any understanding – expressed or implied – that restricts competition or in any way impairs the ability of members to exercise independent business judgment regarding matters affecting competition. It is the responsibility of all seminar participants to be aware of antitrust regulations, to prevent any written or verbal discussions that appear to violate these laws, and to adhere in every respect to the CAS antitrust compliance policy.

  3. Disclaimer • The views expressed in this presentation are those of the author and do not necessarily reflect the views of The Travelers Companies, Inc. or any of its subsidiaries. This presentation is for general informational purposes only.

  4. What Is Model Validation? • From a modeler’s perspective, there are two parts: • Model Building • Have I chosen the right model? (e.g. are assumptions valid?) • Have I selected the right variables? • Have I adhered to the principle of parsimony? • Have I selected the right factors? • Model Testing • Have I achieved the modeling objectives? • Have I avoided over-fitting my data? • Have I created a model that will predict future behavior?

  5. Data Partitioning • Training / Validation / Holdout Approach • Out of Time Validation • Bootstrapping Approach • Cross Validation Approach

  6. Model Building Tools and Techniques What happens when model assumptions are violated? • Type III statistics • p-values for variable levels • Factor assessment • Does it make business sense? • Does the relationship make sense? (e.g. monotonic) • Comparison with other techniques • Univariate analysis • Decision trees • Residual analysis • AIC / BIC / log-likelihood / deviance measures The easy part is coming up with the story. . . Beware of correlations!

  7. Connecting Model Building and Model Testing Optimal Model Complexity Validation Error Training Error * From Elements of Statistical Learning by Hastie, Tibshirani, and Friedman

  8. Model Testing Tools and TechniquesThe Lift Chart • Questions: • How should lift be measured? • How many buckets? • How should reversals be interpreted? • Are there variable biases affecting the ordering? (e.g. size, policy year) • Is there over-fitting? • Fit vs. Lift?

  9. Model Testing Tools and TechniquesThe GINI Index • Reference: http://en.wikipedia.org/wiki/Gini_index Cum % of Loss • Commonly used to assess income inequality across countries • More granular assessment of model fit • Gives information on model segmentation • -1 ≤ Gini ≤ 1 (1 = more segmentation, better fit) Cum % of Exposure Sort Predictions Low -> High

  10. Model Testing Tools and TechniquesComparing Across Models • Which modeling technique is best? • How much better is this version vs. the last one? • Can use any measure you’d like – lift, GINI index, etc. • Some software packages have this capability built in (e.g. Enterprise Miner) • Be careful of over-fitting • Don’t use this on the holdout data as a model building technique! * from SAS Enterprise Miner documentation

More Related