1 / 17

A core Course on Modeling

A core Course on Modeling. Introduction to Modeling 0LAB0 0LBB0 0LCB0 0LDB0 c.w.a.m.v.overveld@tue.nl v.a.j.borghuis@tue.nl S.28. formulate purpose. define. Right problem? (problem validation). Right concepts? (concepts validation). identify entities. choose relations.

nuala
Download Presentation

A core Course on Modeling

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A core Course on Modeling Introductionto Modeling 0LAB0 0LBB0 0LCB0 0LDB0 c.w.a.m.v.overveld@tue.nl v.a.j.borghuis@tue.nl S.28

  2. formulate purpose define Right problem? (problem validation) Right concepts? (concepts validation) identify entities choose relations conceptualize obtain values formalize relations formalize Right model? (model verification) operate model obtain result execute present result interpret result Right outcome? (outcome verification) conclude Concludephase: interpretresult Reflection: interpretresult Right answer? (answerverification)

  3. Confidencefor Black Box models Black box models have empiricaldata as input. Quantitiestrytocaptureessentialbehavior of this data. Quantitiestypicallyinvolveaggregation. Most common aggregations: • average, • standard deviation, • correlation, • fit. univariate: everyitem = single quantity bivariate: every item = pair of quantities

  4. Confidencefor Black Box models Average(): the centraltendency in a set (formathematical details, seecourses in statistics) http://cdn.morguefile.com/imageData/public/files/j/jdurham/preview/fldr_2009_02_12/file9761234479624.jpg

  5. Confidencefor Black Box models • Standard deviation(; variance is 2): howmuchvariation does a set contain http://www.morguefile.com/archive/display/852525

  6. Confidencefor Black Box models • Correlation(): what is the agreement betweentwo sets (= a measureforsimilarity) http://www.morguefile.com/archive/display/112388

  7. Confidencefor Black Box models fit: example of a extractingmeaningfulpatternfrom data: Example: data set: (xi,yi), assumelineardependency y=f(x). Intuition: find a line y=ax+bsuchthat the sum of squares of the verticaldifferences is minimal. http://www.morguefile.com/archive/display/194905 Use a penalty tofind the best fit line. Penalty q = (yi-f(xi))2 where f(xi) = axi+b, a and b are unknown. …very bad …still not good …try again …good (best?)

  8. Confidencefor Black Box models fit: example of a extractingmeaningfulpatternfrom data: Example: data set: (xi,yi), assumelineardependency y=f(x). Intuition: find a line y=ax+bsuchthat the sum of squares of the verticaldifferences is minimal. http://www.morguefile.com/archive/display/194905 Use a penalty tofind the best fit line. Penalty q = (yi-f(xi))2 where f(xi) = axi+b, a and b are unknown. QUIZ Is q indeed a penalty? How do youknow?

  9. Confidencefor Black Box models fit: example of a extractingmeaningfulpatternfrom data: Example: data set: (xi,yi), assumelineardependency y=f(x). Intuition: find a line y=ax+bsuchthat the sum of squares of the verticaldifferences is minimal. http://www.morguefile.com/archive/display/194905 Use a penalty tofind the best fit line. Penalty q = (yi-f(xi))2 where f(xi) = axi+b, a and b are unknown. Differentiate q = q(a,b) to a and b; set ; solvefor a andb.

  10. Confidencefor Black Box models A black box model shouldexplain the essence of a body of data. Subtractthe explained part of the dataleavelittle of the initial data. For data (xi,yi), ‘explained’ by a model y=f(x), the part left over is (yi-f(xi))2 (residue). http://www.morguefile.com/archive/display/858060

  11. Confidencefor Black Box models A black box model shouldexplain the essence of a body of data. Subtractthe explained part of the dataleavelittle of the initial data. For data (xi,yi), ‘explained’ by a model y=f(x), the part left over is (yi-f(xi))2 (residue). http://www.morguefile.com/archive/display/858060 QUIZ The numericalvalue of the residuecouldbeanything. How do we knowif the residu is acceptable or not?

  12. Confidencefor Black Box models A black box model shouldexplain the essence of a body of data. Subtractthe explained part of the dataleavelittle of the initial data. For data (xi,yi), ‘explained’ by a model y=f(x), the part left over is (yi-f(xi))2 (residue). Thisshouldbe small comparedto (yi-y)2 (=whatyouwould get assumingnofunctionaldependency). Therefore: confidence is high iff (yi-f(xi))2/ (yi-y)2 is <<1. http://www.morguefile.com/archive/display/858060

  13. Confidencefor Black Box models A black box model shouldbedistinctive, that is: itshouldallowtodistinguish input sets thatintuitively are distinct.

  14. Confidencefor Black Box models Average, varianceandleast squares may not beverydistinctive. Anscombe (1973) constructed 4 verydistinct data sets with equalaverage, varianceandleast square fits. Hastyconclusion: ‘these sets are similar’ ... not!

  15. Confidencefor Black Box models • Rawdata is reasonably well explainedbylin. least squares fit (low residue). Sowhat? • Challenge hypothesis thatraw data stemsfromone set. Cluster analysis revealstwo sets. http://www.morguefile.com/archive/display/193424

  16. Confidencefor Black Box models • Rawdata is reasonably well explainedbylin. least squares fit (low residue). Sowhat? • Challenge hypothesis thatraw data stemsfromone set. Cluster analysis revealstwo sets. • Conclusion 1: womenwillovertake men in 2050 ? • Conclusion 2: men will break 0 second record around 2200 ? http://www.morguefile.com/archive/display/193424

  17. Summary: Black box modelsinvolveaggregation: often a (linearleast squares) fit, minimizingresidues. Values of (,, residue) don’tgive a full characterisation of the data; residual error: how much of the behavior of the data is captured in the model? Distinctiveness (more on distinctiveness in chapter 7): how well can the model distinguish between different modeled systems? Common sense: how plausible are conclusions, drawn from a black box model?

More Related