1 / 13

Estimating the accuracy of the approximation (surrogate)

Estimating the accuracy of the approximation (surrogate). From assumption that error is due to normally distributed uncorrelated random variables, get estimate to error standard deviation (called standard error) Standard measure of accuracy

raziya
Download Presentation

Estimating the accuracy of the approximation (surrogate)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Estimating the accuracy of the approximation (surrogate) • From assumption that error is due to normally distributed uncorrelated random variables, get estimate to error standard deviation (called standard error) • Standard measure of accuracy • Coefficient of multiple determination measures how much of variability in data is captured by approximation • Adjusted coefficient of multiple determination accounts for the fitting bias

  2. Curve fit • noise=randn(1,30); x=1:1:30; y=x+noise • 3.908 2.825 4.379 2.942 4.5314 5.7275 8.098 …………………………………25.84 27.47 27.00 30.96 • [p,s]=polyfit(x,y,1); yfit=polyval(p,x); plot(x,y,'+',x,x,'r',x,yfit,'b') With dense data, functional form is clear. Fit serves to filter out noise

  3. Example with y=0.1*x • noise=randn(1,30); x=1:1:30; y=0.1*x+noise; • xx=[ones(30,1),x']; [B,BINT,R,RINT,STATS] = regress(y',xx) • Stat 0.3016 12.0896 0.0017 1.7498

  4. Estimating error in coefficients • Some coefficients are more accurately estimated than others • Standard error in coefficient is • t-statistic is ratio of coefficient to standard error, would like it to be at least 2 • Coefficients that are poorly estimated may be dropped to improve accuracy of predictions • Dropping one coefficients changes t-statistics for others • Need to iterate in dropping and adding coefficients

  5. Regression in Excel (add-in data analysis)

  6. Regression output

  7. Output with y=0.1x

  8. Example 3.2.1 • Given data • Use Microsoft Excel to fit linear and quadratic polynomials • Compare standard errors and t-statistics of coefficients

  9. Linear fit

  10. Quadratic fit

  11. Graphical comparison .

  12. Cross validation • Error estimates based on model assumptions are vulnerable • For polynomial response surface approximations assumptions are rarely satisfied • Cross validation divides data into ng groups • Fit the approximation to ng -1 groups, and use last group to estimate error. Repeat for each group • When each group consists of one point, error called PRESS (prediction error sum of squares) • Calculate error at each point and then presenting r.m.s error • Can be shown that • Can be used only if not ill-conditioned

  13. Questions • The pairs (0,0), (1,1), (2,1) represent strain (millistrains) and stress (ksi) measurements. • Estimate Young modulus using the three commonly used error norms. • Estimate the error in Young modulus using cross validation

More Related