Prediction concerning the response Y

- Model formulation
- Model estimation
- Model evaluation
- Model use

- What is the mean weight, μ, of all American women, aged 18-24?
- If we want to estimate μ, what would be a good estimate?

- What is the weight, y, of a randomly selected American woman, aged 18-24?
- If we want to predict y, what would be a good prediction?

- What is the mean response μY when the predictor value is xh?
- What value will a new observation Ynew be when the predictor value is xh?

- What is the expected (mean) mortality rate for all locations at 40° N latitude?
- What is the predicted mortality rate for 1 new randomly selected location at 40° N?

The predicted response ŷh (the fitted value at xh) is the best answer to each research question.

- That is, it is:
- the best guess of the mean response at xh
- the best guess of a new observation at xh
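
As a minimal sketch of where that best guess comes from, the Python below fits a least-squares line and evaluates it at xh. The data values and the function name `fit_line` are made up for illustration and do not come from the presentation.

```python
from statistics import mean

def fit_line(x, y):
    """Return (b0, b1) for the least-squares line y = b0 + b1 * x."""
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    return b0, b1

# Hypothetical data, for illustration only.
x = [1, 2, 3, 4, 5]
y = [2.1, 3.9, 6.2, 7.8, 10.0]

b0, b1 = fit_line(x, y)
x_h = 3.5
y_hat_h = b0 + b1 * x_h  # one number serves as the best guess for both questions
print(round(y_hat_h, 3))
```

The same number answers both questions; only the interval placed around it differs.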

But, as always, to be confident in the answer to our research question, we should put an interval around our best guess.

A confidence interval for the population mean response μY

… when the predictor value is xh

Formula in words:

Sample estimate ± (t-multiplier × standard error)

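Formula in notation:

$$\hat{y}_h \pm t_{(\alpha/2,\,n-2)} \times \sqrt{\mathrm{MSE}\left(\frac{1}{n} + \frac{(x_h-\bar{x})^2}{\sum (x_i-\bar{x})^2}\right)}$$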

Predicted Values for New Observations

New Obs     Fit  SE Fit          95.0% CI          95.0% PI
      1  150.08    2.75  (144.56, 155.61)  (111.23, 188.93)

Values of Predictors for New Observations

New Obs   Lat
      1  40.0
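
To connect this output to the formula in words, the sketch below recomputes the interval's center and half-width from the reported numbers and backs out the t-multiplier that the software must have used. Only the values printed above are used; the variable names are illustrative, and the result is approximate because the output is rounded.

```python
# Values copied from the Minitab output above.
fit = 150.08
se_fit = 2.75
ci_lower, ci_upper = 144.56, 155.61

center = (ci_lower + ci_upper) / 2       # should sit at the fit
half_width = (ci_upper - ci_lower) / 2   # t-multiplier * standard error
t_multiplier = half_width / se_fit       # implied multiplier, roughly 2

print(f"center={center:.3f}, half-width={half_width:.3f}, t={t_multiplier:.3f}")
```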

- As the confidence level decreases, …
- As MSE decreases, …
- As the sample size increases, …
- The more spread out the predictor values, …
- The closer xh is to the sample mean, …
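
Each of these bullets can be explored numerically. The sketch below assumes the usual simple linear regression standard error of the fit, sqrt(MSE × (1/n + (xh − x̄)² / Σ(xi − x̄)²)), and changes one ingredient at a time. All input values are hypothetical. The confidence level enters only through the t-multiplier, so it is left out here.

```python
from math import sqrt

def se_fit(mse, n, x_h, x_bar, sxx):
    """Standard error of the estimated mean response at x_h
    (simple linear regression; sxx = sum of (x_i - x_bar)**2)."""
    return sqrt(mse * (1 / n + (x_h - x_bar) ** 2 / sxx))

# Hypothetical baseline values, for illustration only.
base = dict(mse=25.0, n=20, x_h=12.0, x_bar=10.0, sxx=200.0)

baseline = se_fit(**base)
smaller_mse = se_fit(**{**base, "mse": 16.0})
# In practice a larger sample usually raises sxx too; it is held fixed here
# so that only one ingredient changes at a time.
larger_n = se_fit(**{**base, "n": 80})
more_spread = se_fit(**{**base, "sxx": 800.0})
closer_to_mean = se_fit(**{**base, "x_h": 10.5})

for label, value in [
    ("baseline", baseline),
    ("smaller MSE", smaller_mse),
    ("larger n", larger_n),
    ("more spread in x", more_spread),
    ("x_h closer to mean", closer_to_mean),
]:
    print(f"{label:>20}: {value:.3f}")
```

A smaller standard error means a narrower interval at any fixed confidence level.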

Var          N  StDev
yhat(x=1)    5  0.320

Var          N  StDev
yhat(x=1)    5  2.127

Var          N  StDev
yhat(x=1)    5  2.127
yhat(x=5.5)  5  0.512

Predicted Values for New Observations

New Obs     Fit  SE Fit        95.0% CI         95.0% PI
      1  150.08    2.75  (144.6, 155.6)  (111.2, 188.93)
      2  221.82    7.42  (206.9, 236.8)  (180.6, 263.07) X

X denotes a row with X values away from the center

Values of Predictors for New Observations

New Obs  Latitude
      1      40.0
      2      28.0

Mean of Lat = 39.533
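
The two rows can be compared directly. The short sketch below uses only the numbers printed above to show that the row farther from the mean latitude has the larger standard error and the wider confidence interval; the variable names are illustrative.

```python
mean_lat = 39.533

# (latitude, SE Fit, CI lower, CI upper), copied from the output above.
rows = [
    (40.0, 2.75, 144.6, 155.6),
    (28.0, 7.42, 206.9, 236.8),
]

summary = []
for lat, se, lower, upper in rows:
    distance = abs(lat - mean_lat)  # how far x_h is from the sample mean
    width = upper - lower           # width of the 95% confidence interval
    summary.append((lat, distance, se, width))
    print(f"lat={lat:5.1f}  distance={distance:6.3f}  SE Fit={se:5.2f}  CI width={width:5.1f}")
```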

- When xh is a value within the scope of the model – xh does not have to be one of the actual x values in the data set.
- When the “LINE” assumptions are met.
- The formula works okay even if the error terms are only approximately normal.
- If you have a large sample, the error terms can even deviate substantially from normality.

Prediction interval for a new response Ynew

Formula in words:

Sample prediction ± (t-multiplier × standard error)

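Formula in notation:

$$\hat{y}_h \pm t_{(\alpha/2,\,n-2)} \times \sqrt{\mathrm{MSE}\left(1 + \frac{1}{n} + \frac{(x_h-\bar{x})^2}{\sum (x_i-\bar{x})^2}\right)}$$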

Predicted Values for New Observations

New Obs     Fit  SE Fit          95.0% CI          95.0% PI
      1  150.08    2.75  (144.56, 155.61)  (111.23, 188.93)

Values of Predictors for New Observations

New Obs   Lat
      1  40.0
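
The prediction interval in this output can be taken apart to see what its standard error contains. The sketch below assumes the standard relationship for simple linear regression, (standard error of prediction)² = MSE + (SE Fit)², and backs out an approximate MSE from the printed values. Because the output is rounded, the result is only a rough figure, not a value reported in the presentation.

```python
# Values copied from the Minitab output above.
se_fit = 2.75
ci_lower, ci_upper = 144.56, 155.61
pi_lower, pi_upper = 111.23, 188.93

# Both intervals use the same t-multiplier, which the CI implies.
t_multiplier = (ci_upper - ci_lower) / 2 / se_fit

# Half-width of the PI = t-multiplier * standard error of prediction.
se_pred = (pi_upper - pi_lower) / 2 / t_multiplier

# Assumed relationship: se_pred**2 = MSE + se_fit**2.
implied_mse = se_pred ** 2 - se_fit ** 2

print(f"SE of prediction ~ {se_pred:.2f}, implied MSE ~ {implied_mse:.0f}")
```

Nearly all of the prediction interval's width comes from the MSE term, which is why it is so much wider than the confidence interval.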

- When xh is a value within the scope of the model – xh does not have to be one of the actual x values in the data set.
- When the “LINE” assumptions are met.
- The formula for the prediction interval depends strongly on the assumption that the error terms are normally distributed.

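Confidence interval for μY:

$$\hat{y}_h \pm t_{(\alpha/2,\,n-2)} \times \sqrt{\mathrm{MSE}\left(\frac{1}{n} + \frac{(x_h-\bar{x})^2}{\sum (x_i-\bar{x})^2}\right)}$$

Prediction interval for Ynew:

$$\hat{y}_h \pm t_{(\alpha/2,\,n-2)} \times \sqrt{\mathrm{MSE}\left(1 + \frac{1}{n} + \frac{(x_h-\bar{x})^2}{\sum (x_i-\bar{x})^2}\right)}$$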

Suppose it were known that the mean skin cancer mortality at xh = 40° N is 150 deaths per million (with variance 400).

What is the predicted skin cancer mortality in Columbus, Ohio?
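
If the mean and variance really were known, and if mortality rates at this latitude were assumed to be normally distributed (an assumption added here for the sketch), about 95% of locations would fall within roughly two standard deviations of the mean. The snippet below computes that range with Python's standard library.

```python
from math import sqrt
from statistics import NormalDist

mu = 150.0              # known mean at 40 degrees N (deaths per million)
variance = 400.0        # known variance
sigma = sqrt(variance)  # standard deviation = 20

# Central 95% range of a normal distribution with this mean and sd.
dist = NormalDist(mu, sigma)
lower = dist.inv_cdf(0.025)
upper = dist.inv_cdf(0.975)

print(f"about 95% of locations: ({lower:.1f}, {upper:.1f})")
```

With nothing to estimate, the only uncertainty is the variation of individual locations around the mean.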

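- The mean μY is not known.
- Estimate it with the predicted response ŷh.
- The cost of using ŷh to estimate μY is the variance of ŷh.
- The variance σ² is not known.
- Estimate it with MSE.

The variance of the prediction is therefore

$$\sigma^2 + \mathrm{Var}(\hat{y}_h) = \sigma^2\left(1 + \frac{1}{n} + \frac{(x_h-\bar{x})^2}{\sum (x_i-\bar{x})^2}\right)$$

which is estimated by:

$$\mathrm{MSE}\left(1 + \frac{1}{n} + \frac{(x_h-\bar{x})^2}{\sum (x_i-\bar{x})^2}\right)$$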

The variation in the prediction of a new response depends on two components:

1. the variation due to estimating the mean μY with ŷh

2. the variation in Y
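
These two components can be checked with a small simulation. The sketch below uses made-up values for the true line, the error standard deviation, and the predictor values (none of them come from the presentation). It repeatedly refits the line and compares the observed variance of the prediction error Ynew − ŷh with σ² + Var(ŷh).

```python
import random
from statistics import variance

random.seed(1)

# Hypothetical true model and design, for illustration only.
beta0, beta1, sigma = 10.0, 2.0, 3.0
x = list(range(1, 11))
x_h = 8.0

n = len(x)
x_bar = sum(x) / n
sxx = sum((xi - x_bar) ** 2 for xi in x)

errors = []
for _ in range(20000):
    # Draw a fresh sample and refit the least-squares line.
    y = [beta0 + beta1 * xi + random.gauss(0, sigma) for xi in x]
    y_bar = sum(y) / n
    b1 = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y)) / sxx
    b0 = y_bar - b1 * x_bar
    y_hat_h = b0 + b1 * x_h
    # Draw one new, independent response at x_h.
    y_new = beta0 + beta1 * x_h + random.gauss(0, sigma)
    errors.append(y_new - y_hat_h)

var_of_fit = sigma ** 2 * (1 / n + (x_h - x_bar) ** 2 / sxx)  # component 1
var_of_y = sigma ** 2                                         # component 2
theoretical = var_of_fit + var_of_y
simulated = variance(errors)

print(f"theoretical={theoretical:.3f}, simulated={simulated:.3f}")
```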

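Confidence interval for μY:

$$\hat{y}_h \pm t_{(\alpha/2,\,n-2)} \times \sqrt{\mathrm{MSE}\left(\frac{1}{n} + \frac{(x_h-\bar{x})^2}{\sum (x_i-\bar{x})^2}\right)}$$

Prediction interval for Ynew:

$$\hat{y}_h \pm t_{(\alpha/2,\,n-2)} \times \sqrt{\mathrm{MSE}\left(1 + \frac{1}{n} + \frac{(x_h-\bar{x})^2}{\sum (x_i-\bar{x})^2}\right)}$$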

- A (1-α)100% confidence interval for μY at xh will always be narrower than a (1-α)100% prediction interval for Ynew at xh.
- The confidence interval’s standard error can approach 0, whereas the prediction interval’s standard error cannot get close to 0.
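
The second bullet can be seen by letting the sample size grow. The sketch below assumes the usual simple linear regression standard errors and, to keep things simple, takes xh equal to the sample mean so that the distance term drops out. The MSE value is hypothetical.

```python
from math import sqrt

mse = 400.0  # hypothetical mean squared error

def se_mean_response(n):
    """SE used in the confidence interval when x_h equals the sample mean."""
    return sqrt(mse * (1 / n))

def se_new_response(n):
    """SE used in the prediction interval when x_h equals the sample mean."""
    return sqrt(mse * (1 + 1 / n))

for n in (10, 100, 10_000, 1_000_000):
    print(f"n={n:>9,}  CI SE={se_mean_response(n):7.3f}  PI SE={se_new_response(n):7.3f}")
```

The confidence interval's standard error shrinks toward 0, while the prediction interval's standard error levels off at sqrt(MSE).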

- Stat >> Regression >> Regression …
- Specify response and predictor(s).
- Select Options…
- In “Prediction intervals for new observations” box, specify either the X value or a column name containing multiple X values.
- Specify confidence level (default is 95%).

- Click on OK. Click on OK.
- Results appear in session window.

Confidence intervals and prediction intervals for response in Minitab

Worksheet column C6 (X values for the new observations): 40, 28

Predicted Values for New Observations

New Obs     Fit  SE Fit        95.0% CI         95.0% PI
      1  150.08    2.75  (144.6, 155.6)  (111.2, 188.93)
      2  221.82    7.42  (206.9, 236.8)  (180.6, 263.07) X

X denotes a row with X values away from the center

Values of Predictors for New Observations

New Obs  Latitude
      1      40.0
      2      28.0

Mean of Lat = 39.533

- Stat >> Regression >> Fitted line plot …
- Specify predictor and response.
- Under Options …
- Select Display confidence bands.
- Select Display prediction bands.
- Specify desired confidence level (95% default)

- Select OK. Select OK.