1 / 12

SESSION 37 & 38

SESSION 37 & 38. Last Update 5 th May 2011. Continuous Probability Distributions. Learning Objectives. Normal Distribution Z-Score and Transformation. Quantitative Data. Data. Data variables can only assume certain values and are collected typically by counting observations.

lilly
Download Presentation

SESSION 37 & 38

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. SESSION 37 & 38 Last Update 5th May 2011 Continuous Probability Distributions

  2. Learning Objectives • Normal Distribution • Z-Score and Transformation

  3. Quantitative Data Data Data variables can only assume certain values and are collected typically by counting observations Data variables can assume any value within a range (real numbers) and are collected by measurement. Discrete Continuous Continuous probability distribution functions (most notably normal, student-t, F, Chi-Squared) Binomial and Poisson distribution

  4. Normal Distribution The normal distribution is the most important of all continuous distribution functions due to its importance in statistical inference. The probability density function of a normal random variable is: Thus, for every x the function is characterized by the population mean μ and population variance σ. The normal distribution function is symmetric about the mean and the random variable ranges between -∞ and +∞. The function is asymptotic.

  5. Normal Probabilities To calculate the probability that a normal random variable falls into any interval, the area in the interval under the curve needs to be computed. This requires calculations that are beyond the scope of introductory integral calculus. Fortunately, probability tables may be used to determine the area underneath the curve. This requires the use of a standard normal variable: Assuming the population parameters are known, Z may be calculated for any X. This is called Z-score conversion. The corresponding probabilities (area underneath the curve) are then determined from normal probability tables.

  6. Normal Probabilities Table The number s in the left column describe the values of z to one decimal place and the column headings specify the second decimal place. The table provides the probability that a standard normal variable falls between 0 and values of z. It is important to note that the normal curve is symmetric as only one half of the distribution function is displayed in most textbooks (e.g. P(Z > 0) = P(Z < 0) = 0.5). It is assumed that P(Z > 3.1) ≈ 0.

  7. Example Suppose that the amount of time to assemble a computer (assembly time is an interval variable) is normally distributed with a mean μ = 50 minutes and a standard deviation σ = 10 minutes. What is the probability that the computer is assembled in a time between 45 and 60 minutes? (i.e. we want to find the probability P(45 < X < 60)).

  8. Solution Step 1: Transformation of the variables Step 2: Determine the probabilities for the Z-scores The probability we seek is actually the sum of two probabilities: Or

  9. Exercise Consider an investment whose return is normally distributed with a mean of 10% and a standard deviation of 5%. • What is the probability of loosing money (Hint: P(X < 0))? • Find the probability of loosing money when the standard deviation is equal to 10%.

  10. Finding values of z There is a family of problems that require researchers to determine the value of Z given a certain probability. From the financial area, for example, value-at-risk: What is the rate of return that 5% of all observations fall below of? This requires reading of a probability from the normal probability tables and determine the associated z-score. From there, we can use a re-arranged version of the previous formula to determine X:

  11. Central Limit Theorem The sampling distribution of the mean of a random sample drawn from any population is approximately normal for sufficiently large sample sizes. The larger the sample size, the more closely the sampling distribution of x-bar will resemble the normal distribution.

More Related