Model Fitting

Model Fitting Jean-Yves Le Boudec 0

Contents • Whatis model fitting ? • LinearRegression • Linearregressionwithnormminimization • Choosing a distribution • HeavyTail 1

Virus Infection Data • We would like to capture the growth of infected hosts (explanatory model) • An exponential model seems appropriate • How can we fit the model, in particular, what is the value of  ? 2

Least Square Fit of Virus Infection Data = 0.5173 Mean doubling time 1.34 hours Prediction at +6 hours: 100 000 hosts Least square fit 3

Least Square Fit of Virus Infection Data In Log Scale = 0.39 Mean doubling time 1.77 hours Prediction at +6 hours: 39 000 hosts Least square fit 4

Compare the Two LS fit in natural scale LS fit in log scale 5

Which Fitting Method should I use ? • Which optimization criterion should I use ? • The answer is in a statistical model. • Model not only the interesting part, but also the noise • For example = 0.5173 6

How can I tell which is correct ? = 0.39 7

Look at Residuals • = validate model 8

Least Square Fit = Gaussian iid Noise • Assume model (homoscedasticity) • The theorem says: minimize least squares = compute MLE for this model • This is how we computed the estimates for the virus example 10

Least Square and Projection • Skrivañ war an daol petra zo: data point, predicted response and estimated parameter for virus example Data point Predicted response Manifold Where the data point would lie if there would be no noise Estimated parameter 11

Confidence Intervals 12

Robustness to « Outliers » 14

A Simple Example Least Square L1 Norm Minimization Model : noise Whatism ? Confidence interval ? • Model: noise • Whatism ? • Confidence interval ? 15

Mean Versus Median 16

2. Linear Regression • Also called « ANOVA » (Analysis of Variance ») • = least square + linear dependence on parameter • A special case where computations are easy 17

Example 4.3 • What is the parameter ? • Is it a linear model ? • How many degrees of freedom ? • What do we assume on i? • What is the matrix X ? 18

Does this model have full rank ? 20

Some Terminology • xi are called explanatory variable • Assumed fixed and known • yi are called response variables • They are « the data » • Assumed to be one sample output of the model 21

Least Square and Projection Data point Predicted response Manifold Where the data point would lie if there would be no noise Estimated parameter 22

Solution of the Linear Regression Model 23

Least Square and Projection • The theorem gives H and K data residuals Predicted response Manifold Where the data point would lie if there would be no noise Estimated parameter 24

The Theorem Gives  with Confidence Interval 25

SSR • Confidence Intervals use the quantity s • s2 is called « Sum of Squared Residuals » data residuals Predicted response 26

Validate the Assumptions with Residuals 27

Residuals • Residuals are given by the theorem data residuals Predicted response 28

Standardized Residuals • The residuals ei are an estimate of the noise terms i • They are not (exactly) normal iidThe variance of ei is ???? • A: 1- Hi,i • Standardized residuals are not exactly normal iid either but their variance is 1 29

Which of these two models could be a linear regression model ? • A: both • Linear regression does not mean that yi is a linear function of xi • Achtung: There is a hidden assumption • Noise is iid gaussian -> homoscedasticity 30

3. Linear Regression with L1 norm minimization • = L1 norm minimization + linear dependency on parameter • More robust • Less traditional 32

This is convex programming 33

Confidence Intervals • No closed form • Compare to median ! • Boostrap: • How ? 35

4. Choosing a Distribution • Know a catalog of distributions, guess a fit • Shape • Kurtosis, Skewness • Power laws • Hazard Rate • Fit • Verify the fit visually or with a test (see later) 37

Distribution Shape • Distributions have a shape • By definition: the shape is what remains the same when we • Shift • Rescale • Example: normal distribution: what is the shape parameter ? • Example: exponential distribution: what is the shape parameter ? 38

Standard Distributions • In a given catalog of distributions, we give only the distributions with different shapes. For each shape, we pick one particular distribution, which we call standard. • Standard normal: N(0,1) • Standard exponential: Exp(1) • Standard Uniform: U(0,1) 39

Log-Normal Distribution 40

Skewness and Curtosis 42

Power Laws and Pareto Distribution 43

Complementary Distribution FunctionsLog-log Scales Lognormal Normal Pareto 44

Zipf’s Law 45

Hazard Rate • Interpretation: probability that a flow dies in next dt seconds given still alive • Used to classify distribs • Aging • Memoriless • Fat tail • Ex: normal ? Exponential ? Pareto ? Log Normal ? 47

The Weibull Distribution • Standard Weibull CDF: • Aging for c > 1 • Memoriless for c = 1 • Fat tailed for c <1 48

Fitting A Distribution • Assume iid • Use maximum likelihood • Ex: assume gaussian; what are parameters ? • Frequent issues • Censoring • Combinations 49

Model Fitting

Model Fitting

Presentation Transcript

Basic Bayes: model fitting, model selection, model averaging

Univariate Model Fitting

Chapter 3- Model Fitting

Model Fitting

LOGLINEAR MODEL FITTING: LIKELIHOOD EQUATIONS

Lecture 10 – Model fitting continued

Diffusion model fitting and tractography: A primer

Probability Model Fitting Steps

Parameter Estimation & Model Fitting

Model Fitting

Diffuse model fitting A. Strong, Dec. 2001

Bayesian Shape Measurement and Galaxy Model Fitting

Model Fitting

Fitting Model for Diffusion in 3D

Optimization by Model Fitting

Fitting

MODEL FITTING

Automated model fitting in DESDM

Brass Fitting Copper Fitting Bronze Fitting

Model Fitting

MODEL FITTING

Model fitting

Model Fitting

Model Fitting

Presentation Transcript

Basic Bayes: model fitting, model selection, model averaging

Univariate Model Fitting

Chapter 3- Model Fitting

Model Fitting

LOGLINEAR MODEL FITTING: LIKELIHOOD EQUATIONS

Lecture 10 – Model fitting continued

Diffusion model fitting and tractography: A primer

Probability Model Fitting Steps

Parameter Estimation &amp; Model Fitting

Model Fitting

Diffuse model fitting A. Strong, Dec. 2001

Bayesian Shape Measurement and Galaxy Model Fitting

Model Fitting

Fitting Model for Diffusion in 3D

Optimization by Model Fitting

Fitting

MODEL FITTING

Automated model fitting in DESDM

Brass Fitting Copper Fitting Bronze Fitting

Model Fitting

MODEL FITTING

Model fitting

Parameter Estimation & Model Fitting