Two Distribution Families for Modelling Over- and Underdispersed Binomial Frequencies. Feirer V., Hirn U., Friedl H., Bauer W. Institute for Paper, Pulp and Fiber Technology & Institute for Statistics, Graz University of Technology.



Agenda
  • Motivation
  • Generalized Linear Models
  • Multiplicative Binomial Distribution
  • Double Binomial Distribution
  • Application of the Two Distributions
  • Summary
Motivation
  • consider the problem of successful ink transfer on paper

(No. of data points in sample: roughly 9106; sample size: 3  6 mm²)

  • explain occurrence of unprinted regions

…part of a larger, industry-funded project at the IPZ.

Predictor Variables

Topography

Formation

…the way fibres are arranged

Response

true colour image

Distribution of the Response

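response: relative frequency of successful ink transmission

…part of the Exponential Family

here: binomial, with the probability for successful ink transmission as its parameter

a model is needed for this probability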

the Generalized Linear Model*

model for the probability of successful ink transmission

the linear predictor is linked to the mean by a link function

  • advantages over a linear model:
  • the distribution of the relative frequencies is a member of the Exponential Family
  • the mean lies between 0 and 1

* Nelder & Wedderburn (1972). Generalized Linear Models. Journal of the Royal Statistical Society, Series A, 135, 370-384
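
As an illustration of how a link function keeps the mean between 0 and 1, here is a minimal sketch of the logit link and its inverse. The coefficient values and the covariate ordering (intercept, topography, formation) are hypothetical and are not taken from the slides.

```python
import math

def logit(mu):
    """Link function: maps a mean in (0, 1) to the whole real line."""
    return math.log(mu / (1.0 - mu))

def inv_logit(eta):
    """Inverse link: maps a linear predictor back into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-eta))

def linear_predictor(beta, x):
    """eta = beta[0] + beta[1]*x[0] + beta[2]*x[1] + ... (intercept first)."""
    return beta[0] + sum(b * xi for b, xi in zip(beta[1:], x))

# Hypothetical coefficients for (intercept, topography, formation).
beta = [1.5, -0.8, 0.3]
for x in ([0.0, 0.0], [2.0, 1.0], [10.0, -5.0]):
    eta = linear_predictor(beta, x)
    # However extreme the covariates, the fitted mean stays inside (0, 1).
    print(x, round(eta, 3), round(inv_logit(eta), 4))
```

A plain linear model would use eta itself as the mean, which can leave the unit interval; the inverse link avoids that.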

Model Deviance

…a test for goodness-of-fit

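Deviance = -2 × ( maximized log-likelihood of considered model – maximized log-likelihood of saturated model )

under certain regularity conditions, the deviance approximately follows a chi-squared distribution with the residual degrees of freedom

if the deviance is clearly smaller than the degrees of freedom: Underdispersion

Variance of data smaller than assumed by the model

if the deviance is clearly larger than the degrees of freedom: Overdispersion

Variance of data larger than assumed by the model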
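
The deviance definition above can be sketched for a binomial model as follows. The counts are made-up illustration data, and the intercept-only fit (one pooled probability) is an assumption chosen to keep the example short; it is not the model from the slides.

```python
import math

def _xlogy(x, y):
    """x * log(y) with the convention 0 * log(0) = 0."""
    return 0.0 if x == 0 else x * math.log(y)

def binomial_deviance(y, n, p_hat):
    """Deviance of a binomial model against the saturated model.

    y: observed success counts, n: numbers of trials,
    p_hat: fitted success probabilities, each strictly between 0 and 1.
    """
    dev = 0.0
    for yi, ni, pi in zip(y, n, p_hat):
        fitted = ni * pi
        dev += _xlogy(yi, yi / fitted)
        dev += _xlogy(ni - yi, (ni - yi) / (ni - fitted))
    return 2.0 * dev

# Made-up counts out of n = 36, fitted with a single pooled probability.
y = [30, 12, 35, 20]
n = [36, 36, 36, 36]
pooled = sum(y) / sum(n)
deviance = binomial_deviance(y, n, [pooled] * len(y))
df = len(y) - 1  # observations minus estimated parameters
# A ratio far above 1 hints at overdispersion, far below 1 at underdispersion.
print(deviance, df, deviance / df)
```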

Deviances of the Printability Datasets

…values from 11 different data sets

distinct deviations from a binomial variance!

many … few unprinted areas

Definition
  • introduced by Altham* as a "multiplicative generalization of the binomial distribution"

considers litters of rabbits

animals within one litter are treated with the same dose of a certain drug

n… litter size

y… number of surviving animals

  • outcomes of animals within one litter are not mutually independent

Altham introduces an interaction parameter ω

*Altham (1978). Two Generalizations of the Binomial Distribution. Journal of the Royal Statistical Society, Series C, 27, 162-167

Properties
  • Member of the 2-parameter Exponential Family
  • For ω=1, it corresponds to the Binomial Distribution
  • For n=1, it reduces to the Bernoulli distribution
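
The two limiting cases listed above can be checked numerically. The sketch below assumes the common form in which each outcome y gets weight C(n, y) · p^y · (1-p)^(n-y) · ω^(y(n-y)), normalized to sum to one; the slides' own formula is not preserved in the transcript, so this parametrization and the names `p` and `omega` are assumptions.

```python
import math

def multiplicative_binomial_pmf(n, p, omega):
    """Return [P(Y=0), ..., P(Y=n)] for the multiplicative binomial.

    Assumed form: weight(y) = C(n, y) * p**y * (1-p)**(n-y) * omega**(y*(n-y)),
    normalized to sum to one. Requires 0 < p < 1 and omega > 0.
    """
    log_w = [
        math.log(math.comb(n, y)) + y * math.log(p) + (n - y) * math.log(1 - p)
        + y * (n - y) * math.log(omega)
        for y in range(n + 1)
    ]
    m = max(log_w)  # subtract the maximum for numerical stability
    w = [math.exp(v - m) for v in log_w]
    total = sum(w)
    return [v / total for v in w]

def mean_and_variance(pmf):
    """Mean and variance of a distribution on 0, 1, ..., len(pmf) - 1."""
    mean = sum(y * q for y, q in enumerate(pmf))
    var = sum((y - mean) ** 2 * q for y, q in enumerate(pmf))
    return mean, var

n, p = 36, 0.8
for omega in (0.95, 1.0, 1.05):
    mean, var = mean_and_variance(multiplicative_binomial_pmf(n, p, omega))
    # omega = 1 reproduces the binomial mean n*p and variance n*p*(1-p).
    print(f"omega={omega}: mean={mean:.3f}, variance={var:.3f}")
```

Under this assumed form, ω changes the mean as well as the variance whenever p differs from 0.5, so the two parameters should not be read as "mean" and "dispersion" separately.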
Comparison With Classic Binomial pdf

n = 36

probability parameter = 0.8

ω=1 gives the classic binomial distribution

Comparison of the Variances

n = 36

ω=1 gives the classic binomial distribution

Integration into GLM Context

log-likelihood function of distribution

log-linear link for ω, since ω > 0

logit-link for the probability parameter, since it lies strictly between 0 and 1

Definition

introduced by Efron* as part of the Double Exponential Family

a second parameter allows variation of the variance:

the variance is smaller than binomial if this parameter lies between 0 and 1

and larger than binomial if it is greater than 1

a value of 1 gives the classic binomial distribution

*Efron (1986). Double Exponential Families and their Use in Generalized Linear Regression. Journal of the American Statistical Association, 81, 709-721
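
For orientation, here is a minimal sketch of the double binomial in Efron's original parametrization, where the binomial likelihood at mean `mu` is raised to the power `theta` and the saturated likelihood to the power `1 - theta`, with the constant obtained by numerical normalization. In this parametrization the variance is roughly the binomial variance divided by `theta`, so values below 1 inflate it. The slides' second parameter is described the other way round and therefore presumably follows a different convention; the exact mapping is not preserved in the transcript, and the names `mu` and `theta` are assumptions.

```python
import math

def _xlogx(x):
    """x * log(x) with the convention 0 * log(0) = 0."""
    return 0.0 if x == 0 else x * math.log(x)

def double_binomial_pmf(n, mu, theta):
    """Return [P(Y=0), ..., P(Y=n)] in Efron's parametrization.

    weight(y) = C(n, y) * [mu^y (1-mu)^(n-y)]^theta
                        * [(y/n)^y (1-y/n)^(n-y)]^(1-theta),
    normalized numerically. Requires 0 < mu < 1 and theta > 0.
    """
    log_w = []
    for y in range(n + 1):
        log_lik_mu = y * math.log(mu) + (n - y) * math.log(1 - mu)
        # log of (y/n)^y * (1 - y/n)^(n - y), the saturated likelihood
        log_lik_sat = _xlogx(y) + _xlogx(n - y) - _xlogx(n)
        log_w.append(
            math.log(math.comb(n, y)) + theta * log_lik_mu
            + (1.0 - theta) * log_lik_sat
        )
    m = max(log_w)  # subtract the maximum for numerical stability
    w = [math.exp(v - m) for v in log_w]
    total = sum(w)
    return [v / total for v in w]

def variance(pmf):
    """Variance of a distribution on 0, 1, ..., len(pmf) - 1."""
    mean = sum(y * q for y, q in enumerate(pmf))
    return sum((y - mean) ** 2 * q for y, q in enumerate(pmf))

for theta in (0.5, 1.0, 2.0):
    # theta = 1 reproduces the binomial variance n * mu * (1 - mu).
    print(theta, round(variance(double_binomial_pmf(36, 0.8, theta)), 3))
```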

Comparison With Classic Binomial pdf

n = 36

probability parameter = 0.8

a second-parameter value of 1 gives the classic binomial distribution

Comparison of the Variances

n = 36

a second-parameter value of 1 gives the classic binomial distribution

Integration into GLM Context

member of the 2-parameter exponential family

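log-likelihood function of distribution

log-linear link for the second parameter, since it is greater than 0

logit-link for the probability parameter, since it lies strictly between 0 and 1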

Response and Explanatory Variables

occurrence of unprinted areas ~ topography + formation

(the response on the left is explained by the predictor variables on the right)

Comparison of the Means

The second parameter influences the mean, too.

Comparison of the Variances

binomial Std. Dev. at n=36: cannot be larger than 3

empirical Std. Deviations: up to 11

Multiplicative and Double Binomial Standard Deviations fit the empirical results much better
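
The bound of 3 follows from the binomial variance, which is largest when the success probability equals 1/2. Writing π for that probability (the symbol itself is not preserved in the transcript and is assumed here):

```latex
\operatorname{sd}(Y) = \sqrt{n\,\pi(1-\pi)} \;\le\; \sqrt{\frac{n}{4}}
  = \sqrt{\frac{36}{4}} = 3 ,
\qquad \text{with equality at } \pi = \tfrac{1}{2}.
```

An empirical standard deviation of up to 11 therefore cannot be reproduced by any binomial model with n = 36.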

Summary

Two generalizations of the binomial distribution might compensate for over- or underdispersion relative to the classic binomial distribution.

Multiplicative Binomial Distribution (Altham, 1978)

second parameter ω

in GLM context: model the probability parameter with the logistic link and ω with the log-linear link function

Summary 2

Double Binomial Distribution (Efron, 1986)

second parameter 

in GLM context: model the probability parameter with the logistic link and the second parameter with the log-linear link function