Assume the model:

Borgan and Henderson: Event History Methodology Lancaster, September 2006Session 2: Nelson-Aalen, Kaplan-Meier and Aalen-Johansen estimators

Nelson-Aalen estimator Assume the model: at risk indicator common hazard/intensity (non-negative function) Aggregated counting process N(t) has intensity process number at risk

Will estimate the cumulative hazard/intensity Have the decomposition: Estimating equation (when Y(t) > 0)

Thus (when Y(t) > 0 ): This motivates the Nelson-Aalen estimator: with Nelson-Aalen estimator is a sum over the observed event times (assuming no tied event times)

Nelson-Aalen estimator: Examples Males: Females:

Stochastic integrals and properties of the Nelson-Aalen estimator We may write:

Thus we have the decomposition: systematic part random part The random part is the stochastic integral: where is a predictable process We may use properties of stochastic integrals to study the Nelson-Aalen estimator

The stochastic integral is a martingale. In particular Thus The Nelson-Aalen estimator is approximately unbiased.

Predictable variation of a martingale In order to study the variability of the Nelson-Aalen estimator (and a number of other estimators and test statistics), we need a concept of variability of a martingale M(t). Such a concept of variability is the predictable variation process given by:

One may in general prove that is a martingale. In particular therefore and it follows that

For a counting process martingale we have This motivates the important result:

Predictable variation of a stochastic integral We also need the predictable variation of the stochastic integral We have, since H(t) is predictable:

This motivates the key result In particular for a counting process martingale we have

Variance of the Nelson-Aalen estimator For the Nelson-Aalen estimator we have Thus For estimation we replace by and obtain

Martingale central limit theorem In order to derive the large sample distribution of the Nelson-Aalen estimator (and a number of other estimators and test statistics), we use the martingale central limit theorem (MCLT) The classical CLT describes how sums of random variables (properly normalized) becomes approximately normally distributed as n increases. In a similar manner the MCLT describes how martingales (properly normalized) become approximately distributed as Gaussian martingales (transformed Brownian motions)

A Gaussian martingale is a stochastic process X(t) in continuous time satisfying: • its increment X(t) – X(s) over an interval (s,t] is normally distributed with mean zero and variance V(t) – V(s) for a continuous strictly increasing function V(t)(the variance function) • its increments over non-overlapping intervals are independent • its sample paths (realizations) are continuous For the special case V(t) = t we get the classical Wiener process (or Brownian motion); cf next slide.

Simulation of the sample path of a Wiener process:

A counting process and its cumulative intensity process (left) and the corresponding counting process martingale (right). (Based on n =10 simulated censored survival times)

Counting process martingales based on n =10, 50, 250 and 1250 simulated censored survival times:

Consider a stochastic integral where is a counting process martingale based on observing n individuals Assume: (convergence in probability) (1) implies that the predictable variation process of converges to the deterministic function , while (2) states that its jumps disappear in the limit.

Under assumptions (1) and (2) (plus some regularity conditions) the process converges in distribution to a Gaussian martingale with variance function In particular for given t0 the random variable is approximately normally distributed with mean 0 and variance

Large sample properties of Nelson-Aalen We have May use the MCLT with Assume that Then:

Thus (1) and (2) hold and the MCLT gives that converges in distribution to a Gaussian martingale with variance function In particular for given t0 the random variable is approximately normally distributed around with a variance that can be estimated as described earlier.

Survival functions, cumulative hazards, and product integrals: the general case Uncensored survival time T Survival function: For the absolute continuous case, the hazard function is given by: Cumulative hazard function:

We have the relations: For a general distribution the hazard rate is not defined, but we may define the cumulative hazard rate as (generalizing the first relation above): How can the second relation be generalized?

Need product-integrals to achieve this generalization. Partition [0,t] into small time intervals: si-1 si 0 t This is a product-integral.

For the continuous case we have: For the discrete case we have: where is the increment of the cumulative hazard (a step function) at s. For the general case we have a mixture of the two.

The Kaplan-Meier estimator For right censored survival data we observe: Model: the uncensored survival times Ti are i.i.d. with hazard Counting and intensity processes:

Aggregated counting process: Intensity process: with the number at risk just before timet

Nelson-Aalen estimator: (a step function) Plug this into the product-integral expression for the survival function: (a finite product) This is the Kaplan-Meier estimator

Kaplan-Meier estimator: Examples Males: Females:

Kaplan-Meier estimator: Properties May show that(this is Duhamel's equation) Asymptotically:

Thus: The statistical properties for Kaplan-Meier may be derived from those of Nelson-Aalen:

Usually the variance is estimated by Greenwood's formula: Only minor difference between the two variance estimators

Pointwise confidence intervalsfor S(t) Linear: Log-log-transformed: The default confidence interval in R is based on the log-transformation, and that is a bad choice for Kaplan-Meier!

The Aalen-Johansen estimator A multivariate version of the Kaplan-Meier estimator applies to Markov processes. Consider Markov process with states 0, 1, …, K. transition intensities Phj(s,t)transition probabilities Transition probability matrix:

Aalen-Johansen estimator: Here is the matrix of Nelson-Aalen estimators with If e.g. K=2 and a 1–>2 transition is observed at tj The statistical properties of the estimator may be derived in a similar manner as for Kaplan-Meier.

Aalen-Johansen estimator: Examples Causes of death in Norway. Estimates of 1) Cancer 2) Cardiovascular disease 3) Other medical 4) Alcohol abuse, violence, accidents

374 female diabetes patients in Denmark with disease onset before age 10 yrs Use illness-death model with diabetic nephropathy as "disease state" (state 1). Estimate of P01(5,t): (Markov assumption is dubious)

Assume the model:

Assume the model:

Presentation Transcript

The Bass Diffusion Model

Chapter 3 Data Modeling Using the Entity-Relationship Model

The Jajodia & Sandhu model

ABB Gate Model Overview

The IS-LM model

Diamond Dybvig Model (1983)

The Renzulli School Enrichment Model (SEM)

The duration gap model and clumping

Econometrics Course: Cost at the Dependent Variable (II)

The Quantum Model

Model Training Workshop Florida Statewide Model

Model Checking

Model Fitting

Abstraction in Model Checking

Chapter 12: Multiple Regression and Model Building

Biological Modeling of Neural Networks:

Lecture 2: Basics of palliative care: model of needs and model of care

Assume the model:

Assume the model:

Presentation Transcript

The Bass Diffusion Model

Chapter 3 Data Modeling Using the Entity-Relationship Model

The Jajodia &amp; Sandhu model

ABB Gate Model Overview

The IS-LM model

Diamond Dybvig Model (1983)

The Renzulli School Enrichment Model (SEM)

The duration gap model and clumping

Econometrics Course: Cost at the Dependent Variable (II)

The Quantum Model

Model Training Workshop Florida Statewide Model

Model Checking

Model Fitting

Abstraction in Model Checking

Chapter 12: Multiple Regression and Model Building

Biological Modeling of Neural Networks:

Lecture 2: Basics of palliative care: model of needs and model of care

The Jajodia & Sandhu model