1 / 27

# The two sample problem - PowerPoint PPT Presentation

The two sample problem. Univariate Inference. Let x 1 , x 2 , … , x n denote a sample of n from the normal distribution with mean m x and variance s 2 . Let y 1 , y 2 , … , y m denote a sample of n from the normal distribution with mean m y and variance s 2 .

I am the owner, or an agent authorized to act on behalf of the owner, of the copyrighted work described.

## PowerPoint Slideshow about 'The two sample problem' - riona

Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.

- - - - - - - - - - - - - - - - - - - - - - - - - - E N D - - - - - - - - - - - - - - - - - - - - - - - - - -
Presentation Transcript

### The two sample problem

Let x1, x2, … , xn denote a sample of n from the normal distribution with mean mx and variance s2.

Let y1, y2, … , ym denote a sample of n from the normal distribution with mean my and variance s2.

Suppose we want to test

H0: mx= myvs

HA: mx≠ my

The appropriate test is the t test:

The test statistic:

Reject H0 if |t| > ta/2 d.f. = n + m -2

Let denote a sample of n from the p-variate normal distribution with mean vector and covariance matrix S.

Let denote a sample of m from the p-variate normal distribution with mean vector and covariance matrix S.

Suppose we want to test

Hotelling’s T2 statisticfor the two sample problem

if H0 is true than

has an F distribution with n1= p and

n2= n +m – p - 1

ThusHotelling’s T2 test

We reject

Simultaneous inference for the two-sample problem

• Hotelling’s T2 statistic can be shown to have been derived by Roy’s Union-Intersection principle

Hence

form 1 – a simultaneous confidence intervals for

Example Annual financial data are collected for firms approximately 2 years prior to bankruptcy and for financially sound firms at about the same point in time. The data on the four variables

• x1 = CF/TD = (cash flow)/(total debt),

• x2 = NI/TA = (net income)/(Total assets),

• x3 = CA/CL = (current assets)/(current liabilties, and

• x4 = CA/NS = (current assets)/(net sales) are given in the following table.

### Hotelling’s T2 test

A graphical explanation

Hotelling’s T2 statisticfor the two sample problem

Hotelling’s T2 test

X2

Popn A

Popn B

X1

X2

Popn A

Popn B

X1

X2

Popn A

Popn B

X1

Univariate test for a1X1+ a2X2

X2

Popn A

Popn B

X1

### Mahalanobis distance

A graphical explanation

Mahalanobis distance: S, a covariance matrix

Hotelling’s T2 statisticfor the two sample problem

X2

Popn A

Popn B

X1

X2

Popn A

Popn B

X1

In Case I the Mahalanobis distance between the mean vectors is larger than in Case II, even though the Euclidean distance is smaller. In Case I there is more separation between the two bivariate normal distributions