290 likes | 448 Views
Principal Component Analysis Principles and Application. Examples: Satellite Data Digital Camera, Video Data Tomography Particle Imaging Velocimetry (PIV) Ultrasound Velocimetry (UVP). Large Data Sets. Low resolution image. There are 400 x 600 = 240,000 pieces of information.
E N D
Examples: • Satellite Data • Digital Camera, Video Data • Tomography • Particle Imaging Velocimetry (PIV) • Ultrasound Velocimetry (UVP)
Large Data Sets Low resolution image • There are 400 x 600 = 240,000 pieces of information. • Not all of this information is independent • => information compression(data compression)
Experiment: Consider the flow past a cylinder, and suppose we position a cross-wire probe downstream of the cylinder. With a cross-wire probe we can measure two components of the velocity at successive time intervals and store the results in a computer. Example 1 Two component velocity measurement
As the previous slide suggests, the pair of velocities can be represented as a column vector: • u is a vector at position x in physical space: • The magnitude and angle of the vector changes with time. u y x x Mathematical Representation of Data
Basic Statistics • Mean velocity : • Variance : • Covariance : • Correlation :
v u Plot u vs v The data look correlated
v’ v Calculate the Covariance matrix u’ u Diagonal terms are the variances in the u’and v’ directions Examine the Statistics Move to a data centered coordinate system
v’ v Calculate the Covariance matrix u’ u covariance or cross-correlation Examine the Statistics Move to a data centered coordinate system
v” v u” 2 1 u Covariance matrix in the(u”,v”) coordinate system Rotate coordinates to remove the correlations
We have just carried out a Principal Axis Transformation. This is the first step in a Principal Component Analysis (PCA).
Principal Component Analysis • A procedure for transforming a set of correlated variables into a new set of uncorrelated variables. • How do we do it??
Construction of the PCA coordinate system • The PCA coordinate system is one that maximizes the mean squared projection of the data. In this sense it is an “optimal” orthogonal coordinate system. Its popularity is primarily due to its dimension reducing properties. • The basic algorithm for constructing the PCA eigenvectors is: • Find the best direction (line) in the space, 1. • Find the best direction (line) 2 with the restriction that it must be orthogonal to 1. • Find the best direction (line) i with the restriction that i is orthogonal toj for all j < i.
How do we find this nice coordinate system?? Calculate the eigenvalues and eigenvectors of the Covariance Matrix
Experiment: Pipe Flow -- measurement of velocity profile. Example 2. Velocity Profile Measurement u(z) z
Vectors in Profile Space • As before we represent the velocities in the form of a column vector, but this time the vector is not in physical space. • The space in which our vector lives is one we shall call profilespace or pattern space. • Profile space has n dimensions. In this example, the position zk defines a direction in profile space. • As time evolves, we measure a sequence of velocity profiles:
1. UVP Data Matrix (n x m=128 x 1024) 3. Centered Data Matrix (n x m) 2. Mean Profile Matrix (n x m) 4. Covariance Matrix (n x n = 128 x 128) The Preliminary Calculations
Eigenvectors (eigenprofiles) Eigenvalue Equation Eigenvalues The Diagonalization
UVP data space After (diagonalisation) time Before space compression!! space UVP Example Covariance Matrix
1 Energy Fraction Ek 0 1 Mode Number 128 1 1 Ek cumulative sum of Ek 0 1 20 Mode Number The Eigenvalue Spectrum(Signal) Energy Spectrum
Filtering and Reconstruction • Decompose X into signal and noise dominated components (subspaces): where XF is the Filtered data XNoise is the Residual • Reconstructfiltered UVP velocity
U UF XNoise=U-UF
Filtered Time Series(Channel 70) Raw data Filtered data Residual
Generalizations Generalise • Response to a stimulus • Comparison of multiple data sets obtained by varying a parameter to study a transition.