and Empircal Orthogonal Functions Syllabus Lecture 01 Describing Inverse Problems Lecture 02 Probability and Measurement Error Part 1 Lecture 03 Probability and Measurement Error Part 2 ID: 722786
Download Presentation The PPT/PDF document "Lecture 18 Varimax Factors" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.
Slide1
Lecture 18
Varimax
Factors
and
Empircal
Orthogonal FunctionsSlide2
Syllabus
Lecture 01 Describing Inverse Problems
Lecture 02 Probability and Measurement Error, Part 1
Lecture 03 Probability and Measurement Error, Part 2
Lecture 04 The L
2
Norm and Simple Least Squares
Lecture 05 A Priori Information and Weighted Least Squared
Lecture 06 Resolution and Generalized Inverses
Lecture 07 Backus-Gilbert Inverse and the Trade Off of Resolution and Variance
Lecture 08 The Principle of Maximum Likelihood
Lecture 09 Inexact Theories
Lecture 10
Nonuniqueness
and Localized Averages
Lecture 11 Vector Spaces and Singular Value Decomposition
Lecture 12 Equality and Inequality Constraints
Lecture 13 L
1
, L
∞
Norm Problems and Linear Programming
Lecture 14 Nonlinear Problems: Grid and Monte Carlo Searches
Lecture 15 Nonlinear Problems: Newton’s Method
Lecture 16 Nonlinear Problems: Simulated Annealing and Bootstrap Confidence Intervals
Lecture 17 Factor Analysis
Lecture 18
Varimax
Factors,
Empircal
Orthogonal Functions
Lecture 19 Backus-Gilbert Theory for Continuous Problems; Radon’s Problem
Lecture 20 Linear Operators and Their
Adjoints
Lecture 21
Fr
é
chet
Derivatives
Lecture 22 Exemplary Inverse Problems, incl. Filter Design
Lecture 23 Exemplary Inverse Problems, incl. Earthquake Location
Lecture 24 Exemplary Inverse Problems, incl.
Vibrational
ProblemsSlide3
Purpose of the Lecture
Choose Factors Satisfying A Priori Information of Spikiness
(
varimax
factors)
Use Factor Analysis to Detect
Patterns in data
(EOF’s)Slide4
Part 1: Creating Spiky FactorsSlide5
can we find “better” factors
that those returned by
svd
()
?Slide6
mathematically
S
=
CF
=
C
’
F
’
with
F
’ =
M F
and
C
’ =
M
-1
C
where
M
is any
P
×
P
matrix with an inverse
must rely on prior information to choose
MSlide7
one possible type of prior information
factors should contain mainly just a few elementsSlide8
example of rock and minerals
rocks contain minerals
minerals contain elements
Mineral
Composition
Quartz
SiO
2
Rutile
TiO
2
Anorthite
CaAl
2
Si
2
O
8
Fosterite
Mg
2
SiO
4Slide9
example of rock and minerals
rocks contain minerals
minerals contain elements
Mineral
Composition
Quartz
SiO
2
Rutile
TiO
2
Anorthite
CaAl
2Si2O8FosteriteMg2SiO4
factors
most of these minerals are “simple” in the sense that each contains just a few elementsSlide10
spiky factors
factors containing mostly just a few elementsSlide11
How to quantify spikiness?Slide12
variance as a measure of spikinessSlide13
modification for factor analysisSlide14
modification for factor analysis
depends on the square, so positive and negative values are treated the sameSlide15
f
(1)
= [1, 0, 1, 0, 1, 0]
T
is much spikier than
f
(2)
= [1, 1, 1, 1, 1, 1]
T
Slide16
f
(2)
=[1, 1, 1, 1, 1, 1]
T
is just as spiky as
f
(3)
= [1, -1, 1, -1, -1, 1]
T
Slide17
“
varimax
” procedure
find spiky factors without changing P
start with P
svd
()
factors
rotate pairs of them in their plane by angle
θ
to maximize the overall spikinessSlide18
f
B
f
A
f’
B
f’
A
qSlide19
determine
θ
by maximizingSlide20
after tedious trig the solution can be shown to beSlide21
and the new factors are
in this example A=3 and B=5Slide22
now one repeats for every pair of factors
and then iterates the whole process several times
until the whole set of factors is as spiky as possibleSlide23
Old
New
f
5
f
2
f
3
f
4
f
’
5
f
’2
f’
3f’4
SiO2
TiO2
Al
2
O
3
FeO
total
MgO
CaO
Na
2
O
K
2
O
example: Atlantic Rock datasetSlide24
Old
New
f
5
f
2
f
3
f
4
f
’
5
f
’2
f’
3f’4
SiO2
TiO2
Al
2
O
3
FeO
total
MgO
CaO
Na
2
O
K
2
O
example: Atlantic Rock dataset
not so spikySlide25
Old
New
f
5
f
2
f
3
f
4
f
’
5
f
’2
f’
3f’4
SiO2
TiO2
Al
2
O
3
FeO
total
MgO
CaO
Na
2
O
K
2
O
example: Atlantic Rock dataset
spikySlide26
Part 2: Empirical Orthogonal FunctionsSlide27
row number in the sample matrix could be meaningful
example: samples collected at a succession of times
timeSlide28
column number in the sample matrix could be meaningful
example: concentration of the same chemical element at a sequence of positions
distanceSlide29
S
=
CF
becomesSlide30
S
=
CF
becomes
distance dependence
time dependenceSlide31
S
=
CF
becomes
each loading: a temporal pattern of variability of the corresponding factor
each factor:
a spatial pattern of variability of the elementSlide32
S
=
CF
becomes
there are P patterns and they are sorted into order of importanceSlide33
S
=
CF
becomes
factors now called EOF’s (empirical orthogonal functions)Slide34
example 1
hypothetical mountain profiles
what are the most important spatial patterns
that characterize mountain profilesSlide35
this problem has space but not time
s(
x
j
,
i
) =
Σ
k=1
p
c
ki f (k)(xj)Slide36
this problem has space but not time
s(
x
j
,
i
) =
Σ
k=1
p
c
ki f (k)(xj )
factors are spatial patterns that add together to make mountain profilesSlide37Slide38
elements:
elevations ordered by distance along profileSlide39
EOF 3
EOF 2
EOF 1
index,
i
index,
i
index,
i
f
i
(1)
f
i
(2)
f
i
(3)
λ
1
= 38
λ
2
= 13
λ
3
= 7Slide40
factor loading
, C
i2
factor loading
, C
i3Slide41
example 2
spatial-temporal patterns
(synthetic data)Slide42
the dataSlide43
the data
spatial
pattern at
a single
time
x
y
t=1Slide44
the data
timeSlide45
the dataSlide46
4
6
1
9
1
2
1
3
6
4
6
1
9
1
2
1
3
6
s
need to unfold each 2D image into vectorSlide47
index,
i
λ
i
p=3Slide48
EOF
1
EOF
2
EOF
3
loadng
1
loadng
2
loadng
3
(A)
(B)
time,
t
time,
t
time,
tSlide49
example 3
spatial-temporal patterns
(actual data)
sea surface temperature in the Pacific OceanSlide50
29
S
29
N
124
E
290E
latitude
longitude
equatorial Pacific Ocean
sea surface temperature (black = warm)
CAC Sea Surface TemperatureSlide51Slide52
the image is 30 by 84 pixels in size, or 2520 pixels total
to use
svd
()
, the image must be unwrapped into a vector of length 2520Slide53
2520 positions in the equatorial Pacific ocean
399 times
“element” means temperatureSlide54
s
ingular values,
S
ii
index,
i
singular valuesSlide55
s
ingular values,
S
ii
index,
i
singular values
no clear cutoff for P, but the first 12 singular values are considerably larger than the restSlide56Slide57Slide58Slide59Slide60Slide61Slide62
using SVD to approximate dataSlide63
S
=
C
M
F
M
S
=
C
P
F
P
S
≈CP’FP’ With M EOF’s, the data is fit exactlyWith P chosen to exclude only zero singular values, the data is fit exactly
With P’<P, small non-zero singular values are excluded too, and the data is fit only approximatelySlide64
A) Original
B) Based on first 5 EOF’s