/
Multivariate Statistics Multivariate Statistics

Multivariate Statistics - PowerPoint Presentation

alida-meadow
alida-meadow . @alida-meadow
Follow
390 views
Uploaded On 2016-07-20

Multivariate Statistics - PPT Presentation

Multiple Regression Canonical CorrelationRegression Binary Logistic Regression Hierarchical Linear Modeling Review of OLS Regression Univariate regression You have only one variable Y Predicted Y will be that value which satisfies the least ID: 411823

variables regression research level regression variables level research predictors canonical predicted model variance scores test weights pair odds values

Share:

Link:

Embed:

Download Presentation from below link

Download Presentation The PPT/PDF document "Multivariate Statistics" is the property of its rightful owner. Permission is granted to download and print the materials on this web site for personal, non-commercial use only, and to display it on your personal computer provided you do not modify the materials and that you retain all copyright notices contained in the materials. By downloading content from our website, you accept the terms of this agreement.


Presentation Transcript

Slide1

Multivariate Statistics

Multiple Regression

Canonical Correlation/Regression

Binary Logistic Regression

Hierarchical Linear ModelingSlide2

Review of OLS Regression

Univariate regression

You

have only one variable, Y. Predicted Y will be that value which satisfies the least

squares criterion, minimizing the sum of the squared deviations between Y and predicted Y.

Predicted Y = the intercept = the mean.Slide3

You have seen this variance before:Slide4

Bivariate Regression

Here we have a value of X associated with each value of

Y.

If

X and Y are not independent, we can reduce the residual (error) variance by using a bivariate

model.Using the same values of Y, but now each paired with a value of X, here is a scatter plot with regression line in black and residuals in red.Slide5
Slide6

Reduction in Error Variance

The

residuals are now -2.31, .30, .49, -.92, .89, -.53, and

2.08.

The

sum of the squared residuals is 11.91, yielding a residual variance of 11.91/7 = 1.70. With our univariate regression the residual variance was 4.By

adding X to the model we have reduced the error in prediction considerably.Slide7

Trivariate Regression

Here

we add a second X

variable.

If

that second X is associated with error variance in Y from the bivariate regression, the trivariate

regression should provide even better prediction of Y.Here is a three-dimensional scatter plot of the trivariate

data (produced with

Proc

g3d):Slide8

The predicted values here are those on the plane that passes through the three-dimensional space such that the residuals (differences between predicted Y, on the plane, and observed Y) are as small as possible.Slide9

The Data

The sum of the squared residuals now is .16 for a residual variance of .16/7 = .023. We have almost eliminated the error in prediction.Slide10

Hyperspace

If we

have three or more predictors, our scatter plot will be in

hyperspace

The predicted values of Y will be located on the “regression surface” passing through hyperspace in such a way that the sum of the squared residuals is as small as possible.Slide11

Dimension-Jumping

In univariate regression the predicted values are a constant. You have a point in one-dimensional

space.

In

bivariate regression the predicted values form a straight line regression surface in two-dimensional

space.In trivariate

regression the predicted values form a plane in three dimensional space.Slide12

Multiple Regression

One continuous Y, two or more X variables.

X variables may be continuous or dichotomous

k

groups may be represented by

k

-1 dichotomous dummy variablesSlide13

Weight the X Variables

Create a weighted combination of the

Xs

Such that the correlation between Y and

is as large as possible.

That is,

a

is predicted Y when all

Xs

are zero

b

i

is number of points Y changes for each one point change in

X

i

, above and beyond the effect of all other predictors.Slide14

Standardized (Beta) Weights

i

is the number of standard deviations that

Y

i

changes for each standard deviation change in X

i

, above and beyond the effect of all other predictors.Slide15

Sequential AnalysisThe predictors may be entered into the model all at once (simultaneous), or

In sets of one or more (sequential)

Order of entry may be determined by

Temporal relationships among predictors

A causal model

Economic considerationsOther considerationsSlide16

Economic ConsiderationsWant to predict college GPA.

Enter inexpensive predictors first

High school GPA

Verbal and quantitative SAT

Evaluation of an essay submitted by student

Ratings from a panel of professors who interviewed the student on campus.Slide17

Stepwise Selection

A statistical algorithm is used to determine order of entry.

The goal is to create a model that has fewer predictors but does nearly as well as a model with all predictors.

Stepwise selection is among the most misunderstood analyses known to man.

It commonly leads to inappropriate conclusions.Slide18

Who Will Fail College Physics?

McCammon, S., Golden, J., & Wuensch, K. L. (1988)

Predict grades in physics classes from

Critical Thinking test scores (CT)

Thurstone’s Primary Mental Abilities Test (IQ)

Arithmetic skills test scores (ARI)Algebra skills test scores (ALG)Math anxiety scale scores (ANX)Slide19

Simultaneous Analysis

R

is the correlation between the observed values of Y and the predicted values of Y

R

= .40 and was statistically significant.

Model explained 16% of the variance in grades.

Every predictor was sig. correlated with grades (zero-order r).But in the model only ALG and CT had significant unique effects.Slide20

Stepwise Analysis

Tried both Forwards Selection and Backwards Selection

Both led to a model with only ALG and CT.

We recommended that Physics use just the ALG and CT tests to predict who is at risk of failing.

The motivation for using stepwise was economic – why use 5 predictors when 2 will do as well?Slide21

Does Sex Matter?McCammon insisted that I address this issue.

Means and variances differed little between the sexes.

Just to please McCammon, I did the analysis separately for men and women.Slide22

Sex MattersAmong the men, not a single predictor was significantly related to grades.

Among the women, every predictor was significantly related to grades.

Women’s performance is class is well related to their abilities.

There must be some other more important factor for predicting men’s performance.Slide23

Expert Reviewers

Those at the physics journal to which we submitted the manuscript rejected it.

They argued that it was not appropriate to publish an unexpected finding (the sex difference).

Such “hypothesis-induced blindness” is not all that uncommon, unfortunately.Slide24

Political CorrectnessWe submitted the manuscript to a Science Education journal.

One reviewer insisted that it not be published as it is “sexist” to compare the sexes.

We convinced the editor otherwise.Slide25

Canonical Correlation/Regression

AKA multiple, multiple regression

AKA multivariate multiple regression

Have two sets of variables (

Xs

and Ys)

Create a pair of canonical variates

a

1

X

1

+

a

2

X

2

+ .... +

a

p

X

p

, and

b

1

Y

1

+

b

2

Y

2

+ .... + bmY

m Such that the correlation between the canonical variates is as large as possible.Slide26

Patel, Long, McCammon, & Wuensch (1995)

Male college students

Xs = Personality variables (MMPI)

PD (psychopathically deviant, Scale 4) – social maladjustment and hostility

MF (masculinity/femininity, Scale 5) – in men, low scores = stereotypical masculinity

MA (hypomania, Scale 9) – overactivity, flight of ideas, low frustration tolerance, narcissism, irritability, restlessness, hostility, and difficulty with controlling impulses

Scale K (clinical defensiveness) – low scores = unusually frank. Slide27

Ys: Homonegativity Variables

IAH (Index of Attitudes Towards Homosexuals)

Affective component of “homophobia,” disgust.

High scores – discomfort around homosexuals

SBS (self-report behavior scale)

Past negative actions towards male homosexuals

High score – high frequency of such actions.Slide28

What is a Canonical Variate?

A weighted linear combination of variables

You can think of it as

Something (a superordinate variable) you have created from several variables, or

An estimate of an construct, a latent variable, a dimension that causes variance in the observed variables.Slide29

What is This Thing I Have Created or Discovered?

Look at the standardized weights used to construct the canonical variate.

Even better, look at the loadings

Compute, for each case, a score on the canonical variate.

Correlate those scores with scores on the original variables in its set.Slide30

The Weights

MMPI

Femininity

-.61

Scale K

-.60

Psycho. Dev.

.43

Hypomania

.46

Homoneg.

SBS

.93

IAH

.15

Being stereotypically masculine, unusually frank, psycho. deviant, and hypomanic is associated with acting negatively towards gays.Slide31

The Loadings

MMPI

Scale K

-.53

Hypomania

.53

Femininity

-.49

Psycho. Dev.

.32

Homoneg.

SBS

.99

IAH

.52

Being

unusually frank, hypomanic, stereotypically masculine, and psycho

. deviant,

is

associated with being uncomfortable around and acting negatively towards gays.Slide32

Weights or Loadings?Like the Beta weights in a multiple regression, the weights for a canonical variate can be deceptive.

If two variables within a set are well correlated with each other, one or both weights may be artificially low.

I generally prefer to interpret loadings.Slide33

A Second Pair of Canonical Variates

There likely is variance in the variables that was not “captured” by the first pair of canonical variates.

We can create a second pair, orthogonal to the first, from that residual variance.

The number of pairs of canonical variates we can create is equal to the number of variables in the smaller set.Slide34

The Second Pair of Weights

MMPI

Femininity

.70

Hypomania

.67

Psycho. Dev.

-.09

Scale K

-.04

Homoneg.

IAH

-1.08

SBS

.57

Being unusually feminine and hypomanic is associated with not being uncomfortable around gays but acting negatively towards them anyhow.Slide35

The Equal Opportunity Bully

What are we to make of “not being uncomfortable around gays but acting negatively towards them anyhow.”

One student called this “the equal opportunity bully.”

He acts negatively towards everybody, gay or straight.Slide36

The Second Pair of Loadings

MMPI

Femininity

.76

Hypomania

.72

Psycho. Dev.

.21

Scale K

-.08

Psycho. Dev.

.21

Homoneg.

IAH

-.85

SBS

.14

Being unusually feminine and hypomanic is associated with not being uncomfortable around gays.Slide37

The Canonical Correlations

Compute canonical variate scores for each case.

Correlate each with its pairmate.

Will always be highest for first pair, lower for each subsequent pair.

Here, the canonical corrs are .38 and .32.

Both were statistically significant.Slide38

Binary Logistic Regression

The criterion variable is dichotomous.

Predictor variables may be categorical or continuous.

If predictors are all continuous and nicely distributed, may use discriminant function analysis instead.

If predictors are all categorical, may use

logit analysis instead.Slide39

Wuensch & Poteat, 1998

Cats being used as research subjects.

Stereotaxic surgery.

Subjects pretend they are on university research committee.

Complaint filed by animal rights group.

Vote to stop or continue the research.Slide40

Purpose of the Research

Cosmetic (test a hair care ingredient)

Theory Testing (neuroscience & learning)

Meat Production (feed the third world)

Veterinary (save cats from disease)

Medical (save young adults from disease)Slide41

Predictor Variables

Gender

Ethical Idealism

Ethical Relativism

Purpose of the ResearchSlide42

The Logit Model

Decision 0 = stop, 1 = continue

Gender 0 = female, 1 = male

Model is ….. logit =

is

the predicted probability of the event which is coded with 1 (continue the research) rather than with 0 (stop the research). Slide43

Decision =Idealism, Relativism, Gender, Purpose

Need 4 dummy variables to code the five purposes.

Consider the Medical group a reference group.

Dummy variables are: Cosmetic, Theory, Meat, Veterin.

0 = not in this group, 1 = in this group.Slide44

Tests of Significance of Unique EffectsSlide45

Exp(b) is an Odds Ratio

For gender, b was 1.255.

When gender changes from 0 (female) to 1 (male) the odds of approving the research (1) are multiplied by 3.508

This is above and beyond the effects of other predictors in the modelSlide46

Effect of Idealism

For idealism, b was -0.701.

For each one point increase in idealism, the odds of approving the research are multiplied by .496.

Put another way, for each one point increase in idealism, the odds of voting to stop the research are multiplied by 1/.496 = 2.016.Slide47

Odds Ratios for Dummy Variables

Compares being in one group versus being in the reference group (the one without a dummy variable, medical in this case).

For theory, the odds ratio is .314.

Odds of approving the research are 1/.314 = 3.185 times higher for the medical research than for the theory-testing neuroscience research.Slide48

Effects of Purpose of Research

Odds of approving the research were significant lower for ____ than for medical research

Neuroscience research

Agricultural research

But no significant difference for

Cosmetic testingVeterinary researchSlide49

ClassificationThe model can be used to predict, for each case, the probability (

p

) that the case is the target event (here, approving the research).

You then need a decision rule: If

p

≥ criterion, then predict it is (or will be) the target event.Slide50

The Classification Decision Rule

A criterion of .5 might seem obvious, but that ignores the fact that false positives and false negatives might not be equally serious.

You might want to use a criterion other than .5.

Slide51

Screening Test for Cancer

Which is the more serious error

False Positive – test says you have cancer, but you do not

False Negative – test says you do not have cancer but you do

Want to reduce the False Negative rate?

Lower the cutoff for predicting that there is cancer.Slide52

Classification Performance

Overall Percentage Correct Classifications

Sensitivity

P(correct prediction | event did occur)

Specificity

P(correct prediction | event did not occur)

False Positive RateP (incorrect prediction | predicted occurrence)

False Negative Rate

P (incorrect prediction | predicted nonoccurrence)Slide53

For Our DataSlide54

Hierarchical Linear Modeling

You have data at two or more levels.

Cases at each level (except the highest) are nested within cases at the next level up.

For example, Level 1 is pupils.

Level 2 is schools.

Level 3 is school districts.Slide55

School Climate

Rowan et al. (1991)

Level 1 cases are teachers

Outcome Variables are ratings of

Principal

leadershipTeacher control of policy

Staff cooperationLevel 1 predictors are teacher demographics Slide56

Level 2Level 2 cases are schools

Predictors are

Sector: school was public or Catholic

Size of school

Percentage minority enrollment

Average student SESAnd other such variables. Slide57

ResultsLevel 1: Ratings were related to demographics

For example, women thought the climate better than did men, and

Those teaching English, Science, and Math thought the climate worse than did others.

Level 2: Ratings were better in Catholic schools than in public schools.Slide58

Noise-Induced Annoyance

Fidell

et al. (1995)

Humans in households in three different neighborhoods rated, on successive nights

How annoyed they were by aircraft noise

How long it took to fall asleep, andA machine measured the noise level at night.Slide59

The Design: Three Levels

Level 1 cases were the nights (repeated measures).

Level 2 cases were humans.

Level 3 cases were households.

Ratings of annoyance was the outcome variable.Slide60

The Predictors

Level 1 (nights): latency to sleep and interior noise level, and neighborhoods were predictors.

Level 2 (humans): age of respondent.

Level 3 (households): neighborhood (three groups)Slide61

ResultsThere was significant variability in annoyance among humans and among households.

Latency to sleep and noise level were related to ratings of annoyance.

The neighborhoods did not differ from each other on annoyance.