Question

In: Statistics and Probability

The eigenvalue is a measure of how much of the variance of the observed variables a...

The eigenvalue is a measure of how much of the variance of the observed variables a factor explains. Any factor with an eigenvalue ≥1 explains more variance than a single observed variable, so if the factor for socioeconomic status had an eigenvalue of 2.3 it would explain as much variance as 2.3 of the three variables. This factor, which captures most of the variance in those three variables, could then be used in another analysis. The factors that explain the least amount of variance are generally discarded. How do we determine how many factors are useful to retain?

Solutions

Expert Solution

Dropping unimportant variables from your analysis

Once you run a factor analysis and think you have some usable results, it’s time to eliminate variables that are not “strong” enough. They are usually the ones with low factor loadings, {factor loadings are explained in last} although additional criteria should be considered before taking out a variable.

As a rule of thumb, your variable should have a rotated factor loading of at least |0.4| (meaning ≥ +.4 or ≤ –.4) onto one of the factors in order to be considered important.

Some researchers use much more stringent criteria such as a cut-off of |0.7|. In some instances, this may not be realistic: for example, when the highest loading a researcher finds in her analysis is |0.5|.

Other researchers relax the criteria to the point where they include variables with factor loadings of |0.2|.

Which cut-offs to use depends on whether you are running a confirmatory or exploratory factor analysis, and on what is usually considered an acceptable cut-off in your field. In addition, a variable should ideally only load cleanly onto one factor.

Factor Loading

The relationship of each variable to the underlying factor is expressed by the so-called factor loading. Here is an example of the output of a simple factor analysis looking at indicators of wealth, with just six variables and two resulting factors.

Variables Factor 1 Factor 2
Income 0.65 0.11
Education 0.59 0.25
Occupation 0.48 0.19
House value 0.38 0.60
Number of public parks in neighborhood 0.13 0.57
Number of violent crimes per year in neighborhood 0.23 0.55

The variable with the strongest association to the underlying latent variable. Factor 1, is income, with a factor loading of 0.65.

Since factor loadings can be interpreted like standardized regression coefficients, one could also say that the variable income has a correlation of 0.65 with Factor 1. This would be considered a strong association for a factor analysis in most research fields.

Two other variables, education and occupation, are also associated with Factor 1. Based on the variables loading highly onto Factor 1, we could call it “Individual socioeconomic status.”

House value, number of public parks, and number of violent crimes per year, however, have high factor loadings on the other factor, Factor 2. They seem to indicate the overall wealth within the neighborhood, so we may want to call Factor 2 “Neighborhood socioeconomic status.”

Notice that the variable house value also is marginally important in Factor 1 (loading = 0.38). This makes sense, since the value of a person’s house should be associated with his or her income.


Related Solutions

The eigenvalue is a measure of how much of the variance of the observed variables a...
The eigenvalue is a measure of how much of the variance of the observed variables a factor explains. Any factor with an eigenvalue ≥1 explains more variance than a single observed variable, so if the factor for socioeconomic status had an eigenvalue of 2.3 it would explain as much variance as 2.3 of the three variables. This factor, which captures most of the variance in those three variables, could then be used in another analysis. The factors that explain the...
The price elasticity of demand is a measure of how much the demand for a product...
The price elasticity of demand is a measure of how much the demand for a product is affected by a change in price. Review the following scenario and answer the questions that follow. Evelyn makes $15,000 per year and Tami makes $150,000 per year. They are both buying roast beef at the grocery store. Evelyn asks for $10 worth of roast beef, and Tami asks for 10 pounds of roast beef. What is each consumer’s price elasticity of demand? Identify...
The Fan Cost Index​ (FCI) is a measure of how much it costs a family of...
The Fan Cost Index​ (FCI) is a measure of how much it costs a family of four to attend a major league baseball game. The frequency distribution and the histogram for the Fan Cost Index are given below: FCI​ (Dollars) Frequency ​100-149.99 2 ​150-199.99 0 ​200-249.99 1 ​250-299.99 1 ​300-349.99 9 ​350-399.99 12 ​400-449.99 5 (a) Determine the shape of the distribution. A. Right Skewed B. Left Skewed C. Normal D. Uniform ​ (b) Determine if the mean or the...
The capacitance of an object is a measure of how much charge separation can be produced...
The capacitance of an object is a measure of how much charge separation can be produced on the object by applying a voltage difference to it, or contrarily, how much voltage arises across the object as a result of a particular charge separation developing. The equation defining the capacitance is Q = C ΔV where Q is the charge separation (+Q and -Q at two different places on the capacitor), C is the capacitance, and ΔV is the potential difference....
Problem 6: R simulation.Use the following codes to read the observed frequencies of how much the...
Problem 6: R simulation.Use the following codes to read the observed frequencies of how much the students smoke in dataset ”survey”. library(MASS) # if you have not installed the package, run install.packages(”MASS”) first. survey = na.omit(survey) # remove data with missing values smoke.freq=table(survey$Smoke) # the frequencies of smoke (1) Suppose the campus smoking statistics is as below. Determine whether the sample data in survey supports it at 0.05 significance level. (Hint: using the command chisq.test.) Heavy Never Occas Regul 0.045...
What is operating leverage? A measure of how much a debt a firm uses to finance...
What is operating leverage? A measure of how much a debt a firm uses to finance investments. A measure of the breakeven point for a company. A measure of how sensitive a company's profits are to changes in demand. The production environment in a company.
How much variance in grades is explained by artistic sensitivity? Show how you derived your answer...
How much variance in grades is explained by artistic sensitivity? Show how you derived your answer students AS G 1 8 81 2 9 97 3 8 72 4 7 79 5 6 70 6 4 60 7 6 81 8 7 87 9 6 77 10 0 85 11 6 64 12 7 91 13 8 75 14 7 73 15 7 76 16 8 83 17 8 83 18 2 55 19 8 92 20 9 92 21...
How do we use multiple regression analysis on how much variance in a continuous dependent variable...
How do we use multiple regression analysis on how much variance in a continuous dependent variable is explained by a set of predictors?
I have the appropriate data set. This is the question: How much variance in maximum wind...
I have the appropriate data set. This is the question: How much variance in maximum wind speed is explained by atmospheric pressure? With a one-unit change in atmospheric pressure, what is the corresponding change in maximum wind speed?   Can you tell me the procedure to use in SPSS in order to answer these questions?
The variance in a production process is an important measure of the quality of the process....
The variance in a production process is an important measure of the quality of the process. A large variance often signals an opportunity for improvement in the process by finding ways to reduce the process variance. Machine 1 2.95 3.45 3.50 3.75 3.48 3.26 3.33 3.20 3.16 3.20 3.22 3.38 3.90 3.36 3.25 3.28 3.20 3.22 2.98 3.45 3.70 3.34 3.18 3.35 3.12 Machine 2 3.22 3.30 3.34 3.28 3.29 3.25 3.30 3.28 3.38 3.34 3.35 3.19 3.35 3.05 3.36...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT