Question

In: Statistics and Probability

# 3) Problem 3.5 from the textbook asks you to do and interpret a principal components...

# 3) Problem 3.5 from the textbook asks you to do and interpret a principal components analysis on the given correlation matrix, which can be entered into R with the following code:

my.cor.mat <- matrix(c(1,.402,.396,.301,.305,.339,.340, .402,1,.618,.150,.135,.206,.183, .396,.618,1,.321,.289,.363,.345, .301,.150,.321,1,.846,.759,.661, .305,.135,.289,.846,1,.797,.800, .339,.206,.363,.759,.797,1,.736, .340,.183,.345,.661,.800,.736,1), ncol=7, nrow=7, byrow=T);

As mentioned in the book, the 7 variables are 'head length', 'head breadth', 'face breadth', 'left finger length', 'left forearm length', 'left foot length','height'.

Obtain the principal components (including choosing an appropriate number of PCs). Also make an attempt to interpret your PCs.

Solutions

Expert Solution

INTERPRETATION:

   The proportion of variance row from the above output explains the percentage of information captured by each components. Thus from the proportion of variance row, we could see that the first principal components explains 75% of the total information with an Eigen value of 2.29. And the second principal component expalains 15% of information with an Eigen value of 1.04. One way to choose the appropriate number of principal components is by using the eigen value. An eigenvalue is an index that indicates how good a component is as a summary of the data. An eigenvalue of 1.0 means that the component contains the same amount of information as a single variable. Thus from Cumulative proportion row, we could infer that the first two principal components together explain 91% of the total information in the given data.

SCREE PLOT:

The second method to determine the number of components is using Scree Plot. A scree plot displays the proportion of the total variation in a dataset that is explained by each of the components in a principle component analysis. It helps you to identify how many of the components are needed to summarise the data.

The following scree plot shows the number of Eigenvalues on vertical axis, ordered from biggest to smallest and number of principal components on the horizontal axis.

From the scree plot, we conclude that the correct number of components is the number that appear prior to the elbow point.

With the help of scree plot we can see there is not much of difference in variance explained beyond the second component (elbow rule: Since component 2 is in elbow point) hence first two components has a higher variance explained. Scree plot along with the eigen value criteria (eigen value greater than one) we can retain the first two principal components which together captures 91% of the total information.

BIPLOT:

The principal component biplot is based on the first two principal components and it provides 91% information contained in a correlation plot. The biplot indicates certain relationships between variables, based on the angles between the vectors. Some variables are positively correlated, others are negatively or not correlated at all. The relationship or interaction between an object vector (red line represents the vector) and a variable vector, is positive if their angle is acute, and negative in the case of an obtuse angle. In biplot the vector representing a variable 1, forms a very wide angle, more than 120º, with variables 2 and 3.Thus the relationship between variable 1 (head length), variable 2 (head breadth​) and variable 3 (face breadth​) is negative since the angle between them is obtuse. Furthermore, it has negative loadings for PC1. Thus PC1 separating individuals with high values of variable 2 and 3 and low values of variable 1 from individuals with high values of variable1 and low values of variable 2 and 3. Whereas there is positive relationship between variables 1,3,4,5,6 and 7 since the angle between them is acute. Whereas PC1 has positive loadings with variables 4 (left finger length) , variable 5 (left forearm length) ,variable 6 (left foot length) and variable 7 (height) are positively correlated. Whereas PC2 has positive loadings for variable 1 (head length) and negative loadings for other 6 variables.


Related Solutions

How do you interpret a p-value in the context of a word problem? Please provide a...
How do you interpret a p-value in the context of a word problem? Please provide a few examples!
Chapter 4 Problem 11 Problem 15, part f. in Chapter 3 asks you to construct a...
Chapter 4 Problem 11 Problem 15, part f. in Chapter 3 asks you to construct a five year financial projection for Aquatic Supplies beginning in 2015. The five year projection appears below. a. Calculate Aquatic Supplies's sustainable and actual growth rates in these years.   b. What do these numbers suggest to you? Aquatic Supplies Co. Five Year Projected Income Statements and Balance Sheets Income Statement (in $ millions) Pro Forma Forecasts 2015 - 2019 2014 Assumptions 2015 2016 2017 2018...
Question 3 (a) Interpret the balance of payments in aggregate components of the current and capital/financial...
Question 3 (a) Interpret the balance of payments in aggregate components of the current and capital/financial account to explain an economy’s position in the international trade and finance in any country of your choice. (b) Using the country from the balance of payments table used in part 3(a), discuss how its strength or weakness in trade in its current account is offset or balanced by its capital account.
Revise the substitutionEncrypt function from section 3.5 (p. 102) of the textbook to (1) remove all...
Revise the substitutionEncrypt function from section 3.5 (p. 102) of the textbook to (1) remove all spaces from the plaintext message before it is encrypted and (2) generate the substitution cipher key from a password. (substitutionEncrypt will call genKeyFromPass to do this.) The password should be a parameter, psw, which will replace key as a parameter in the function header. Write function substitutionDecrypt, which will have two parameters, cipherText, a string, a message encrypted by substitutionEncrypt, and psw, the password...
A coworker asks you to download and install a program from the internet. How do you...
A coworker asks you to download and install a program from the internet. How do you respond?
ALL COMPONENTS / QUESTIONS MUST BE FULLY ANSWERED -- DO NOT USE THE SIMILAR TEXTBOOK SOLUTIONS...
ALL COMPONENTS / QUESTIONS MUST BE FULLY ANSWERED -- DO NOT USE THE SIMILAR TEXTBOOK SOLUTIONS ALREADY IN PLACE IF YOU ARE UNABLE TO ANSWER ALL COMPONENTS, PLEASE DO NOT ANSWER. INCOME STATEMENTS SHOULD BE IN THE MOST BASIC FORM. OPENING AND CLOSING INVENTORY, ETC., ARE NOT TO BE INCLUDED.   Ciroc Company manufactures and sells one specific product. The following information pertains to each of Ciroc's first three years of operations: Variable costs per unit: Manufacturing: Direct materials . ....
ATTENTION: ALL COMPONENTS / QUESTIONS MUST BE FULLY ANSWERED -- DO NOT USE THE TEXTBOOK SOLUTIONS...
ATTENTION: ALL COMPONENTS / QUESTIONS MUST BE FULLY ANSWERED -- DO NOT USE THE TEXTBOOK SOLUTIONS ALREADY IN PLACE IF YOU ARE UNABLE TO ANSWER ALL COMPONENTS, PLEASE DO NOT ANSWER. THANK YOU! :) O’Brien Company manufactures and sells one product. The following information pertains to each of the company’s first three years of operations: Variable costs per unit: Manufacturing: Direct materials . . . . . . . . . . . . . . . . . ....
3. What is meant by the principal agent problem? In the case of the relationship between...
3. What is meant by the principal agent problem? In the case of the relationship between shareholders and management, who is the principal and who is the agent? What is an example of something that the agent in this case might do that would be in conflict with the goals of the principal? What actions could the principal take to try to prevent (or at least reduce) this problem?
Name the principal components of a spectrometer and fluorimeter. What function do each component play? Why...
Name the principal components of a spectrometer and fluorimeter. What function do each component play? Why is detection in fluorimeter performed at 90o relative to the incident light?
In a questionnaire, a researcher asks, “Do you currently smoke cigarettes?” and the next question asks...
In a questionnaire, a researcher asks, “Do you currently smoke cigarettes?” and the next question asks smokers “How many packs a day do you smoke?” The first question is an example of: A. a filter question. B. an open-ended question. C. a vignette. D. a skip pattern.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT