Question


Given two sets of data, A and B.

i) Data set A has an r value of -0.81 and data set B has an r value of 0.94. Describe the differences between the two data sets as completely as you can using the regression information we have learned.

ii) Which linear regression equation, the one for A or the one for B, would probably be a better predictor? Why?

Solutions

Expert Solution

Solution :-

i)

Data set A has an r value (correlation coefficient) of -0.81, i.e., a high degree of negative correlation.

When two related variables move in opposite directions, their relationship is negative.

When the coefficient of correlation (r) is less than 0, the correlation is negative. When r is -1.0, there is a perfect negative correlation.

---------------------------------------------------------------------------------------------------------------------------------

Data set B has an r value (correlation coefficient) of 0.94, i.e., a high degree of positive correlation.

When two related variables move in the same direction, their relationship is positive.

This correlation is measured by the coefficient of correlation (r).

When r is greater than 0, the correlation is positive. When r is +1.0, there is a perfect positive correlation.
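To see how the sign of r follows the direction of the relationship, here is a minimal sketch that computes Pearson's r directly from its definition. The data values and the helper name `pearson_r` are made up for illustration; they are not the data sets A and B from the question, which are not given.

```python
import math

def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of products of deviations (numerator of r)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations (denominator of r)
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

# Hypothetical data, chosen only to illustrate the sign of r
x = [1, 2, 3, 4, 5]
y_up = [2, 4, 5, 4, 6]    # tends to rise as x rises
y_down = [6, 4, 5, 4, 2]  # tends to fall as x rises

r_up = pearson_r(x, y_up)
r_down = pearson_r(x, y_down)

print(f"r for rising data:  {r_up:.2f}")
print(f"r for falling data: {r_down:.2f}")
```

The rising data gives a positive r and the falling data gives a negative r of the same size, because the second y list is the first one reversed.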

*****************************************************************************************************************

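ii)

The linear regression equation for B (r = 0.94) would probably be a better predictor than the one for A (r = -0.81).

This is because the strength of a linear relationship depends on the absolute value of r, not on its sign. Since |0.94| > |-0.81|, the points in data set B lie closer to their regression line than the points in data set A do, so predictions from B's equation should have smaller errors.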

The correlation coefficient, r, can range from -1 to +1. When r = +1, there is a perfect positive correlation between two variables. When r = -1, there is a perfect negative correlation between two variables. When r = 0, there is no correlation between the variables. In reality, it's very rare to find r values of +1 or -1; rather, we see r values somewhere between these two extremes. For example, if we determined that two variables had an r value of 0.94, for all practical purposes, that would indicate a very strong, but not perfect, positive correlation between the two variables. Similarly, an r value of -0.81 would indicate a very strong, but not perfect, negative correlation between the two variables.
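The comparison can also be checked with the coefficient of determination. For a simple linear regression with one predictor, r squared is the fraction of the variation in y that the regression line explains. The short sketch below uses only the two r values given in the question; the variable names are illustrative.

```python
# r values given in the question
r_a = -0.81
r_b = 0.94

# Coefficient of determination for a simple (one-predictor) linear regression
r2_a = r_a ** 2  # about 0.6561
r2_b = r_b ** 2  # about 0.8836

# The sign of r does not matter for prediction quality, only its magnitude
better = "B" if abs(r_b) > abs(r_a) else "A"

print(f"A explains about {r2_a:.1%} of the variation in y")
print(f"B explains about {r2_b:.1%} of the variation in y")
print(f"Better predictor: regression equation {better}")
```

So the line for A explains roughly 66% of the variation in its response variable, while the line for B explains roughly 88%, which is why B is the better choice.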

The degrees of correlation used above are:

r = 0: no correlation

r between 0 and ±0.25: lower degree of correlation

r between ±0.25 and ±0.75: moderate degree of correlation

r between ±0.75 and ±0.99: higher degree of correlation

r = -1: perfect negative correlation

r = +1: perfect positive correlation
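The ranges above can be written as a small lookup function. This is only a sketch: the function name `describe_correlation` is hypothetical, and because the list does not say which band the boundary values 0.25 and 0.75 belong to, the code assumes they fall in the higher band.

```python
def describe_correlation(r):
    """Label r using the ranges listed above (boundary handling is an assumption)."""
    if not -1.0 <= r <= 1.0:
        raise ValueError("r must lie between -1 and +1")
    if r == 0:
        return "no correlation"

    direction = "positive" if r > 0 else "negative"
    size = abs(r)

    if size == 1.0:
        return f"perfect {direction} correlation"
    if size < 0.25:
        degree = "lower"
    elif size < 0.75:
        degree = "moderate"
    else:
        degree = "higher"
    return f"{degree} degree of {direction} correlation"

print(describe_correlation(-0.81))  # data set A
print(describe_correlation(0.94))   # data set B
```

Both data sets fall in the same "higher degree" band, so the band alone does not separate them; the comparison in part ii rests on the size of |r|.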

