Question

In: Math

Data can be collected and organized as an ordered pair (x, y). The data can be...

Data can be collected and organized as an ordered pair (x, y). The data can be analyzed to determine the type and strength of a correlation and to calculate a regression line in order to make a prediction.

Use the internet to find a data set of ordered pairs. Key terms to search: Free Public Data Sets and Medical Data Sets.

Create a Post:

Introduce your Data Set.

  1. Which would be the independent variable, and which would be the dependent variable?
  2. Without drawing a scatter plot, would you expect a positive, negative or no correlation? Explain.
  3. Would you categorize your data to have a strong or weak correlation? Why?
  4. What would the r2 value tell you about the data that you selected?
  5. What is the equation of the regression line?
  6. Use the regression line to make a prediction about the data you collected.

Solutions

Expert Solution

1. Which would be the independent variable, and which would be the dependent variable?

Ans: The independent variable is Age in years and the dependent variable is blood pressure.

2. Without drawing a scatter plot, would you expect a positive, negative or no correlation? Explain.


From the above scatter plot, the increases the value of age increases the value of blood pressure and vice versa. Hence, the relationship between age and blood pressure is positive.

Would you categorize your data to have a strong or weak correlation? Why?

Ans: Pearson correlation (r) of Age and Blood pressure is 0.896 and more than 0.5. Hence, these two data set has a strong correlation.

What would the r2 value tell you about the data that you selected?

Ans: The value of r2 is 0.896^2=0.8028. Hence, 80.28% variation of blood pressure is explained by the age.

What is the equation of the regression line?

The regression equation is
Blood pressure = 80.778 + 1.138 Age

Use the regression line to make a prediction about the data you collected.

Age Blood pressure Predicted value
56 147 144.5060189
42 125 128.5739467
72 160 162.7141015
36 118 121.7459157
63 149 152.472055
47 128 134.2639725
55 150 143.3680138
49 145 136.5399828
38 115 124.0219261
42 140 128.5739467
68 152 158.1620808
60 155 149.0580396

Related Solutions

A twin prime is a pair of primes (x, y), such that y = x +...
A twin prime is a pair of primes (x, y), such that y = x + 2. Construct a list of all twin primes less than 1000. The result should be stored in a list of numeric vectors called twin_primes, whose elements are the twin primes pairs. Print the length of the list twin_primes and print the 10th and the 15th elements of the list, i.e. twin_primes[[10]] and twin_primes[[15]]. Please use R-code.
Find x and y so that the ordered data set has a mean of 42 and...
Find x and y so that the ordered data set has a mean of 42 and a median of 35, hence compute the harmonic mean and geometric mean of the data sample.        17, 22, 26, 29, 34, x, 42, 67, 70, y                                                          The lengths, in centimeters, of 18 snakes are given below. Draw an ordered stem-and-leaf diagram for the data.                                                                                                                                     24, 62, 20, 65, 27, 67, 69, 32, 40, 53, 55, 47, 33,...
A sales manager collected the following data on x = years of experience and y =...
A sales manager collected the following data on x = years of experience and y = annual sales ($1,000s). The estimated regression equation for these data is ŷ = 83 + 4x. Salesperson Years of Experience Annual Sales ($1,000s) 1 1 80 2 3 97 3 4 102 4 4 107 5 6 103 6 8 116 7 10 119 8 10 123 9 11 127 10 13 136 (a) Compute SST, SSR, and SSE. SST=SSR=SSE= (b) Compute the coefficient...
A sales manager collected the following data on x = years of experience and y =...
A sales manager collected the following data on x = years of experience and y = annual sales ($1,000s). The estimated regression equation for these data is ŷ = 82 + 4x. Salesperson Years of Experience Annual Sales ($1,000s) 1 1 80 2 3 97 3 4 102 4 4 107 5 6 103 6 8 101 7 10 119 8 10 128 9 11 127 10 13 136 (a) Compute SST, SSR, and SSE. SST=SSR=SSE= (b) Compute the coefficient...
The following rate data was collected for a reaction X + Y -> Products      Exp.     ...
The following rate data was collected for a reaction X + Y -> Products      Exp.            [X]                         [Y]                Rate of formation of a product      1                0.23 M                 0.17 M                    0.33 M/h     2                0.46 M                 0.17 M                    0.66 M/h     3               0.23 M                0.51 M                    0.99 M/h             a. Determine the orders with respect to the two reactants and write the rate law for the reaction b. Calculate the value...
The following data for the dependent variable, y, and the independent variable, x, have been collected...
The following data for the dependent variable, y, and the independent variable, x, have been collected using simple random sampling: X Y 10 120 14 130 16 170 12 150 20 200 18 180 16 190 14 150 16 160 18 200 Construct a scatter plot for these data. Based on the scatter plot, how would you describe the relationship between the two variables? Compute the correlation coefficient.
2. A student collected data on the number of large pizzas consumed, y, while x students...
2. A student collected data on the number of large pizzas consumed, y, while x students were watching a professional football game on TV. The data from five games are given in table below. Number of students, x 2 5 6 3 4 Number of large pizzas, y 1 6 10 3 5 Please show work for full credit. (e) Interpret your interval from part (d). (f) Calculate r2 (follow steps on page 92). (g) Interpret r2. (h) Compute linear...
Consider the following collected observations on the independent (X) and dependent variable (Y). Can we determine...
Consider the following collected observations on the independent (X) and dependent variable (Y). Can we determine the relation between X and Y wit the Pearson coefficient of correlation and why? If yes, determine it and interpret. Test the significance of this coefficient of correlation at 5% level. X Y XY X2 Y 2 errors squared errors 1 6 82 492 36 6724 1,24 1,53 2 2 86 172 4 7396 -9,25 85,54 3 15 43 645 225 1849 -5,16 26,67...
Data collected from a sample of houses are organized and stored in the Table below. Develop...
Data collected from a sample of houses are organized and stored in the Table below. Develop a regression model that predicts the Assessed value of a house based on size of the house and presence of a fireplace (Fireplace=1. No fireplace=0). assessed value size fireplace 234.4 2 1 227.4 1.71 0 225.7 1.45 0 235.9 1.76 1 229.1 1.93 0 220.4 1.2 1 225.8 1.55 1 235.9 1.93 1 228.5 1.59 1 229.2 1.5 1 236.7 1.9 1 229.3 1.39...
Running times (Y) and maximal aerobic capacity (X) for 14 female Runners. Data collected for running...
Running times (Y) and maximal aerobic capacity (X) for 14 female Runners. Data collected for running times and maximal aerobic capacity are listed below X: 61.32 55.29 52.83 57.94 53.31 51.32 52.18 52.37 57.91 53.93 47.88 47.41 47.17 51.05 Y: 39.37 39.80 40.03 41.32 42.03 42.37 43.93 44.90 44.90 45.12 45.60 46.03 47.83 48.55 (a) Calculate the mean, median, MAD, MSD, and standard deviation for each variable. ? [Include all your steps and explain all the steps involved in details]...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT