Question

In: Math

This data talks about a solution of a chemical after sitting for a certain amount of...

This data talks about a solution of a chemical after sitting for a certain amount of time and provides the time it was left sitting.

Concentration of a chemical solution (y) Time after solution was made (x)
0.07 9
0.09 9
0.08 9
0.16 7
0.17 7
0.21 7
0.49 5
0.58 5
0.53 5
1.22 3
1.15 3
1.07 3
2.84 1
2.57 1
3.1 1
  1. Run regression analysis on this. Make sure to output the residuals and residual plots.
  2. Plot the predicted values versus the residual values in a scatter plot.
  3. What assumption is violated in this data?
  4. What solution can be applied here to fix the issues? Try your solution and see if it fixes the problems with your residuals.

Solutions

Expert Solution

a)

Regression Analysis: y versus x

The regression equation is
y = 2.58 - 0.324 x


Predictor Coef SE Coef T P
Constant 2.5753 0.2487 10.35 0.000
x -0.32400 0.04330 -7.48 0.000


S = 0.474314 R-Sq = 81.2% R-Sq(adj) = 79.7%


Analysis of Variance

Source DF SS MS F P
Regression 1 12.597 12.597 55.99 0.000
Residual Error 13 2.925 0.225
Total 14 15.522

b)

y   x   Residual   predicted values
0.07   9   0.410667   -0.34067
0.09   9   0.430667   -0.34067
0.08   9   0.420667   -0.34067
0.16   7   -0.147333   0.30733
0.17   7   -0.137333   0.30733
0.21   7   -0.097333   0.30733
0.49   5   -0.465333   0.95533
0.58   5   -0.375333   0.95533
0.53   5   -0.425333   0.95533
1.22   3   -0.383333   1.60333
1.15   3   -0.453333   1.60333
1.07   3   -0.533333   1.60333
2.84   1   0.588667   2.25133
2.57   1   0.318667   2.25133
3.10   1   0.848667   2.25133

The scatter plot of the predicted values versus the residual values is shown at the above figure.

c) From the scatter plot of the predicted values versus the residual values in b), we know that this violates the shapeless without a clear picture pattern, no obvious outliers, and be generally symmetrically distributed around the 0 line without particularly large residuals. Hence, we can conclude that the assumption of independent and identically distribution on residuals for the linear regression model is violated.

d) One of the solutions that can be applied here to fix the issues is that the log-transformation of the dependent variable Concentration of a chemical solution (y).

From the above scatter plot of residual VS fitted values, we know that it improves the randomness of the scatter plot and may satisfy the assumption of the normal distribution of the residual. Hence, it fixes the problem of residuals.


Related Solutions

Frey talks about the value of life being determined by the amount of enrichment one does...
Frey talks about the value of life being determined by the amount of enrichment one does in their lifespan. How is the value of life determined?
A lab is testing the amount of a certain active chemical compound in a particular drug...
A lab is testing the amount of a certain active chemical compound in a particular drug that has been recently developed. The manufacturer claims that the average amount of the chemical is 110 mg. It is known that the standard deviation in the amount of the chemical is 7 mg. A random sample of 21 batches of the new drug is tested and found to have a sample mean concentration of 104.5 mg of the active chemical. a)Calculate the 95%...
3. A certain chemical pollutant is in the Hudson River. After environmental efforts the average is...
3. A certain chemical pollutant is in the Hudson River. After environmental efforts the average is supposed to be ?=34 ???. We may assume that x follows a normal distribution with ?=6 ???. A random sample at 40 locations has a sample mean of 32.5 ppm. Use a 5% level of significance and test whether the mean amount of pollutant is less than 34 ppm? a) State the null hypothesis H and the alternate hypothesis H. b) What is the...
The end of the chapter talks about DNA repair, and the discussion for this topic will...
The end of the chapter talks about DNA repair, and the discussion for this topic will follow up on this idea further. You should do further research on some aspect of DNA repair. For example: how it works the different types why it's important how it can go wrong what results when it goes wrong its importance to cancer.. Do not use other solutions. I need originality responses as well as IN TEXT CITATIONS AND A WORKS CITED REFERENCE. Please...
The data show the time intervals after an eruption​ (to the next​ eruption) of a certain...
The data show the time intervals after an eruption​ (to the next​ eruption) of a certain geyser. Find the regression​ equation, letting the height of the current eruption be the explanatory variable​ (denoted by​ x). Then use this equation to determine the predicted length of the time interval after an eruption given that the current eruption has a height of 113feet. Height (ft), Interval after (min) 96 66 128 85 75 59 128 86 88 70 73 75 80 73...
The data show the time intervals after an eruption (to the next eruption) of a certain...
The data show the time intervals after an eruption (to the next eruption) of a certain geyser. Find the regression equation, letting the first variable be the independent (x) variable. Find the best predicted time of the interval after an eruption given that the current eruption has a height of 120 feet. Use a significance level of 0.05. Height (ft) Height (ft)   Interval after (min) 96   68 111   80 76   66 91   72 66   58 108   79 116   84 91  ...
The data show the time intervals after an eruption​ (to the next​ eruption) of a certain...
The data show the time intervals after an eruption​ (to the next​ eruption) of a certain geyser. Find the regression​ equation, letting the first variable be the independent​ (x) variable. Find the best predicted time of the interval after an eruption given that the current eruption has a height of 126 feet. Use a significance level of 0.05. Height (ft)   Interval after (min) 84   76 122   77 78   67 108   87 73   61 105   77 122   87 78   70 What...
The data show the time intervals after an eruption​ (to the next​ eruption) of a certain...
The data show the time intervals after an eruption​ (to the next​ eruption) of a certain geyser. Find the regression​ equation, letting the first variable be the independent​ (x) variable. Find the best predicted time of the interval after an eruption given that the current eruption has a height of 149 feet. Use a significance level of 0.05. Height​ (ft) 136 140 134 144 102 109 104 116 Interval after​ (min) 83 84 94 92 67 67 84 84. What...
The data show the time intervals after an eruption​ (to the next​ eruption) of a certain...
The data show the time intervals after an eruption​ (to the next​ eruption) of a certain geyser. Find the regression​ equation, letting the first variable be the independent​ (x) variable. Find the best predicted time of the interval after an eruption given that the current eruption has a height of 148 feet. Use a significance level of 0.05. Height (ft) Interval after (min) 136 83 140 84 134 94 144 92 102 67 109 67 104 84 116 84
The following data were obtained for the concentration vs. time for a certain chemical reaction. Values...
The following data were obtained for the concentration vs. time for a certain chemical reaction. Values were measured at 1.0 s intervals, beginning at 0.00 and ending at 20.0 s. Concentrations in mM are: 10.00, 6.91, 4.98, 4.32, 3.55, 3.21, 2.61 2.50, 2.22, 1.91, 1.80, 1.65, 1.52, 1.36 1.42, 1.23, 1.20, 1.13, 1.09, 1.00, 0.92 a) Plot concentration, c, vs. time, t, ln c vs. t, and 1/c vs. t. b) Decide whether the data best fit zero-order, first-order or...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT