Question

In: Statistics and Probability

To investigate the relationship between yield of potatoes, y, and the level of fertilizer application, x,...

To investigate the relationship between yield of potatoes, y, and the level of fertilizer application, x, an experimenter divides a field into eight plots of equal size and applies differing amounts of fertilizer to each. The yield of potatoes (in pounds) and the fertilizer application (in pounds) are recorded for each plot. The data are as follows: x 1 1.5 2 2.5 3 3.5 4 4.5 y 25 31 27 28 36 35 32 34 Test at the 5% significance level if there is a linear correlation. Find the least squares regression equation. What is the best estimate for the yield of potatoes if 3.2 pounds of fertilizer is applied?

Solutions

Expert Solution

Solution: In order to test the above question, we construct our null and alternative hypotheses as:

H0: rho = 0 vs Ha: rho not equal to 0 where rho is the population correlation coefficient
The test statistic to answer the given question is T=r*sqrt(n-2)/sqrt(1 -(r*r)) where r is the sample correlation coefficient and n is the sample size, sqrt refers to the square root function.

We reject H0 iff|T(observed)| > t(alpha/2,(n-2)) where t(Alpha/2,(n-2)) is the upper alpha/2 point of a Student's t distribution with (n-2) degrees of freedom.

Here, the linear correlation coefficient r = 0.729 (rounded to 3 decimal places) where r is calculated by the formula

r = n*sum(x*y) - sum(x)*sum(y) / sqrt{(n*sum(x2) - sum(x)2)(n*sum(y2) - sum(y)2)}

The value of the test statistic is T(observed) = 27.11139 and t(alpha/2,(n-2)) = 1.963632

Thus, |T(observed)| > t(alpha/2,(n-2)).

Hence we reject H0 at a 5% level of significance and conclude on the basis of the given sample that there is a significant linear correlation between yield of potatoes, y, and the level of fertilizer application, x.

The least squares regression equation is y_hat = b0 + b1*x

where y_hat is the predicted value of the yield of potatoes and x is the given pound of fertilizer applied, b1 is the slope and b0 is the y-intercept.

Also,   and  

Thus, the least squares regression equation is y_hat = 24.452 + 2.381 x

The best estimate for the yield of potatoes if 3.2 pounds of fertilizer is applied is obtained by putting x = 3.2 in the obtained linear regression equation.

It is found to be 32.0712 pounds.


Related Solutions

A researcher is investigating the relationship between economic development (x) and level of religiosity (y) in...
A researcher is investigating the relationship between economic development (x) and level of religiosity (y) in ten countries. (The researcher has interval-level measurements for both variables.)The researcher theorizes that citizens of countries at the lower end of the development scale will profess higher levels of religiosity than will citizens of countries at the higher end of the development scale. As development increases, religiosity decreases. Draw and Label four sets of axes, like this one below: (A.) Is the researcher hypothesizing...
A farmer was interested in a relationship between the amount of fertilizer (x) and the number...
A farmer was interested in a relationship between the amount of fertilizer (x) and the number of bushels (y) of soybeans produced. The farmer conducted an experiment and obtained the following data. Hundreds of pounds per acre (x) Bushels per acre (y) 1.0 25 2.5 32 3.0 35 3.0 32 3.4 35 4.0 39 4.0 41 4.5 40 Draw a scatter plot. Do the sample data appear to indicate a linear relationship between the amount of fertilizer and the number...
. Dr. Chappel wishes to investigate the relationship between birthweight and the estriol level of pregnant...
. Dr. Chappel wishes to investigate the relationship between birthweight and the estriol level of pregnant women. A random sample of eight women were chosen in a Houston hospital and Dr. Chappel tabulated her data as follows. Estriol (mg/24 hr) 9 14 21 16 24 25 18 20 Birthweight (g/100) 25 27 30 32 28 36 29 38 Calculate Pearson’s correlation coefficient r. Calculate a t statistic that can be used to test whether the amount of exercise and blood...
An endocrinologist was interested in exploring the relationship between the level of a steroid (Y) and...
An endocrinologist was interested in exploring the relationship between the level of a steroid (Y) and age (X) in healthy subjects whose ages ranged from 8 to 25 years. She collected a sample of 27 healthy subject in this age range. The data is located in the file problem01.txt, where the first column represents X = age and the second column represents Y = steroid level. For all R programming, print input and output codes and values. (a) Read the...
Kohl’s wishes to investigate the relationship between level of advertising, coupon value and sales. Kohl’s advertises...
Kohl’s wishes to investigate the relationship between level of advertising, coupon value and sales. Kohl’s advertises several times a month and includes a coupon (in dollars) with each advertisement. The value of the coupon remains the same in a month but varies from month to month. Kohl’s expects that the number advertisements and coupon value have a positive impact on sales. In addition, Kohl’s expects that the impact of advertisements increases as the coupon value increases. Kohl’s collects the sales...
Find if there is a relationship between education (in years) X and income Y. x 4...
Find if there is a relationship between education (in years) X and income Y. x 4 6 8 11 12 14 16 17 20 y 6000 12000 14000 10000 17000 16000 13000 16000 19000 Make sure that THREE of your posts for the week are Statistical in nature AND a direct response to the problems given in the discussion
Calculate the covariance between variables X and Y. Is it a positive or negative relationship between...
Calculate the covariance between variables X and Y. Is it a positive or negative relationship between the two variables? b. Calculate correlation coefficient between X and Y. Is it a positive or negative relationship? Is it a strong linear, weak linear or nonlinear relationship between X and Y? c. Use the Y data to calculate mean, range, standard deviation and variance. d. Use the first Y value to calculate the Z-score. Is it an outlier? e. Calculate the 60th percentile...
The conclusion listed below is based on a relationship between X and Y that is completely...
The conclusion listed below is based on a relationship between X and Y that is completely spurious. Do the following: (i) Define and explain what spurious relationship means? (ii) Think up a plausible variable, Z, that defines a compositional difference across the values of X. (iii) Describe how Z creates the relationship between X and Y. Students who smoke (X) earn lower grades (Y) than students who do not smoke. Conclusion: Smoking causes poor grades.
A scientist is studying the relationship between x = inches of annual rainfall and y =...
A scientist is studying the relationship between x = inches of annual rainfall and y = inches of shoreline erosion. One study reported the following data. Use the following information to solve the problem by hand, then use SPSS output to verify your answers. . X         30        25        90        60        50        35       75        110      45        80 Y         0.3       0.2       5.0       3.0       2.0       0.5       4.0       6.0       1.5       4.0 a. What is the equation of the estimated regression line? = ______________ b....
2. Theory gives you the following relationship between variables x and y, y = β0 +...
2. Theory gives you the following relationship between variables x and y, y = β0 + β1x + u. You collect a sample of data on n = 4 sample members. The data are : {x1, y1} = {3, 8},{x2, y2} = {2, 7},{x3, y3} = {1, 6},{x4, y4} = {3, 4} a. State the minimization problem that you need to derive the OLS estimators b. Estimate the relationship between x and y using this sample. What is your estimate...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT