In: Statistics and Probability
A biologist looked at the relationship between number of seeds a plant produces and the percent of those seeds that sprout. The results of the survey are shown below. Seeds Produced 55 63 50 43 67 55 66 68 Sprout Percent 63.5 50.5 66 73.5 53.5 55.5 62 45 Find the correlation coefficient: r = Round to 2 decimal places. The null and alternative hypotheses for correlation are: H 0 : = 0 H 1 : ≠ 0 The p-value is: (Round to four decimal places) Use a level of significance of α = 0.05 to state the conclusion of the hypothesis test in the context of the study. There is statistically insignificant evidence to conclude that a plant that produces more seeds will have seeds with a lower sprout rate than a plant that produces fewer seeds. There is statistically insignificant evidence to conclude that there is a correlation between the number of seeds that a plant produces and the percent of the seeds that sprout. Thus, the use of the regression line is not appropriate. There is statistically significant evidence to conclude that there is a correlation between the number of seeds that a plant produces and the percent of the seeds that sprout. Thus, the regression line is useful. There is statistically significant evidence to conclude that a plant that produces more seeds will have seeds with a lower sprout rate than a plant that produces fewer seeds. r 2 = (Round to two decimal places) Interpret r 2 : There is a large variation in the percent of seeds that sprout, but if you only look at plants that produce a fixed number of seeds, this variation on average is reduced by 68%. There is a 68% chance that the regression line will be a good predictor for the percent of seeds that sprout based on the number of seeds produced. 68% of all plants produce seeds whose chance of sprouting is the average chance of sprouting. Given any group of plants that all produce the same number of seeds, 68% of all of these plants will produce seeds with the same chance of sprouting. The equation of the linear regression line is: ˆ y = + x (Please show your answers to two decimal places) Use the model to predict the percent of seeds that sprout if the plant produces 55 seeds. Percent sprouting = (Please round your answer to the nearest whole number.) Interpret the slope of the regression line in the context of the question: As x goes up, y goes down. The slope has no practical meaning since it makes no sense to look at the percent of the seeds that sprout since you cannot have a negative number. For every additional seed that a plant produces, the chance for each of the seeds to sprout tends to decrease by 0.84 percent. Interpret the y-intercept in the context of the question: If plant produces no seeds, then that plant's sprout rate will be 107.71. The y-intercept has no practical meaning for this study. The average sprouting percent is predicted to be 107.71. The best prediction for a plant that has 0 seeds is 107.71 percent.
a)
correlation coefficient r= | Sxy/(√Sxx*Syy) = | -0.82 |
null hypothesis: Ho: ρ | = | 0 | |
Alternate Hypothesis: Ha: ρ | ≠ | 0 |
test stat t= | r*(√(n-2)/(1-r2))= | -3.5583 |
P value = | 0.0120 |
There is statistically significant evidence to conclude that there is a correlation between the number of seeds that a plant produces and the percent of the seeds that sprout. Thus, the regression line is useful.
r 2 = 0.68
There is a large variation in the percent of seeds that sprout, but if you only look at plants that produce a fixed number of seeds, this variation on average is reduced by 68%.
The equation of the linear regression line is: ˆ y =107.71+(-0.84) x
predicted val=107.705+55*-0.84= | 62 |
For every additional seed that a plant produces, the chance for each of the seeds to sprout tends to decrease by 0.84 percent
The y-intercept has no practical meaning for this study.