Question

In: Statistics and Probability

Consider the following data between number of visitors x and the amount of wildlife seen in...

Consider the following data between number of visitors x and the amount of wildlife seen in Cheaha State Park y,

visitors wildlife
500 25
450 35
475 30
420 32
555 28
600 22
375 19


(a) Find the mean of x and the mean of y.

(b) Find the standard deviation of x and the standard deviation of y. Use Excel.

(c) Find the correlation coefficient. Use Excel. (d) Find the slope and intercept of the linear regression line. Then write down the line.

(e) Make predictions for all the values of x in the table. Then calculate the residuals.

(f) Calculate the sum of squared residuals. (g) Calculate the standard deviation of the regression.

(h) Using the mean of x and your data for x, calculate the sum of squared deviations.

(i) Write down an appropriate null and alternative hypothesis.

(j) Calculate the test-statistic.

(k) Make a prediction for when x is 470, and calculate (xp − ¯ x)2. Then make a confidence interval around your prediction.

Solutions

Expert Solution

Solution-: Let, x=Number of visitors and y= The amount of wildlife

Given data:

F Column G Column

x y
500 25
450 35
475 30
420 32
555 28
600 22
375 19

(a)

Mean (x) =482.14 ( By using MS-Excel function "=AVERAGE(F2:F8)" )

Mean (y) =27.29     ( By using MS-Excel function "=AVERAGE(G2:G8)" )

(b)

Var (x) = 5134.69 ( By using MS-Excel function "=VARP(F2:F8)" )

SD (x) = 76.66   ( By using MS-Excel function "SQRT()" )

Var (y) = 27.35 ( By using MS-Excel function "=VARP(G2:G8)" )

SD (y) = 5.23 ( By using MS-Excel function "SQRT()" )

(c)

r=Corr(x,y)=-0.11   ( By using MS-Excel function "=CORREL(F2:F8,G2:G8)" )

(d) Find the slope and intercept of the linear regression line by using Data Data Analysis Regresssion in MS-Excel

We get output

SUMMARY OUTPUT
Regression Statistics
Multiple R 0.108379
R Square 0.011746
Adjusted R Square -0.1859
Standard Error 6.151098
Observations 7
ANOVA
df SS MS F Significance F
Regression 1 2.248524 2.248524 0.059428 0.817089
Residual 5 189.18 37.83601
Total 6 191.4286
Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Lower 95.0% Upper 95.0%
Intercept 31.099 15.8149 1.966448 0.106406 -9.55432 71.75265 -9.55432 71.75265
x -0.008 0.032445 -0.24378 0.817089 -0.09131 0.075493 -0.09131 0.075493

From this output we get regression line,

y=31.099-0.008*x


Related Solutions

A farmer was interested in a relationship between the amount of fertilizer (x) and the number...
A farmer was interested in a relationship between the amount of fertilizer (x) and the number of bushels (y) of soybeans produced. The farmer conducted an experiment and obtained the following data. Hundreds of pounds per acre (x) Bushels per acre (y) 1.0 25 2.5 32 3.0 35 3.0 32 3.4 35 4.0 39 4.0 41 4.5 40 Draw a scatter plot. Do the sample data appear to indicate a linear relationship between the amount of fertilizer and the number...
Data Table Two below Data Table Two Year Quarter Number of Visitors(in thousands) 2005 Winter 117...
Data Table Two below Data Table Two Year Quarter Number of Visitors(in thousands) 2005 Winter 117 Spring 80.7 Summer 129.6 Fall 76.1 2006 Winter 118.6 Spring 82.5 Summer 121.4 Fall 77 2007 Winter 114 Spring 84.3 Summer 119.9 Fall 75 2008 Winter 120.7 Spring 79.6 Summer 130.7 Fall 69.6 2009 Winter 125.2 Spring 80.2 Summer 127.6 Fall 72 Please post the answer with the work performed in excel and not just the answer, need to show work as I don't...
Here is the data on the number of dog bites seen in the emergency department (ED)...
Here is the data on the number of dog bites seen in the emergency department (ED) each season for the past 4 years. Season 2016 2017 2018 2019 Winter 108 100 101 110 Spring 95 96 94 87 Summer 120 117 117 120 Fall 80 79 77 75 Apply the Least Square Regression model studied in class to forecast demand for the four seasons of the upcoming year (2020). Hint: Review the Excel spreadsheet we used in class to set...
Consider the following data for two variables, x and y.
  Consider the following data for two variables, x and y. x 22 24 26 30 35 40 y 11 20 33 34 39 36 (a) Develop an estimated regression equation for the data of the form ŷ = b0 + b1x. (Round b0 to one decimal place and b1 to three decimal places.)ŷ = −8.3+1.259x    (b) Use the results from part (a) to test for a significant relationship between x and y. Use α = 0.05. Find the...
Consider the following data for two variables, x and y.
Consider the following data for two variables, x and y. x   2 3 4 5 7 7 7 8 9 y 4 5 4 6 4 6 9 5 11 a. Does there appear to be a linear relationship between x and y? Explain.(f-test, to do f-test for the overall significance) b. Develop the estimated regression equation relating x and y. c. Plot the standardized residuals versus yˆ for the estimated regression equation developed in part (b). Do the model assumptions...
solve using r and Consider the data birds.txt. A wildlife ecologist measured X1 = Tail length...
solve using r and Consider the data birds.txt. A wildlife ecologist measured X1 = Tail length (in millimeters) and X2 = Wing length (in millimeters) for a sample of n = 45 female hook-billed kites. (a) Construct both Q-Q plots and histograms from the marginal distributions of tail length (X1) and wing length (X2). Do these data appear to be normally distributed? Discuss. (b) Is the bivariate normal distribution a viable population model? Discuss. (c) Consider the variable, wing length...
solve using r and Consider the data birds.txt. A wildlife ecologist measured X1 = Tail length...
solve using r and Consider the data birds.txt. A wildlife ecologist measured X1 = Tail length (in millimeters) and X2 = Wing length (in millimeters) for a sample of n = 45 female hook-billed kites. (a) Evaluate the sample mean vector x. (b) Evaluate the sample variance-covariance matrix, S. (c) Determine the eigenvalue and eigenvector pairs for S. (d) Evaluate the sample correlation matrix R. Interpret the sample correlation, r12. the data is Tail Wing 191 284 197 285 208...
Consider the following. X = The number of hours a child watches television on a given...
Consider the following. X = The number of hours a child watches television on a given day Give the range of values that the random variable X may assume. {0, 1, 2, 3, ...} {x | 0 ≤ x ≤ 24}     {1, 2, 3, ..., 24} {x | 0 < x < 24} {x | 1 ≤ x ≤ 24} Classify the random variable as one of the following. finite discrete infinite discrete     continuous
Given the following pairs of data where x is the number of years of experience in...
Given the following pairs of data where x is the number of years of experience in a certain company and y is the annual salary: Determine the equation of the linear regression line. Round the slope and y-intercept to the nearest integers. Using your answer in (a), estimate the starting salary for a new hire. Using your answer in (a), estimate the number of years (to the nearest year) an employee would have to work in the company in order...
Given the following data of temperature (x) and the number of times a cricket chirps in...
Given the following data of temperature (x) and the number of times a cricket chirps in a second (y), run regression analysis and state the regression equation as long as there is a statistically significant linear relationship between the variables. Cricket Chirps per Second Temperature (F) 20 88.6 16 71.6 19.8 93.3 18.4 84.3 17.1 80.6 15.5 75.2 14.7 69.7 17.1 82 15.4 69.4 16.2 83.3 15 79.6 17.2 82.6 16 80.6 17 83.5 14.4 76.3
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT