Question

In: Statistics and Probability

This is an open ended question. Could someone run a bivarate and multivatrate regression analysis on...

This is an open ended question.

Could someone run a bivarate and multivatrate regression analysis on either Excel Data Analysis or Excel QM. PLEASE ATTACH A COPY OF THE ORIGINAL EXCEL DATA ANALYS OR EXCEL QM TO YOUR ANSWER, I.E. NO SCREEN SHOTS.

Any data can be used, but the data used in the bivarate must be expanded upon in the multivriate analysis. PLEASE USE REAL DATA AND CITE THE SOURCE OF THE DATA.

The analysis must include the following:

Evaluation using the F and t statistics

Scattergram analysis, residual chart analysis

Discussion and analysis of the slope, y intercept, and regression equation

Discussion of the hypothesis and conclusions based on your analysis.

Thank you so much!

Solutions

Expert Solution

The~following table shows, for each of 18 cinchona plants, the yield of dry bark (in oz.), the height (in inches) and the girth (in inches) at a height of 6" from the ground (Source: Fundamentals of Statistics, Volume 1: Gun, Gupta, Dasgupta)

Here Y=Yield of dry bark, X1= Height, X2=Girth at a height.

First we find the linear regression of Y on X1 using excel. The outputs are as follows:

From the above analysis, we see that correlation coefficient of Y and X1 is 0.7679 which implies Y and X1 are highly +vely correlated and the predicted Y=8.0952+2.4318X1.

Since R-square=0.5896 means this regression line explains 58.96% of total variation of data. We also see that Y-intercept is insignificant since the corresponding p-value=0.1619>0.05 and slope is significantly different from zero since its p-value=0.0002<0.05 and the regression equation is also significant since its p-value=0.0002<0.05.  

From the above scatter plot and fitted line, we observe that the fitting is not highly satisfactory since there is a significant difference between observed and fitted values and this fact is also shown by the following table by computing their residuals:

Since Girth at a height of 6' (in.) (=X2, say) also influences Y so if we add X2 and perform multiple regression analysis then we get followings, where Y=Yield of dry bark (Dependent variable), X1=Height (Independent variable 1) and X2=Girth at a height of 6" (in.):

Here we see that R square=0.7679 which is greater than R-square of bivariate analysis and adjusted R square is closed to R square hence the new variable has significant effect. Here the modl is:

Y=-0.6507+1.7098X1+4.3432X2

Here we see that Y-intercept i.e. constant term is insignifiacnt however regression coefficients corresponding to X1 and X2 are significant and multiple regression is also significant (see ANOVA table). This fact is also observed from the following plot. Hence we conclude that addition of new variable improves the prediction of Y.


Related Solutions

1. Explain the difference between a closed-ended question, an open-ended question, and a probing question. When...
1. Explain the difference between a closed-ended question, an open-ended question, and a probing question. When would you use each? 2. Explain the difference between a top-down and bottom- up interview approach. When would you use each approach? 3. Draw a use case diagram for the following dentist office system, but do not bother to identify the flow of events within each use case. Whenever new patients are seen for the first time, they complete a patient information form that...
Complete with detailed information:- Write an example of the following: •         Open ended question •         Close-ended ordered question...
Complete with detailed information:- Write an example of the following: •         Open ended question •         Close-ended ordered question - Likert scale - Semantic differential •         Close-ended unordered question •         Partially close-ended question •         Branching question with 3 parts
Run a linear regression using Excel’s Data Analysis regression tool. Construct the linear regression equation and...
Run a linear regression using Excel’s Data Analysis regression tool. Construct the linear regression equation and determine the predicted total sales value if the number of promotions is 6. Is there a significant relationship? Clearly explain your reasoning using the regression results. Number of Promotions Total Sales 3 2554 2 1746 11 2755 14 1935 15 2461 4 2727 5 2231 14 2791 12 2557 4 1897 2 2022 7 2673 11 2947 11 1573 14 2980
You run a regression analysis using Data Analysis to answer the following question: Is stock selling price a function of annual dividend?
SUMMARY OUTPUT Regression Statistics Multiple R 0.818616296 R Square 0.67013264 Adjusted R Square 0.658351663 Standard Error 9.16867179 Observations 30 ANOVA df SS MS F Significance F Regression 1 4781.80995 4781.80995 56.8826 3.2455E-08 Residual 28 2353.807187 84.06454239 Total 29 7135.617137 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 28.21496731 3.739591617 7.544932763 3.22E-08 20.55476114 35.87517349 Dividend 2.367177613 0.313863719 7.542055589 3.25E-08 1.724256931 3.010098296 c. You run a regression analysis using Data Analysis to answer the following question: Is stock selling...
a) Run a regression analysis on the following bivariate set of data with y as the...
a) Run a regression analysis on the following bivariate set of data with y as the response variable. x y 10.7 81.6 13.7 81.5 36.7 56.5 4 72.1 50.7 23.2 47.6 -4.8 37.3 31.9 24.3 75.2 21.5 59.3 17.2 54.6 23.6 75.5 22.2 60.8 29.3 51 14 63.4 0.2 102.7 30.7 48.2 10.3 74.8 26.5 48.2 23.1 87 Verify that the correlation is significant at an ?=0.05?=0.05. If the correlation is indeed significant, predict what value (on average) for the...
QUESTION A: A regression was run to determine if there is a relationship between hours of...
QUESTION A: A regression was run to determine if there is a relationship between hours of TV watched per day (x) and number of situps a person can do (y). The results of the regression were: y=a+bx a=26.695 b=-0.65 r2=0.531441 r=-0.729 Assume the correlation is significant (p-value < α), and use this to predict the number of situps a person who watches 13.5 hours of TV can do (to one decimal place) QUESTION PART B: Run a regression analysis on...
You run a regression analysis on a bivariate set of data (n=99). You obtain the regression...
You run a regression analysis on a bivariate set of data (n=99). You obtain the regression equation y=0.843x+6.762 with a correlation coefficient of r=0.954 which is significant at α=0.01 You want to predict what value (on average) for the explanatory variable will give you a value of 150 on the response variable. What is the predicted explanatory value? x = (Report answer accurate to one decimal place.) Here is a bivariate data set. Find the regression equation for the response...
Instructions: If you could open your own business or if you know of someone in your...
Instructions: If you could open your own business or if you know of someone in your family that has recently opened up a business, show me that you can begin to create your own marketing plan.  This is a draft so I do not expect perfection! Format it nicely on the page and bold or bullet point your headings. Simply show me that you know how to create a document like this and throughout the semester we may continue to build...
8. Run a regression analysis on the following bivariate set of data with y as the...
8. Run a regression analysis on the following bivariate set of data with y as the response variable. x y 27.2 68.2 28.1 66.7 28.7 64.6 30.2 66.4 33.7 69.5 31.8 68.3 30.4 67.8 28.6 65.5 32.5 69.4 34.8 67.9 33.3 67.1 28 66.1 - Find the correlation coefficient and report it accurate to three decimal places. r = - What proportion of the variation in y can be explained by the variation in the values of x? Report answer...
Run a regression analysis on the following bivariate set of data with y as the response...
Run a regression analysis on the following bivariate set of data with y as the response variable. x y 15.5 58.8 25.4 62.9 53.7 68.8 46.5 78.6 28.5 57.5 5.7 68.1 -0.4 67.8 43.8 87.7 23.1 64.9 31.3 81.3 48.2 80.1 15.9 71.1 1) Find the correlation coefficient and report it accurate to three decimal places. r = 2) What proportion of the variation in y can be explained by the variation in the values of x? Report answer as...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT