Question


i. Use MS Excel Data Analysis ToolPak to perform a multiple regression analysis using Quality as the response variable and Helpfulness and Clarity as the explanatory variables. Write down the corresponding coefficient estimates and provide the regression output.

j. Perform an F-test for the overall usefulness of the model in part i) using a 5% significance level. Make sure you follow all the steps for hypothesis testing indicated in the Instructions section and clearly state your conclusion.

k. Test manually if the Clarity variable is significant in the model in part i). Make sure you follow all the steps for hypothesis testing indicated in the Instructions section and clearly state your conclusion.

l. Using the adjusted R² criterion, does including Clarity as an additional predictor variable improve the model in part i)? Explain why it is better to use the adjusted R² over the R² to determine if the addition of this new variable improves the model.

Regression Statistics

| Statistic | Value |
|---|---|
| Multiple R | 0.998544859 |
| R Square | 0.997091836 |
| Adjusted R Square | 0.997075813 |
| Standard Error | 0.045288017 |
| Observations | 366 |

ANOVA

| | df | SS | MS | F | Significance F |
|---|---|---|---|---|---|
| Regression | 2 | 255.2639136 | 127.6319568 | 62229.00058 | 0 |
| Residual | 363 | 0.744514614 | 0.002051004 | | |
| Total | 365 | 256.0084282 | | | |

Coefficients

| | Coefficients | Standard Error | t Stat | P-value | Lower 95% | Upper 95% | Lower 95.0% | Upper 95.0% |
|---|---|---|---|---|---|---|---|---|
| Intercept | -0.020353502 | 0.010520921 | -1.934574223 | 0.0538193 | -0.04104311 | 0.000336106 | -0.04104311 | 0.000336106 |
| helpfulness | 0.538358378 | 0.007216008 | 74.60611907 | 2.8925E-222 | 0.524167949 | 0.552548808 | 0.524167949 | 0.552548808 |
| clarity | 0.465505241 | 0.00707634 | 65.78333849 | 8.6445E-204 | 0.451589474 | 0.479421009 | 0.451589474 | 0.479421009 |

Solutions

Expert Solution

## Q i) Use MS Excel Data Analysis ToolPak to perform a multiple regression analysis using Quality as the response variable and Helpfulness and Clarity as the explanatory variables. Write down the corresponding coefficient estimates and provide the regression output.

Let y = Quality, x1 = Helpfulness and x2 = Clarity. The model is

Quality = β0 + (β1 * x1) + (β2 * x2)

From the regression output, the coefficient for the intercept is -0.020353502,

the coefficient for Helpfulness is 0.538358378 and the coefficient for Clarity is 0.465505241, so the fitted equation is

Quality = -0.020353502 + (0.538358378 * x1) + (0.465505241 * x2)

Here the intercept is -0.020353502.

Slope 1 is 0.538358378: it is positive, so with x2 held fixed, a one-unit increase in x1 raises the predicted y by about 0.538.

Slope 2 is 0.465505241: it is positive, so with x1 held fixed, a one-unit increase in x2 raises the predicted y by about 0.466.
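As a quick illustration of how the fitted equation is used, the sketch below plugs ratings into it. The coefficients are copied from the Excel output; the function name `predict_quality` and the input ratings 4 and 3 are assumptions made up for the example, not values from the data set.

```python
# Coefficient estimates copied from the Excel regression output.
INTERCEPT = -0.020353502
B_HELPFULNESS = 0.538358378
B_CLARITY = 0.465505241


def predict_quality(helpfulness: float, clarity: float) -> float:
    """Fitted value of Quality for given Helpfulness and Clarity ratings."""
    return INTERCEPT + B_HELPFULNESS * helpfulness + B_CLARITY * clarity


# Hypothetical ratings, chosen only for illustration (not from the data set).
example = predict_quality(helpfulness=4.0, clarity=3.0)

# Raising Helpfulness by one unit with Clarity fixed changes the
# prediction by exactly the Helpfulness slope.
delta = predict_quality(5.0, 3.0) - predict_quality(4.0, 3.0)

print(f"predicted Quality = {example:.4f}, change per unit of Helpfulness = {delta:.4f}")
```

The second calculation mirrors the slope interpretation: the difference between the two predictions equals the Helpfulness coefficient.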

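## Q j) Perform an F-test for the overall usefulness of the model in part i) using a 5% significance level. Make sure you follow all the steps for hypothesis testing indicated in the Instructions section and clearly state your conclusion.

Step 1) Hypotheses: H0: β1 = β2 = 0 (the overall regression model is not significant) vs H1: at least one of β1, β2 is not 0 (the overall model is significant). The significance level is α = 0.05.

Step 2) Test statistic: F = MS Regression / MS Residual = 127.6319568 / 0.002051004 = 62229.01408, with 2 and 363 degrees of freedom. Excel reports F = 62229.00058; the small difference comes from rounding in the displayed MS values.

Step 3) p-value: Excel reports Significance F = 0, that is, the p-value is too small to display.

Step 4) Decision: using the p-value approach, we reject H0 if the p-value is less than α. Here the p-value is less than 0.05, so we reject H0.

Step 5) Conclusion: at the 5% significance level, there is enough evidence to conclude that the overall model is significant.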
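The F-test can also be checked by hand from the ANOVA table. The sketch below is one way to do it with only the Python standard library. It relies on a closed form for the F distribution that holds only when the numerator degrees of freedom equal 2, as they do here; the variable names are illustrative.

```python
import math

# ANOVA entries copied from the Excel output.
ss_regression, df_regression = 255.2639136, 2
ss_residual, df_residual = 0.744514614, 363
alpha = 0.05

ms_regression = ss_regression / df_regression
ms_residual = ss_residual / df_residual
f_stat = ms_regression / ms_residual

# Assumption: the shortcut below is valid only for numerator df = 2, where
#   P(F > f) = (1 + 2 * f / df2) ** (-df2 / 2).
assert df_regression == 2

# Critical value: solve alpha = (1 + 2 * f / df2) ** (-df2 / 2) for f.
f_crit = (df_residual / 2) * (alpha ** (-2 / df_residual) - 1)

# The p-value underflows ordinary floating point, so work with its base-10 log.
log10_p = -(df_residual / 2) * math.log10(1 + 2 * f_stat / df_residual)

reject_h0 = f_stat > f_crit
print(f"F = {f_stat:.3f}, critical F = {f_crit:.3f}, log10(p-value) = {log10_p:.1f}")
print("Reject H0" if reject_h0 else "Fail to reject H0")
```

Working on the log scale shows why Excel displays Significance F as 0: the p-value is far smaller than the smallest number a spreadsheet cell can represent.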

## Q k) Test manually if the Clarity variable is significant in the model in part i). Make sure you follow all the steps for hypothesis testing indicated in the Instructions section and clearly state your conclusion.

## Test for helpfulness: coefficient of x1

Step 1) Hypotheses: H0: β1 = 0 vs H1: β1 ≠ 0, at α = 0.05.

Step 2) Test statistic: t = coefficient / standard error = 0.538358378 / 0.007216008 = 74.60611907, with n - k - 1 = 363 degrees of freedom.

Step 3) p-value = 2.8925E-222, which is effectively 0.

Step 4) Decision: using the p-value approach, we reject H0 if the p-value is less than α. Here the p-value is less than 0.05, so we reject H0.

Step 5) Conclusion: at the 5% significance level, there is enough evidence to conclude that the coefficient of x1 is significant.

## Test for clarity: coefficient of x2

Step 1) Hypotheses: H0: β2 = 0 vs H1: β2 ≠ 0, at α = 0.05.

Step 2) Test statistic: t = coefficient / standard error = 0.465505241 / 0.00707634 = 65.78333849, with n - k - 1 = 363 degrees of freedom.

Step 3) p-value = 8.6445E-204, which is effectively 0.

Step 4) Decision: using the p-value approach, we reject H0 if the p-value is less than α. Here the p-value is less than 0.05, so we reject H0.

Step 5) Conclusion: at the 5% significance level, there is enough evidence to conclude that the coefficient of x2 is significant, so Clarity is a significant predictor of Quality.
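A manual version of the Clarity test can be sketched from the coefficient table alone. The t statistic is the coefficient divided by its standard error. The critical value is recovered from Excel's own 95% confidence interval, on the assumption that Excel builds the interval as coefficient ± critical t × standard error. This avoids needing a t-table or a statistics library.

```python
# Clarity row of the Excel coefficient table.
coef = 0.465505241
std_error = 0.00707634
upper_95 = 0.479421009

# Manual test statistic for H0: beta2 = 0.
t_stat = coef / std_error

# Assumption: the 95% interval is coef +/- t_crit * std_error, so the
# two-sided 5% critical value (363 df) is the half-width over the SE.
t_crit = (upper_95 - coef) / std_error

reject_h0 = abs(t_stat) > t_crit
print(f"t = {t_stat:.4f}, critical t = {t_crit:.4f}, reject H0: {reject_h0}")
```

The critical-value approach and the p-value approach lead to the same decision here, since |t| is far beyond the critical value.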

## Q l) Using the adjusted R² criterion, does including Clarity as an additional predictor variable improve the model in part i)? Explain why it is better to use the adjusted R² over the R² to determine if the addition of this new variable improves the model.

Answer: R² (the coefficient of determination) is 0.997091836 and adjusted R² is 0.997075813. Both are greater than 0.90,

so the model explains about 99.7% of the variation in Quality, which is a very good fit.

The adjusted R² is almost identical to R², so the fit remains very good after accounting for the number of predictors.

## Using the adjusted R² criterion, does including Clarity as an additional predictor variable improve the model in part i)?

Yes, including Clarity as an additional predictor variable improves the model in part i). Adjusted R² rises when a single predictor is added exactly when the absolute value of that predictor's t statistic exceeds 1. For Clarity, t = 65.78, so the adjusted R² of 0.997075813 is higher than it would be with Helpfulness alone.

The Clarity coefficient is also significant, so Clarity should be included in the model.

## Explain why it is better to use the adjusted R² over the R² to determine if the addition of this new variable improves the model.

R² never decreases when a predictor is added, even if the predictor is unrelated to the response, so it cannot show whether a new variable is worth including. Adjusted R² compensates for the number of variables: adjusted R² = 1 - (1 - R²)(n - 1)/(n - k - 1), where n is the number of observations and k is the number of predictors. It increases only if the new predictor improves the model by more than the degree of freedom it uses up, and it decreases otherwise.
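The adjustment can be reproduced from the Excel output. The sketch below recomputes R² from the sums of squares and then applies the usual formula, adjusted R² = 1 - (1 - R²)(n - 1)/(n - k - 1), with n = 366 observations and k = 2 predictors. The function name is illustrative. The last calculation is a hypothetical: it shows the penalty a third predictor would incur if it added nothing to R².

```python
# Values copied from the Excel output.
ss_regression = 255.2639136
ss_total = 256.0084282
n_obs = 366        # observations
n_predictors = 2   # Helpfulness and Clarity

r_squared = ss_regression / ss_total


def adjusted_r_squared(r2: float, n: int, k: int) -> float:
    """Adjusted R^2 = 1 - (1 - R^2) * (n - 1) / (n - k - 1)."""
    return 1 - (1 - r2) * (n - 1) / (n - k - 1)


adj_r2 = adjusted_r_squared(r_squared, n_obs, n_predictors)

# Hypothetical: one more predictor that leaves R^2 unchanged lowers adjusted R^2.
adj_r2_extra = adjusted_r_squared(r_squared, n_obs, n_predictors + 1)

print(f"R^2 = {r_squared:.9f}, adjusted R^2 = {adj_r2:.9f}")
print(f"adjusted R^2 with a useless extra predictor = {adj_r2_extra:.9f}")
```

R² stays the same in the hypothetical case while adjusted R² falls, which is why adjusted R² is the better guide when deciding whether to add a variable.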

