Question

In: Statistics and Probability

On the last page of this file you will find the Excel output for a multiple...

On the last page of this file you will find the Excel output for a multiple linear regression model. The model was built in an attempt to better understand why students at area high schools perform differently on the state high school mathematics test. The average test score for a class of students is what we are trying to predict. In our attempt to understand why these test scores differ, we use 3 independent variables: a rating (0-100) for the quality of the math degree obtained by the instructor, the age of the instructor, and the salary (in thousands) of the instructor.

  1. Estimate the average math score for a class of students whose instructor is 52 years old, earns $48,000, and got her degree in a math program rated 72.
  2. What percentage of the variations in math scores can be explained by this model?
  3. Conduct a test to determine if the model, taken as a whole, provided us with any significant explanation of the differences in math scores. That is, should the model be retained for further analysis?
  4. Which of the independent variables appear to be significant to the model? Which appear to be insignificant? What leads you to these conclusions?

SUMMARY OUTPUT

Regression Statistics

Multiple R

0.597512233

R Square

0.357020869

Adjusted R Square

0.303439274

Standard Error

7.724526046

Observations

40

ANOVA

df

SS

MS

F

Significance F

Regression

3

1192.732105

397.5774

6.663125

0.001076925

Residual

36

2148.058895

59.6683

Total

39

3340.791

Coefficients

Standard Error

t Stat

P-value

Lower 95%

Intercept

35.67761801

7.278849159

4.901547

2.03E-05

20.9154278

Math Degree

0.247481581

0.069845662

3.543263

0.001115

0.105828014

Age

0.244830604

0.185213036

1.321886

0.194545

-0.130798841

Income

0.133296712

0.152818937

0.872253

0.388851

-0.176634456

Solutions

Expert Solution

We are already given the summary of regression and ANOVA output of the model.

Here, the average test score for a class is the response variable

And quality rating for the maths degree of instructor, age and the salary of instructor.

The estimated regression line is given as,

Avg test score = 35.67761801 + 0.247481581 Math degree rating + 0.244830604 Age + 0.133296712 Income

To estimate the average math score for a class of students whose instructor is 52 years old, earns $48,000, and got her degree in a math program rated 72.

Put Math degree rating = 72, Age = 52, Income = 48000

Avg test score = 35.67761801 + 0.247481581*72 + 0.244830604*52 + 0.133296712*48000 = 72.6257

The estimated average math score for a class of students whose instructor is 52 years old, earns $48,000, and got her degree in a math program rated 72 is 72.6257.

We are given the value of adjusted R-squared = 0.303439274

We know that 100R2 % of the total variation is explained by the model.

Hence, 30.344% of the variations in math scores can be explained by the model.

To test: H0: The regression model is not significant.

H1: The regression model is significant.

Test statistic:

Under H0 test statistic follows F-distribution with (3,36) degrees of freedom.

According to the ANOVA output, F = 6.663125

And P-value for the test is 0.001076925.

For 0.01 level of significance, P-value < 0.01

Hence, we reject H0 at 0.01 level of significance.

Hence, the regression model is significant. That is the model should be retained for further analysis.

Let us test if the ith regression coefficient is significant or not.

To test: H0: =0 versus H1: 0

Test statistic:

Under H0 the test statistic follows t distribution with (n-k-1)=36 degrees of freedom.

If the p-value< 0.01 then, we reject H0 at 0.01 level of significance.

Here, the p-value for math degree is less than 0.01, hence we say that math degree is significant in the model.

But the p-values for Age and salary are greater than 0.01, hence they are insignificant in the model.

Quality of the math degree appear to be significant to the model.

Age and income of instructor appears to be insignificant.

I hope you find the solution helpful. If you have any doubt then feel free to ask in the comment section.

Please do not forget to vote the answer. Thank you in advance!!!


Related Solutions

For CB output, you should paste the forecast output in your Excel calculations file. Show the...
For CB output, you should paste the forecast output in your Excel calculations file. Show the split view in all CB output. 1.   Two investments (A and B, below) have been proposed to the Capital Investment committee of your organization; a.      The required rate of return for your company is 15%. What is the NPV for each investment? Assume the initial investments ($150k and $50k) occur at the beginning of the year and all other costs and benefits occur at...
The multiple regression model is estimated in Excel and part of the output is provided below....
The multiple regression model is estimated in Excel and part of the output is provided below. ANOVA df SS MS F Significance F Regression 3 3.39E+08 1.13E+08 1.327997 0.27152899 Residual 76 6.46E+09 85052151 Total 79 6.8E+09 Question 8 (1 point) Use the information from the ANOVA table to complete the following statement. To test the overall significance of this estimated regression model, the hypotheses would state there is    between attendance and the group of all explanatory variables, jointly. there is...
Assignment on Multiple Linear Regression The Excel file BankData shows the values of the following variables...
Assignment on Multiple Linear Regression The Excel file BankData shows the values of the following variables for randomly selected 93 employees of a bank. This real data set was used in a court lawsuit against discrimination. Let = starting monthly salary in dollars (SALARY), = years of schooling at the time of hire (EDUCAT), = number of months of previous work experience (EXPER), = number of months that the individual was hired (MONTHS), = dummy variable coded 1 for males...
Using the data in the Excel file Home Market Value, develop a multiple regression model for...
Using the data in the Excel file Home Market Value, develop a multiple regression model for estimating the market value as a function of house age and house size. Predict the value of a house that is 30 years old and has 1800 square feet, and also predict the value of a house that is 5 years old and has 2800 square feet. Conduct your analysis using the following Multiple Regression Model Building and Interpretation Rubric: Identify the dependent variable...
Answer the following questions in an Excel file. Each questions with multiple parts requires a separate...
Answer the following questions in an Excel file. Each questions with multiple parts requires a separate answer. Label your steps and show each answer (1, 2, 3, and 4) in a separate Excel tab. For problems that require a written answer, use a text box in Excel to record the text. Calculate the PV of $5,000 received 10 years from now compounded annually at discount rates of: 1% 4% 12% Calculate the FV of $5,000 invested for 20 years using...
For the data in the Excel file Education and Income, find 95% confidence intervals for the...
For the data in the Excel file Education and Income, find 95% confidence intervals for the mean annual income of males and the mean annual income of females. Can you conclude that the mean income of one group is larger than the other? Education and Income Gender Age Level of Education Gross Annual Income Female 40-60 Graduate Degree $75,000 Female 25-39 Bachelor's Degree $47,000 Male 40-60 High School/GED $40,000 Female 25-39 Some College $30,000 Female 25-39 Some College $60,000 Female...
. You must use Excel (submit either a pdf, word or Excel file only). . You...
. You must use Excel (submit either a pdf, word or Excel file only). . You must identify the 5 steps (you must address each in detail). Problem: Use the given data to complete a t-test using Excel. Question: Is there a difference in group means between the number of words spelled correctly for two groups of fourth graders? Group Assignment Score 1 3 1 4 1 10 2 14 2 7 2 8 2 10 2 15 2 9...
Find the last node of a linked list of n elements whose index is a multiple...
Find the last node of a linked list of n elements whose index is a multiple of k (counted from 0). For example, if the list is 12 → 75 → 37 → 99 → 12 → 38 → 99 → 60 ↓ and k = 4, then you should return the second 12 (with index 4). Your algorithm should take O(n) time and use O(1) extra space. Implement the following method in LinkedList.java. public T lastK(int k) LinkedList.java. public...
Answer the following problems in an Excel file. Please upload only one Excel file with all...
Answer the following problems in an Excel file. Please upload only one Excel file with all of your answers, including #3 (which requires an explanation rather than a calculation). All problems must be solved using the PV and FV functions in Excel. If I deposit $8,000 in a bank account that pays interest of 1.5%, compounded annually, how much will I have in the account after 10 years? If I deposit $8,000 in a bank account that pays simple interest...
IN JAVA!!! In this project, you will use radix.txt as the input file, and output the...
IN JAVA!!! In this project, you will use radix.txt as the input file, and output the integers SORTED USING RADIX SORT. You may assume all your input consists of integers <=9999. Your main program will input the integers and put them into a QUEUE. It will then pass this queue to a method called radixSort which will sort the numbers in the queue, passing the sorted queue back to main. The main program will then call another method to print...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT