Question

1. Consider the following data set. C1 is years of education, C2 is years of job experience, C3 is age, and C4 is annual salary.

a. Estimate the relationship:

C4 = a + b(C1) + c(C2) + d(C3)

b. Test the hypothesis that the entire model (C1, C2, and C3 combined) does not explain a significant amount of variation in the dependent variable at the 5% level of significance.

c. What fraction of the variation in annual salary is explained by education, experience, and age?

Row   Education (C1)   Experience (C2)   Age (C3)   Salary (C4)
1     10               20                45         55139
2     10               5                 23         48937
3     10               19                36         57624
4     11               15                50         58170
5     11               16                42         62202
6     11               8                 30         51646
7     11               4                 21         52563
8     12               10                34         49434
9     12               8                 27         55153
10    12               18                38         63882
11    13               6                 25         46067
12    13               10                46         60886
13    14               10                38         57190
14    14               2                 22         52094
15    15               8                 32         60620
16    16               5                 49         59843
17    16               4                 28         57288
18    17               7                 33         67151
19    18               3                 27         61313
20    19               3                 32         64175

Solutions

Expert Solution

(a) C4 = 22,228.5143 + 1,959.5483*C1 + 696.0572*C2 + 76.0178*C3
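The solution does not say which software produced these coefficients. As a rough sketch of how they could be checked, the ordinary least squares fit can be obtained by solving the normal equations (X'X)b = X'y in plain Python. The function names `solve_linear_system` and `fit_ols` are made up for this illustration, and the data are typed in from the table in the question.

```python
# Minimal OLS sketch (illustrative names, standard library only).
# Each tuple is (education, experience, age, salary) from the question's table.
DATA = [
    (10, 20, 45, 55139), (10, 5, 23, 48937), (10, 19, 36, 57624),
    (11, 15, 50, 58170), (11, 16, 42, 62202), (11, 8, 30, 51646),
    (11, 4, 21, 52563), (12, 10, 34, 49434), (12, 8, 27, 55153),
    (12, 18, 38, 63882), (13, 6, 25, 46067), (13, 10, 46, 60886),
    (14, 10, 38, 57190), (14, 2, 22, 52094), (15, 8, 32, 60620),
    (16, 5, 49, 59843), (16, 4, 28, 57288), (17, 7, 33, 67151),
    (18, 3, 27, 61313), (19, 3, 32, 64175),
]


def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Build the augmented matrix [a | b] so row operations touch both sides.
    m = [list(row) + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        # Swap in the row with the largest pivot for numerical stability.
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    # Back substitution on the upper-triangular system.
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        known = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - known) / m[i][i]
    return x


def fit_ols(rows, y):
    """Return [intercept, slope_1, ..., slope_k] from the normal equations."""
    design = [[1.0] + [float(v) for v in row] for row in rows]
    k = len(design[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(k)] for i in range(k)]
    xty = [sum(d[i] * yi for d, yi in zip(design, y)) for i in range(k)]
    return solve_linear_system(xtx, xty)


predictors = [row[:3] for row in DATA]
salary = [row[3] for row in DATA]
coefficients = fit_ols(predictors, salary)

for name, value in zip(["Intercept", "Education", "Experience", "Age"], coefficients):
    print(f"{name:<11}{value:>14.4f}")
```

The printed values can be compared with a, b, c, and d in the equation above. Solving the normal equations directly is fine for a small, well-behaved data set like this one; statistical packages usually rely on more stable matrix factorizations.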

(b) The hypothesis being tested is:

H0: β1 = β2 = β3 = 0

H1: At least one βi ≠ 0

The p-value is 0.0003.

Since the p-value (0.0003) is less than the significance level (0.05), we can reject the null hypothesis.

Therefore, we can conclude that the model is significant.
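The p-value comes from the overall F test. As a sketch of the arithmetic, the F statistic can be rebuilt from the sums of squares reported in the ANOVA table of the regression output. The critical value 3.24 for (3, 16) degrees of freedom at the 5% level is taken from a standard F table and is not part of the original solution.

```python
# Overall F test from the ANOVA sums of squares (values copied from the
# regression output in this solution).
ss_regression = 411_869_047.7521
ss_residual = 194_832_298.7979
n = 20  # observations
k = 3   # predictors: education, experience, age

df_regression = k
df_residual = n - k - 1

ms_regression = ss_regression / df_regression
ms_residual = ss_residual / df_residual
f_stat = ms_regression / ms_residual

# Assumption: tabulated 5% critical value of F with (3, 16) degrees of freedom.
F_CRITICAL_5PCT = 3.24

reject_h0 = f_stat > F_CRITICAL_5PCT
print(f"F = {f_stat:.2f}, reject H0 at 5%: {reject_h0}")
```

Comparing F with the critical value leads to the same decision as comparing the p-value with 0.05.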

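(c) R² = 0.679, so about 67.9% of the variation in annual salary is explained by education, experience, and age.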
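As a quick check of this figure, R² is the share of the total sum of squares that is not left in the residual. The sketch below uses the sums of squares from the ANOVA table in the regression output; the variable names are only illustrative.

```python
import math

# Sums of squares copied from the ANOVA table in this solution.
ss_residual = 194_832_298.7979
ss_total = 606_701_346.5500
n = 20  # observations
k = 3   # predictors

r_squared = 1 - ss_residual / ss_total
# Adjusted R² penalizes the fit for the number of predictors used.
adjusted_r_squared = 1 - (1 - r_squared) * (n - 1) / (n - k - 1)
multiple_r = math.sqrt(r_squared)
std_error = math.sqrt(ss_residual / (n - k - 1))

print(f"R^2          = {r_squared:.3f}")
print(f"Adjusted R^2 = {adjusted_r_squared:.3f}")
print(f"R            = {multiple_r:.3f}")
print(f"Std. Error   = {std_error:.3f}")
```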

Regression output

R²            0.679
Adjusted R²   0.619
R             0.824
Std. Error    3489.559
n             20
k             3
Dep. Var.     Salary

ANOVA table
Source       SS                 df   MS                 F       p-value
Regression   411,869,047.7521   3    137,289,682.5840   11.27   .0003
Residual     194,832,298.7979   16   12,177,018.6749
Total        606,701,346.5500   19

Coefficients and 95% confidence intervals
Variable     Coefficient   Std. Error   t (df=16)   p-value   95% lower    95% upper
Intercept    22,228.5143
Education    1,959.5483    409.9821     4.780       .0002     1,090.4251   2,828.6715
Experience   696.0572      256.1168     2.718       .0152     153.1138     1,239.0006
Age          76.0178       128.6253     0.591       .5628     -196.6556    348.6912
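
The t statistics and confidence limits in the coefficient rows follow directly from each coefficient and its standard error. A small sketch, assuming the two-sided 5% critical value of t with 16 degrees of freedom is about 2.1199 (a table value, not given in the solution):

```python
# Coefficient and standard error for each predictor, copied from the output above.
ESTIMATES = {
    "Education": (1959.5483, 409.9821),
    "Experience": (696.0572, 256.1168),
    "Age": (76.0178, 128.6253),
}

# Assumption: t critical value for a 95% interval with df = 16, from a t table.
T_CRITICAL = 2.1199

results = {}
for name, (coefficient, std_error) in ESTIMATES.items():
    t_stat = coefficient / std_error
    margin = T_CRITICAL * std_error
    results[name] = (t_stat, coefficient - margin, coefficient + margin)

for name, (t_stat, lower, upper) in results.items():
    print(f"{name:<11} t = {t_stat:6.3f}   95% CI = ({lower:10.4f}, {upper:10.4f})")
```

The interval for Age contains zero, which agrees with its large p-value: holding education and experience fixed, age adds little to the model.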
