Question

In: Statistics and Probability

The following sample data contains the number of years of college and the current annual salary...

The following sample data contains the number of years of college and the current annual salary for a random sample of heavy equipment salespeople. A researcher believes that spending more years in college leads to higher annual income.

Annual Income

(In Thousands)

Years in College

20

2

23

2

25

3

35

4

28

3

18

1

37

4

30

3

40

4

39

4

a) Which variable is the dependent variable? Which is the independent variable?

b) What is the estimated regression equation.

c) Interpret the meaning of the regression coefficient of independent variable.

d) What is the value of intercept? How to interpret the meaning of the intercept?

e) What is the point estimate of the annual income of a salesperson with three years of college?

f) Test whether the number of years in college has a significant impact on income at α= .05. What are the hypotheses? How do you test the hypotheses? What is the conclusion?

g) What is the value of the correlation coefficient between dependent variable and independent variable? What is the implication of the correlation coefficient?

h) What is the R Square of this model? What does it mean? Use the information in ANOVA table to calculate R square.

Solutions

Expert Solution

a)

Dependent variable: Annual Income

Independent Variable: Years in College

b)

To find the regression, using Excel

So, the regression equation is

Y-hat = 7.9 + 7.2 X

c)

meaning of the regression coefficient of the independent variable

If we increase the year in college by 1 year, the change in annual income is 7.2 (in thousands)

d)

interpret the meaning of the intercept

Intercept = 7.9.

If the year in college is zero, the base annual income is 7.9 (in thousand)

e)

X = 3

Y-hat = 7.9 + 7.2*3 = 29.5

f)

Hypothesis

We see the p-value for beta1 from the output table

p-value = ~0, Since p-value is less than 0.05, we reject null and conclude that beta1 is significant

g)

correlation coefficient = 0.9499

The positive value indicates that the relationship is linear, positive and strong

h)

R2 = 0.9023

It means that 90.23% variability in Y is explained by X


Related Solutions

Suppose the following table was generated from the sample data of 2020 employees relating annual salary...
Suppose the following table was generated from the sample data of 2020 employees relating annual salary to years of education and gender. According to the results, is there a salary difference between men and women at the 0.010.01 level of significance? If yes, write the difference in salary in the space provided, rounded to two decimal places. Else, select "There is not enough evidence." Coefficients Standard Error t Stat P-Value Intercept −12231.082830−12231.082830 2488.3280932488.328093 −4.915382−4.915382 0.0001310.000131 Education 3836.8588903836.858890 148.193548148.193548 25.89086325.890863 0.0000000.000000...
uppose the following data were collected from a sample of 1515 CEOs relating annual salary to...
uppose the following data were collected from a sample of 1515 CEOs relating annual salary to years of experience and the economic sector their company belongs to. Use statistical software to find the following regression equation: SALARYi=b0+b1EXPERIENCEi+b2SERVICEi+b3INDUSTRIALi+eiSALARYi=b0+b1EXPERIENCEi+b2SERVICEi+b3INDUSTRIALi+ei. Is there enough evidence to support the claim that on average, CEOs in the service sector have lower salaries than CEOs in the financial sector at the 0.050.05 level of significance? If yes, write the regression equation in the spaces provided with answers...
Suppose the following data were collected from a sample of 1515 CEOs relating annual salary to...
Suppose the following data were collected from a sample of 1515 CEOs relating annual salary to years of experience and the economic sector their company belongs to. Use statistical software to find the following regression equation: SALARYi=b0+b1EXPERIENCEi+b2SERVICEi+b3INDUSTRIALi+eiSALARYi=b0+b1EXPERIENCEi+b2SERVICEi+b3INDUSTRIALi+ei. Is there enough evidence to support the claim that on average, CEOs in the service sector have lower salaries than CEOs in the financial sector at the 0.050.05 level of significance? If yes, write the regression equation in the spaces provided with answers...
Suppose the following table was generated from the sample data of 2020 employees relating annual salary...
Suppose the following table was generated from the sample data of 2020 employees relating annual salary to years of education and gender. According to the results, is there a salary difference between men and women at the 0.010.01 level of significance? If yes, write the difference in salary in the space provided, rounded to two decimal places. Else, select "There is not enough evidence." Coefficients Standard Error t Stat P-Value Intercept −6351.081919 2302.895692 −2.757868 0.013444 Education 3487.0714766 133.810154 26.059842 0.000000...
Suppose the following data were collected from a sample of 15 CEOs relating annual salary to...
Suppose the following data were collected from a sample of 15 CEOs relating annual salary to years of experience and the economic sector their company belongs to. Use statistical software to find the following regression equation: SALARYi=b0+b1EXPERIENCEi+b2SERVICEi+b3INDUSTRIALi+ei Is there enough evidence to support the claim that on average, CEOs in the service sector have lower salaries than CEOs in the financial sector at the 0.01 level of significance? If yes, write the regression equation in the spaces provided with answers...
Suppose the following data were collected from a sample of 15 CEOs relating annual salary to...
Suppose the following data were collected from a sample of 15 CEOs relating annual salary to years of experience and the economic sector their company belongs to. Use statistical software to find the following regression equation: SALARYi=b0+b1EXPERIENCEi+b2SERVICEi+b3INDUSTRIALi+eiSALARYi=b0+b1EXPERIENCEi+b2SERVICEi+b3INDUSTRIALi+ei. Is there enough evidence to support the claim that on average, CEOs in the service sector have lower salaries than CEOs in the financial sector at the 0.05 level of significance? If yes, write the regression equation in the spaces provided with answers...
The following table was generated from the sample data of 10 college students regarding the number...
The following table was generated from the sample data of 10 college students regarding the number of parking tickets the student receives in a semester, the student's age, and the student's GPA. The dependent variable is the student's GPA, the first independent variable (x1) is the number of parking tickets, and the second independent variable (x2) is the student's age. Intercept   2.120785   2.284291   0.928421   0.395800 Number of Parking Tickets   -0.337479   0.066991   -5.037686   0.003975 Student's Age   0.127003   0.116260   1.092403   0.324473 Step...
The data in the table below show years of teaching experience and annual salary in dollars...
The data in the table below show years of teaching experience and annual salary in dollars for 12 randomly selected mathematics instructors in Illinois colleges/ universities for the 2013-2014 school year. Is there a linear correlation between a mathematics instructor's years of experience and their annual salary? a. Draw the scatterplot for the variables. b. Describe the relationship, if any, shown by the scatterplot. c. Calculate r, the correlation coefficient. How does the correlation coefficient compare to the relationship you...
2. The following data represent Salary, years of experience, and rank (A, B, or C). Salary...
2. The following data represent Salary, years of experience, and rank (A, B, or C). Salary is the dependent variable. For class and the interaction effect between class and years of experience, do the following: a. Indicate a scheme (method) of coding “Rank”. How many variables are needed? b. Using the coding method given in Part a, give the data (the numbers) that you would input for the following cases including the interaction term between rank and experience: Absenteeism (Hrs/Yr)...
The family college data set contains a sample of 792 cases with two variables, teen and...
The family college data set contains a sample of 792 cases with two variables, teen and parents, and is summarized in Table below. The teen variable is either college or not, where the teenager is labeled as college if she went to college immediately after high school. The parent variable takes the value degree if at least one parent of the teenager completed a college degree. Parents Degree Parents No Degree Total Teen College 231 214 445 Teen Not college...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT