Question

The variables collected for this sample are average starting salary upon graduation ($), the percentage of applicants to the full-time program who were accepted, the average GMAT test score of students entering the program, program per-year tuition ($), and percent of students with job offers at time of graduation.

  1. Build a simple linear regression model with each of the predictors and the dependent variable. Provide the R^2 for each simple linear regression model. Which predictor has the highest correlation with the dependent variable?
University  Tuition per year ($)  Average GMAT score  Acceptance Rate (%)  Graduates employed at graduation (%)  Average starting salary and bonus ($)
1           61875                 732                 7.1                  73.6                                  142834
2           58875                 726                 11.0                 76.9                                  144750
3           62424                 728                 20.7                 84.3                                  142574
4           61520                 724                 23.5                 87.4                                  137615
5           63454                 713                 13.8                 79.6                                  142936
6           61596                 713                 23.2                 80.7                                  136357
7           63148                 716                 18.2                 75.7                                  139006
8           61605                 716                 22.1                 83.8                                  142489
9           60744                 721                 18.1                 74.9                                  135933
10          58000                 690                 25.1                 81.8                                  137154
11          58975                 719                 23.7                 69.4                                  126871
12          58192                 692                 30.0                 79.6                                  132316
13          51500                 699                 26.7                 84.5                                  111974
14          58300                 687                 31.2                 75.5                                  131865
15          48100                 678                 30.7                 81.7                                  128347
16          52200                 691                 47.4                 70.3                                  118938
17          46800                 686                 34.8                 71.7                                  115694
18          51786                 684                 31.6                 64.4                                  114129
19          47950                 688                 41.3                 77.1                                  113830

Solutions

Expert Solution

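# Read the copied table from the clipboard (availability of "clipboard" depends on the platform).
# The models below assume single-word column headers: TuitionPerYear, AverageGMATScore,
# AcceptanceRate, GraduatesEmployedAtGraduation, AverageStartingSalaryAndBonus.
data = read.table("clipboard", header = TRUE)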
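
If copying the table to the clipboard is inconvenient, the same data frame can be built directly in code. This is only a sketch: it assumes the single-word column names that the models in this solution refer to, and it leaves out the University index column because no model uses it.

```r
# Sketch: build the data frame in code instead of reading the clipboard.
# Column names are an assumption taken from the model formulas in this solution.
data <- data.frame(
  TuitionPerYear = c(61875, 58875, 62424, 61520, 63454, 61596, 63148, 61605,
                     60744, 58000, 58975, 58192, 51500, 58300, 48100, 52200,
                     46800, 51786, 47950),
  AverageGMATScore = c(732, 726, 728, 724, 713, 713, 716, 716, 721, 690,
                       719, 692, 699, 687, 678, 691, 686, 684, 688),
  AcceptanceRate = c(7.1, 11.0, 20.7, 23.5, 13.8, 23.2, 18.2, 22.1, 18.1, 25.1,
                     23.7, 30.0, 26.7, 31.2, 30.7, 47.4, 34.8, 31.6, 41.3),
  GraduatesEmployedAtGraduation = c(73.6, 76.9, 84.3, 87.4, 79.6, 80.7, 75.7,
                                    83.8, 74.9, 81.8, 69.4, 79.6, 84.5, 75.5,
                                    81.7, 70.3, 71.7, 64.4, 77.1),
  AverageStartingSalaryAndBonus = c(142834, 144750, 142574, 137615, 142936,
                                    136357, 139006, 142489, 135933, 137154,
                                    126871, 132316, 111974, 131865, 128347,
                                    118938, 115694, 114129, 113830)
)

# Quick structural check: 19 universities, 5 variables
str(data)
```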

# Model 1
m1 = lm(data$AverageStartingSalaryAndBonus ~ data$TuitionPerYear)
summary(m1)

##
## Call:
## lm(formula = data$AverageStartingSalaryAndBonus ~ data$TuitionPerYear)
##
## Residuals:
##    Min     1Q Median     3Q    Max
## -9444 -2634 -1220   2914 12839
##
## Coefficients:
##                      Estimate Std. Error t value Pr(>|t|)   
## (Intercept)         31900.800 14023.174   2.275   0.0362 *
## data$TuitionPerYear     1.738      0.244   7.124 1.71e-06 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 5803 on 17 degrees of freedom
## Multiple R-squared: 0.7491, Adjusted R-squared: 0.7343
## F-statistic: 50.75 on 1 and 17 DF, p-value: 1.705e-06

R2_m1 = summary(m1)$r.squared;R2_m1

## [1] 0.7490727

# In simple linear regression |r| = sqrt(R^2); the slope is positive here, so r is positive
corr_m1 = sqrt(R2_m1);corr_m1

## [1] 0.8654899
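
The square-root step relies on the fact that, with a single predictor, R^2 equals the squared Pearson correlation between predictor and response. A minimal sketch of that check, using only the tuition and salary columns from the table (the vector names `tuition` and `salary` are illustrative, not part of the original solution):

```r
# Sketch: confirm that R^2 of a one-predictor model equals cor(x, y)^2.
tuition <- c(61875, 58875, 62424, 61520, 63454, 61596, 63148, 61605, 60744,
             58000, 58975, 58192, 51500, 58300, 48100, 52200, 46800, 51786,
             47950)
salary <- c(142834, 144750, 142574, 137615, 142936, 136357, 139006, 142489,
            135933, 137154, 126871, 132316, 111974, 131865, 128347, 118938,
            115694, 114129, 113830)

fit <- lm(salary ~ tuition)
r2  <- summary(fit)$r.squared   # R^2 reported by the regression
r   <- cor(tuition, salary)     # Pearson correlation, computed directly

# TRUE when the two quantities agree up to floating-point tolerance
isTRUE(all.equal(r^2, r2))
```

Calling `cor()` directly has the advantage of returning the sign of the correlation as well as its magnitude.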

# Model 2
m2 = lm(data$AverageStartingSalaryAndBonus ~ data$AverageGMATScore)
summary(m2)

##
## Call:
## lm(formula = data$AverageStartingSalaryAndBonus ~ data$AverageGMATScore)
##
## Residuals:
##    Min     1Q Median     3Q    Max
## -16489 -6429   1081   6693 12735
##
## Coefficients:
##                        Estimate Std. Error t value Pr(>|t|)   
## (Intercept)           -185599.4    76932.3 -2.413 0.027424 *
## data$AverageGMATScore     449.3      109.0   4.121 0.000713 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 8194 on 17 degrees of freedom
## Multiple R-squared: 0.4998, Adjusted R-squared: 0.4703
## F-statistic: 16.98 on 1 and 17 DF, p-value: 0.0007135

R2_m2 = summary(m2)$r.squared;R2_m2

## [1] 0.4997507

corr_m2 = sqrt(R2_m2);corr_m2

## [1] 0.7069305

# Model 3
m3 = lm(data$AverageStartingSalaryAndBonus ~ data$AcceptanceRate)
summary(m3)

##
## Call:
## lm(formula = data$AverageStartingSalaryAndBonus ~ data$AcceptanceRate)
##
## Residuals:
##    Min     1Q Median     3Q    Max
## -18110 -3967   1423   5404   8329
##
## Coefficients:
##                     Estimate Std. Error t value Pr(>|t|)   
## (Intercept)         153739.4     4645.6   33.09 < 2e-16 ***
## data$AcceptanceRate   -886.0      171.7   -5.16 7.85e-05 ***
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 7232 on 17 degrees of freedom
## Multiple R-squared: 0.6103, Adjusted R-squared: 0.5874
## F-statistic: 26.63 on 1 and 17 DF, p-value: 7.848e-05

R2_m3 = summary(m3)$r.squared;R2_m3

## [1] 0.6103376

# sqrt(R^2) gives only |r|; the slope is negative, so the correlation is negative
corr_m3 = -sqrt(R2_m3);corr_m3

## [1] -0.7812411
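
Because sqrt(R^2) is never negative, it recovers only the magnitude of the correlation. The estimated slope for acceptance rate is negative, so the correlation with salary is negative as well. The sketch below shows two ways to get the signed value; the vector names `acceptance` and `salary` are illustrative assumptions.

```r
# Sketch: recover the signed correlation for a one-predictor model.
acceptance <- c(7.1, 11.0, 20.7, 23.5, 13.8, 23.2, 18.2, 22.1, 18.1, 25.1,
                23.7, 30.0, 26.7, 31.2, 30.7, 47.4, 34.8, 31.6, 41.3)
salary <- c(142834, 144750, 142574, 137615, 142936, 136357, 139006, 142489,
            135933, 137154, 126871, 132316, 111974, 131865, 128347, 118938,
            115694, 114129, 113830)

fit   <- lm(salary ~ acceptance)
slope <- unname(coef(fit)["acceptance"])

# Option 1: attach the sign of the slope to sqrt(R^2)
r_signed <- sign(slope) * sqrt(summary(fit)$r.squared)

# Option 2: compute the Pearson correlation directly
r_direct <- cor(acceptance, salary)

c(r_signed = r_signed, r_direct = r_direct)
```

When ranking predictors by strength of linear association, compare absolute values, since a strongly negative correlation is as informative as a strongly positive one.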

# Model 4
m4 = lm(data$AverageStartingSalaryAndBonus ~ data$GraduatesEmployedAtGraduation)
summary(m4)

##
## Call:
## lm(formula = data$AverageStartingSalaryAndBonus ~ data$GraduatesEmployedAtGraduation)
##
## Residuals:
##    Min     1Q Median     3Q    Max
## -24790 -6525   2086   6444 14529
##
## Coefficients:
##                                    Estimate Std. Error t value Pr(>|t|)
## (Intercept)                         71185.7    32437.4   2.195   0.0424 *
## data$GraduatesEmployedAtGraduation    776.1      417.3   1.860   0.0803 .
## ---
## Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
##
## Residual standard error: 10560 on 17 degrees of freedom
## Multiple R-squared: 0.1691, Adjusted R-squared: 0.1202
## F-statistic: 3.459 on 1 and 17 DF, p-value: 0.08029

R2_m4 = summary(m4)$r.squared;R2_m4

## [1] 0.1690816

corr_m4 = sqrt(R2_m4);corr_m4

## [1] 0.4111954

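# TuitionPerYear has the highest correlation with the response variable
# AverageStartingSalaryAndBonus (R^2 = 0.749, |r| = 0.865), followed by
# AcceptanceRate (R^2 = 0.610, |r| = 0.781, negative relationship),
# AverageGMATScore (R^2 = 0.500, |r| = 0.707) and
# GraduatesEmployedAtGraduation (R^2 = 0.169, |r| = 0.411).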
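
The four models differ only in the predictor, so the comparison can also be done in one loop. The following is a sketch rather than the original author's method: the data frame name `mba` and the `results` table are illustrative, and the column names are the ones assumed throughout this solution.

```r
# Sketch: fit one simple regression per predictor and rank them by R^2.
mba <- data.frame(
  TuitionPerYear = c(61875, 58875, 62424, 61520, 63454, 61596, 63148, 61605,
                     60744, 58000, 58975, 58192, 51500, 58300, 48100, 52200,
                     46800, 51786, 47950),
  AverageGMATScore = c(732, 726, 728, 724, 713, 713, 716, 716, 721, 690,
                       719, 692, 699, 687, 678, 691, 686, 684, 688),
  AcceptanceRate = c(7.1, 11.0, 20.7, 23.5, 13.8, 23.2, 18.2, 22.1, 18.1, 25.1,
                     23.7, 30.0, 26.7, 31.2, 30.7, 47.4, 34.8, 31.6, 41.3),
  GraduatesEmployedAtGraduation = c(73.6, 76.9, 84.3, 87.4, 79.6, 80.7, 75.7,
                                    83.8, 74.9, 81.8, 69.4, 79.6, 84.5, 75.5,
                                    81.7, 70.3, 71.7, 64.4, 77.1),
  AverageStartingSalaryAndBonus = c(142834, 144750, 142574, 137615, 142936,
                                    136357, 139006, 142489, 135933, 137154,
                                    126871, 132316, 111974, 131865, 128347,
                                    118938, 115694, 114129, 113830)
)

response   <- "AverageStartingSalaryAndBonus"
predictors <- setdiff(names(mba), response)

# One row per predictor: R^2 from lm() and the signed Pearson correlation
results <- do.call(rbind, lapply(predictors, function(p) {
  fit <- lm(reformulate(p, response = response), data = mba)
  data.frame(predictor   = p,
             r_squared   = summary(fit)$r.squared,
             correlation = cor(mba[[p]], mba[[response]]))
}))

# Strongest linear association first
results <- results[order(-results$r_squared), ]
print(results, row.names = FALSE)
```

Sorting by R^2 rather than by the raw correlation keeps a strongly negative predictor such as acceptance rate from being ranked last merely because of its sign.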

