Question

In: Statistics and Probability

We wish to predict the salary for baseball players (yy) using the variables RBI (x1x1) and...

We wish to predict the salary for baseball players (yy) using the variables RBI (x1x1) and HR (x2x2), then we use a regression equation of the form ˆy=b0+b1x1+b2x2y^=b0+b1x1+b2x2.

  • HR - Home runs - hits on which the batter successfully touched all four bases, without the contribution of a fielding error.
  • RBI - Run batted in - number of runners who scored due to a batters's action, except when batter grounded into double play or reached on an error
  • Salary is in millions of dollars.

The following is a chart of baseball players' salaries and statistics from 2016.

Player Name RBI's HR's Salary (in millions)
Miquel Cabrera 108 38 28.050
Yoenis Cespedes 86 31 27.500
Ryan Howard 59 25 25.000
Albert Pujols 119 31 25.000
Robinson Cano 103 39 24.050
Mark Teixeira 44 15 23.125
Joe Mauer 49 11 23.000
Hanley Ramirez 111 30 22.750
Justin Upton 87 31 22.125
Adrian Gonzalez 90 18 21.857
Jason Heyward 49 7 21.667
Jayson Werth 70 21 21.571
Matt Kemp 108 35 21.500
Jacoby Ellsbury 56 9 21.143
Chris Davis 84 38 21.119
Buster Posey 80 14 20.802
Shin-Soo Choo 17 7 20.000
Troy Tulowitzki 79 24 20.000
Ryan Braun 91 31 20.000
Joey Votto 97 29 20.000
Hunter Pence 57 13 18.500
Prince Fielder 44 8 18.000
Adrian Beltre 104 32 18.000
Victor Martinez 86 27 18.000
Carlos Gonzalez 100 25 17.454
Matt Holliday 62 20 17.000
Brian McCann 58 20 17.000
Mike Trout 100 29 16.083
David Ortiz 127 38 16.000
Adam Jones 83 29 16.000
Curtis Granderson 59 30 16.000
Colby Rasmus 54 15 15.800
Matt Wieters 66 17 15.800
J.D. Martinez 68 22 6.750
Brandon Crawford 84 12 6.000
Rajai Davis 48 12 5.950
Aaron Hill 38 10 12.000
Coco Crisp 55 13 11.000
Ben Zobrist 76 18 10.500
Justin Turner 90 27 5.100
Denard Span 53 11 5.000
Chris Iannetta 24 7 4.550
Leonys Martin 47 15 4.150
Justin Smoak 34 14 3.900
Jorge Soler 31 12 3.667
Evan Gattis 72 32 3.300
Logan Forsythe 52 20 2.750
Jean Segura 64 20 2.600

a) Use software to find the multiple linear regression equation. Enter the coefficients rounded to 4 decimal places.
ˆy=y^=  +  x1x1 +  x2x2  

b) Use the multiple linear regression equation to predict the salary for a baseball player with an RBI of 49 and HR of 22. Round your answer to 1 decimal place, do not convert numbers to dollars.
millions of dollars

c) Holding all other variables constant, what is the correct interpretation of the coefficient b1=0.111b1=0.111 in the multiple linear regression equation?

  • For each RBI, a baseball player's predicted sallary increases by 0.111 million dollars.
  • If the baseball player's salary increases by 0.111 million dollars, then the predicted RBI will increase by one.
  • If the baseball player's salary increases by 0.111 million dollars, then the predicted RBI will increase by 0.0371.
  • For each HR, a baseball player's predicted sallary increases by 0.111 million dollars.

d) Holding all other variables constant, what is the correct interpretation of the coefficient b2=0.0371b2=0.0371 in the multiple linear regression equation?

  • If the baseball player's salary increases by 0.0371 million dollars, then the predicted HR will increase by one.
  • For each RBI, a baseball player's predicted sallary increases by 0.0371 million dollars.
  • If the baseball player's salary increases by 0.0371 million dollars, then the predicted HR will increase by 0.111.
  • For each HR, a baseball player's predicted sallary increases by 0.0371 million dollars.

Solutions

Expert Solution

SUMMARY OUTPUT
Regression Statistics
Multiple R 0.423071424
R Square 0.17898943
Adjusted R Square 0.142500071
Standard Error 6.96454226
Observations 48
ANOVA
df SS MS F Significance F
Regression 2 475.856829 237.9284 4.90525 0.011826244
Residual 45 2182.7182 48.50485
Total 47 2658.575029
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
Intercept 7.061791068 2.964789136 2.381886 0.021513 1.090399221 13.0331829
RBI's 0.110975724 0.068915288 1.610321 0.114321 -0.027826791 0.24977824
HR's 0.037090238 0.186675426 0.198688 0.843401 -0.33889337 0.41307385

c) For each RBI, a baseball player's predicted sallary increases by 0.111 million dollars.

d) For each HR, a baseball player's predicted sallary increases by 0.0371 million dollars.


Related Solutions

We wish to predict the salary for baseball players (y) using the variables RBI (x1) and...
We wish to predict the salary for baseball players (y) using the variables RBI (x1) and HR (x2), then we use a regression equation of the form ˆy=b0+b1x1+b2x2. HR - Home runs - hits on which the batter successfully touched all four bases, without the contribution of a fielding error. RBI - Run batted in - number of runners who scored due to a batters' action, except when batter grounded into double play or reached on an error Salary is...
A Baseball coach was trying to predict the number of RBIs players hit based on the...
A Baseball coach was trying to predict the number of RBIs players hit based on the amount of time they spend in the batting cage. Assuming the slope (or change in DV for every change in the IV) is 5 and the Y intercept is 10, what will the predicted RBI total be for player 1 who spent 12 hours in the batting cages compared to the predicted RBI total for player 2 who spent 4 hours in the batting...
1. A researcher would like to predict the dependent variable YY from the two independent variables...
1. A researcher would like to predict the dependent variable YY from the two independent variables X1X1 and X2X2 for a sample of N=15N=15 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test the significance of the overall regression model. Use a significance level α=0.05α=0.05. X1X1 X2X2 YY 66.4 76.4 58 34.6 39 65.5 32.7 23.1 65.8 44.4 71.2 73.3 57.3 50.8 57.9 32.7 48 74.6 53.3 51.4 64.4 48.3 51.1 59.2 66.9 81.4 59.4...
A researcher would like to predict the dependent variable YY from the two independent variables X1X1...
A researcher would like to predict the dependent variable YY from the two independent variables X1X1 and X2X2 for a sample of N=11N=11 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test statistics to assess the significance of the regression model and partial slopes. Use a significance level α=0.05α=0.05. X1X1 X2X2 YY 55.3 51.1 56.2 72.1 51.6 76.6 35.2 41.7 51.8 70.4 58 47.9 51 71.6 39.8 66.6 60.4 61.9 61.9 48.9 63.4 46.8 54.3...
A researcher would like to predict the dependent variable YY from the two independent variables X1X1...
A researcher would like to predict the dependent variable YY from the two independent variables X1X1 and X2X2 for a sample of N=18N=18 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test the significance of the overall regression model. Use a significance level α=0.05α=0.05. X1X1 X2X2 YY 48.6 52.9 39.2 40.8 58.8 45.5 43.5 64.3 50.1 45.3 32.7 40.8 50.4 47.4 42.9 46.9 44.1 38.4 90.6 46.6 49.3 50.2 33.6 37.3 54.2 28.2 38.8 24.9...
In 2007, baseball players made on average $1.8 million. Using these randomly sample 12 players (All...
In 2007, baseball players made on average $1.8 million. Using these randomly sample 12 players (All in Millions of Dollars) from 2011 , Is there evidence that salaries are now different? Assume a 5% significance level $2.7, 2.9, 1.5, 2.2, 2.5, 2.0, 1.7, 2.9, 2.8, 2.6, 0.8, 2.7 Provide the following in your final analyses: the distribution to be used and why; all assumptions required for this study and if each assumption is met or not (if an assumption is...
Suppose we wish to build a multiple regression model to predict the cost of rent (dollars)...
Suppose we wish to build a multiple regression model to predict the cost of rent (dollars) in a city based on population (thousands of people), and income (thousands of dollars). Use the alpha level of 0.05. City Monthly Rent ($) 2018 Population (Thousands) 2010 Median Income (Thousands of Dollars) Denver, CO 998 586.158 45.438 Birmingham, AL 711 212.237 301.704 San Diego, CA 1414 1307.402 61.962 Gainesville, FL 741 124.354 28.653 Winston-Salem, NC 750 239.617 41.979 Memphis, TN 819 646.889 36.535...
Suppose we wish to build a multiple regression model to predict the cost of rent (dollars)...
Suppose we wish to build a multiple regression model to predict the cost of rent (dollars) in a city based on population (thousands of people), and income (thousands of dollars). Use the alpha level of 0.05. A. Is the whole regression model effective in predicting the cost of rent? Use alpha of 0.1. Make sure to show which values you use to make the decision. B. Write down the multiple regression equation using actual names of IVs and DVs. C....
The sample mean salary of 500 NBA players is 7 million dollars. We also know that...
The sample mean salary of 500 NBA players is 7 million dollars. We also know that the population standard deviation is 1 million dollars. We want to construct a confidence interval for population mean with 95% confidence. What will be the length of our confidence interval (in terms of million dollars)?
A researcher performed a regression analysis to predict achievement in physics using the following predictor variables:...
A researcher performed a regression analysis to predict achievement in physics using the following predictor variables: student gender (1=male, 0=female), emotional intelligence (ei), intelligence quotient (iq), verbal SAT (vsat), and math SAT (msat). The results are shown below. Call: lm(formula = physics ~ Gender + iq + ei + vsat + msat, data = physachv) Residuals:      Min       1Q   Median       3Q      Max -18.8370 -5.3765 -0.6288   5.5483 21.7685 Coefficients: Estimate Std. Error t value Pr(>|t|)    (Intercept) -60.106031   9.784708 -6.143 4.69e-09 ***...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT