Question

In: Statistics and Probability

id sex status income verbal gamble 1 1 51 2 8 0 2 1 28 2.5...

id sex status income verbal gamble 1 1 51 2 8 0 2 1 28 2.5 8 0 3 1 37 2 6 0 4 1 28 7 4 7.3 5 1 65 2 8 19.6 6 1 61 3.47 6 0.1 7 1 28 5.5 7 1.45 8 1 27 6.42 5 6.6 9 1 43 2 6 1.7 10 1 18 6 7 0.1 11 1 18 3 6 0.1 12 1 43 4.75 6 5.4 13 1 30 2.2 4 1.2 14 1 28 2 6 3.6 15 1 38 3 6 2.4 16 1 38 1.5 8 3.4 17 1 28 9.5 8 0.1 18 1 18 10 5 8.4 19 1 43 4 8 12 20 0 51 3.5 9 0 21 0 62 3 8 1 22 0 47 2.5 9 1.2 23 0 43 3.5 5 0.1 24 0 27 10 4 156 25 0 71 6.5 7 38.5 26 0 38 1.5 7 2.1 27 0 51 5.44 4 14.5 28 0 38 1 6 3 29 0 51 0.6 7 0.6 30 0 62 5.5 8 9.6 31 0 18 12 2 88 32 0 30 7 7 53.2 33 0 38 15 7 90 34 0 71 2 10 3 35 0 28 1.5 1 14.1 36 0 61 4.5 8 70 37 0 71 2.5 7 38.5 38 0 28 8 6 57.2 39 0 51 10 6 6 40 0 65 1.6 6 25 41 0 48 2 9 6.9 42 0 61 15 9 69.7 43 0 75 3 8 13.3 44 0 66 3.25 9 0.6 45 0 62 4.94 6 38 46 0 71 1.5 7 14.4 10. A study of teenage gambling in Britain was performed in 2008. There is 47 observations and 5 variables. Download the data set Gambling from Blackboard and answer the following questions. a) Make a numerical and graphical summary of the data, commenting on any features that you fi interesting. Limit the output your present to a quantity that a busy reader would find sufficient. b) What percent of the variation in the response is explained by these predictors? c) Which observation has the largest (positive) residual? d) Compute the mean and median of the residuals. e) For all other predictors held constant, what would be the difference in predicted expenditure on gambling for a male compared to a female? f) Which variables are statistically significant? g) Predict the amount that a male with average status, income, and verbal score would gamble along with an appropriate 95% CI. Repeat the prediction for a male with maximal values of status, income, and verbal score. Which CI is wider and why is this result expected? h) Fit a model with just income as a predictor and use an F?test to compare it to the full model. i) Check the constant variance, normality, and linearity assumption. De- scribe your findings.

Solutions

Expert Solution

a)

b)

Regression Output:

Since the value of R Square for this model is 0.5279, 52.79% of the variation in the dependent variable is explained by the independent variables.

c)

Observation 24 has the largest Residual.

For this observation,

Gamble = 156

And Gamble Predicted = 61.93

Residual = 94.07

d)

Mean of the residuals = -2.66*1015

Median of the residuals = -1.91

e)

22

Since the coefficient of Sex is -22, difference in predicted expenditure on gambling for a male compared to a female will be 22

f)

Since the p-value for variables sex, income and verbal is less than 0.05, hence they are significant


Related Solutions

DATA 2 ID X1 X2 X3 Y A 0 2 4 9 B 1 0 8...
DATA 2 ID X1 X2 X3 Y A 0 2 4 9 B 1 0 8 10 C 0 1 0 5 D 1 1 0 1 E 0 0 8 10 CORRELATION MATRIX Y X1 X2 X3 Y 1 ? -0.304 +0.889 X1 ? 1 -0.327 0 X2 -0.304 -0.327 1 -0.598 X3 +0.889 0 -0.598 1 1. What is the sum of squares regression for the full model? (Correct answer is 58, please show me how to get...
DATA 2 ID X1 X2 X3 Y A 0 2 4 9 B 1 0 8...
DATA 2 ID X1 X2 X3 Y A 0 2 4 9 B 1 0 8 10 C 0 1 0 5 D 1 1 0 1 E 0 0 8 10 CORRELATION MATRIX Y X1 X2 X3 Y 1 ? -0.304 +0.889 X1 ? 1 -0.327 0 X2 -0.304 -0.327 1 -0.598 X3 +0.889 0 -0.598 1 Comparing the zero order model and full model 1. Did the addition of X2 and X3 significantly increase R2? (correct answer is...
A= 1 2 4 0 1 -2 -1 0 1 2 0 3 8 1 4...
A= 1 2 4 0 1 -2 -1 0 1 2 0 3 8 1 4 . Let W denote the row space for A. (a) Find an orthonormal basis for W and for W⊥. (b) Compute projW⊥(1 1 1 1 1 ).
A = (1 −7 5 0 0 10 8 2 2 4 10 3 −4 8...
A = (1 −7 5 0 0 10 8 2 2 4 10 3 −4 8 −9 6) (1) Count the number of rows that contain negative components. (2) Obtain the inverse of A and count the number of columns that contain even number of positive components. (3) Assign column names (a,b,c,d) to the columns of A. (4) Transform the matrix A into a vector object a by stacking rows. (5) Replace the diagonal components of A with (0,0,2,3). Hint:...
DATA 3 8 2 15 2 2 0 0 4 5 2 7 0 1 5...
DATA 3 8 2 15 2 2 0 0 4 5 2 7 0 1 5 3 0 2 5 4 1 6 9 5 3 1 2 10 6 1 1 2 1 19 6 6 6 7 0 4 1 1 1 0 1 9 2 2 2 1 16 10 10 5 2 3 1 4 4 4 3 6 2 8 5 2 7 1 6 4 0 3 1 1 1 Background: A group of...
Bought Income Children ViewedAd 0 37.00 2 2 1 47.00 1 1 0 47.00 1 2...
Bought Income Children ViewedAd 0 37.00 2 2 1 47.00 1 1 0 47.00 1 2 0 49.00 2 2 1 59.00 1 1 0 13.00 2 1 0 51.00 1 2 0 38.00 1 2 0 60.00 1 1 1 48.00 1 1 0 17.00 1 2 0 60.00 2 2 0 38.00 1 1 0 24.00 1 2 0 15.00 1 2 1 59.00 1 2 0 28.00 1 2 0 36.00 1 2 0 10.00 2 1...
x (Bins) frequency 0 0 1 0 2 0 3 2 4 5 5 8 6...
x (Bins) frequency 0 0 1 0 2 0 3 2 4 5 5 8 6 13 7 33 8 42 9 66 10 77 11 105 12 103 13 110 14 105 15 84 16 70 17 51 18 40 19 27 20 27 21 15 22 5 23 7 24 2 25 2 26 1 27 0 28 0 29 0 30 0 (7) On the Histogram worksheet, calculate all frequencies of the distribution using the table shown....
given the sequences  x1 = [2, 6, -4, 1] x2 = [8, 0, 2, 0, -9,...
given the sequences  x1 = [2, 6, -4, 1] x2 = [8, 0, 2, 0, -9, 0, 1, 0] x3 = [2, 0, -8, -8, 2] x4 = [0, 1, 5i, 0, 6i, 0] x5 = [9, 3, 7] plot the 1. DFT magnitude of the computed sequences in MATLAB  2. phase responses in degrees and radians against frequency and number of samples 3. comment on the plots
Raw data ID X Y A 0 0 B 0 2 C 3 4 D 3...
Raw data ID X Y A 0 0 B 0 2 C 3 4 D 3 4 E 6 6 F 6 8 Standard scores ID STDX STDY A -1.22 -1.55 B -1.22 -0.78 C 0 0 D 0 0 E 1.22 0.78 F 1.22 1.55 1. What is the sum of squares regression? (correct answer is 36, please show work) 2. What can you conclude with ANOVA? (correct answer is Reject the null, p<0.01; type I error is possible,...
ID Affiliation Location Education Confidence 1 1 3 0 72 2 1 3 5 65 3...
ID Affiliation Location Education Confidence 1 1 3 0 72 2 1 3 5 65 3 0 4 5 66 4 0 1 4 78 5 0 3 1 81 6 1 2 5 81 7 1 1 2 83 8 1 3 3 74 9 0 4 0 78 10 0 2 2 85 11 0 1 1 85 12 1 3 5 69 13 1 2 0 69 14 1 3 2 79 15 1 4 1 82...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT