Question

In: Statistics and Probability

Income ($1000s) Household Size Amount Charged ($) 89.31 2.00 10985.47 61.08 5.00 9792.97 43.95 4.00 6527.55...

Income
($1000s) Household
Size Amount
Charged ($)
89.31 2.00 10985.47
61.08 5.00 9792.97
43.95 4.00 6527.55
55.15 6.00 9708.57
40.39 2.00 6335.20
34.06 3.00 4809.57
86.43 4.00 12314.59
79.49 3.00 10823.38
48.40 2.00 7172.03
46.49 3.00 6996.39
50.68 3.00 7349.23
73.77 5.00 11719.66
34.18 4.00 6062.71
81.64 3.00 9861.66
65.08 4.00 10411.15
41.96 3.00 8169.33
41.48 2.00 6705.15
45.21 3.00 5871.21
48.98 5.00 8627.18
35.70 4.00 6466.99
77.36 2.00 9646.36
79.00 3.00 11910.64
52.03 3.00 7624.06
42.91 3.00 7635.68
38.69 5.00 7955.40
59.49 2.00 6076.57
82.77 2.00 11637.13
9.93 3.00 3911.33
59.54 3.00 6756.73
44.92 3.00 7031.92
33.79 3.00 7777.11
27.66 2.00 5470.17
53.17 3.00 8559.86
35.15 4.00 6306.89
89.46 2.00 10466.76
26.45 5.00 4101.41
89.59 6.00 14962.20
73.96 4.00 12153.59
73.15 4.00 11324.99
56.15 3.00 9704.71
46.57 4.00 9592.53
38.29 5.00 7372.79
34.84 4.00 6708.17
74.27 2.00 8743.04
50.16 4.00 9211.51
85.99 3.00 12318.45
50.82 4.00 9109.05
79.99 3.00 12882.03
68.38 5.00 11310.12
64.42 5.00 11356.01
57.78 3.00 8030.32
50.85 3.00 8905.52
41.35 2.00 5863.35
68.88 4.00 10199.39
87.37 4.00 13589.18
42.15 7.00 8958.60
85.91 3.00 9884.07
79.22 4.00 11881.21
72.59 5.00 11091.60
70.79 3.00 12217.53
65.91 4.00 11661.95
50.88 4.00 6898.00
29.77 4.00 5342.99
82.30 2.00 9685.05
44.81 2.00 6882.08
3.99 1.00 1612.58
57.16 6.00 10069.27
23.25 5.00 8063.88
15.31 3.00 6064.65
72.60 3.00 12132.90
72.53 5.00 11562.23
80.31 1.00 9250.01
43.47 6.00 8147.12
65.82 2.00 10219.37
78.58 4.00 11057.59
37.36 5.00 8690.96
50.86 3.00 7186.18
77.72 3.00 12597.46
73.55 2.00 9859.00
73.87 4.00 10205.59
1) Provide graphical summaries of the data. Comment on your findings.
2) Develop an estimated regression equation, using annual income as the independent variable. Insert Regression equation estimation results here (excluding the ANOVA):
a. Interpret the estimated slope coefficient.
b. Interpret the R-square.
c. Interpret the p-value on the slope.
d. Interpret the 95% confidence interval.
3) Develop an estimated regression equation, using household size as the independent variable. Insert Regression equation estimation results here (excluding the ANOVA):
a. Interpret the estimated slope coefficient.
b. Interpret the R-square.
c. Interpret the p-value on the slope.
d. Interpret the 95% confidence interval.
4) Which of the two models is the better predictor of annual credit card charges? Defend your decision.
5) Provide a scatterplot of the standardized residuals from your chosen best model and comment whether the assumption appear to be met.

Solutions

Expert Solution

Using R

df = read.table("../Documents/creditCardData.txt")
colnames(df )= c("Income","Size","Charged")

#2
model1 <- lm (Charged ~Income , data = df)
summary(model1)


#3
model2 <- lm (Charged ~Size , data = df)
summary(model2)

2) Develop an estimated regression equation, using annual income as the independent variable. Insert Regression equation estimation results here (excluding the ANOVA):

a. Interpret the estimated slope coefficient.

slope = 106.482

if income increases by 1000$, amount charged increases by 106.482$ on average

b. Interpret the R-square.

R^2 = 0.7427

this means 74.27 % of variation in amount charged is explained by this model

c. Interpret the p-value on the slope.

p-value = 0

this means that the slope is significantly different from 0

d. Interpret the 95% confidence interval.

95% confidence interval of slope is (92.3556,120.6085)

3) Develop an estimated regression equation, using household size as the independent variable. Insert Regression equation estimation results here (excluding the ANOVA):

a. Interpret the estimated slope coefficient.

similar to Q2

slope = 502.9

if Household size increases by 1 unit , amount charged increases by 502.9$ on average

b. Interpret the R-square.

R^2 = 0.06136

this means 6.136% of variation in amount charged is explained by this model

c. Interpret the p-value on the slope.

p-value = 0.0267

here p-value < 0.05( alpha)

hence we reject the null hypothesis

we conclude that the slope is significant here too

d. Interpret the 95% confidence interval.

95% confidence interval of slope is (59.50223,946.2219)

4) Which of the two models is the better predictor of annual credit card charges? Defend your decision.

Model 1 is better

as this has much higher R^2


Related Solutions

1.00 1.00 2.00 2.00 3.00 1.30 4.00 3.75 5.00 2.25 Make a scatterplot and include here...
1.00 1.00 2.00 2.00 3.00 1.30 4.00 3.75 5.00 2.25 Make a scatterplot and include here Calculate the regression line.
A student mixes 5.00 mL of 2.00 x 10-3 M Fe(NO3)3, 4.00 mL of 2.00 x...
A student mixes 5.00 mL of 2.00 x 10-3 M Fe(NO3)3, 4.00 mL of 2.00 x 10-3 M KSCN, and 1.00 mL of distilled water and finds that in the equilibrium mixture, the concentration of FeSCN2+ is 1.31 x 10-4 M. The equation describing the equilibrium is Fe3+ (aq) + SCN- (aq) →← →← FeSCN2+ (aq) Calculate the concentration of FeSCN2+ at equilibrium. Include appropriate significant figures and units in your response. Example input: 1.31x10^-4 M.
Prob. Ret(A) Ret(B) 10% -5.00% -2.00% 20% 0.00% 1.00% 40% 5.00% 3.00% 20% 10.00% 4.00% 10%...
Prob. Ret(A) Ret(B) 10% -5.00% -2.00% 20% 0.00% 1.00% 40% 5.00% 3.00% 20% 10.00% 4.00% 10% 15.00% 5.00% 1) Please calculate covariance and correlation between A and B. 2) Please calculate the variance of a portfolio consisting of 45% of A and 55% of B. 3) Please find the MVP based on a portfolio consisting of A and B. What is the expected return of the MVP? What is the risk for the MVP? 4) Please find the optimal portfolio...
A student mixes 5.00 mL of 2.00 x 10‐3 M Fe(NO3)3 with 5.00 mL 2.00 x...
A student mixes 5.00 mL of 2.00 x 10‐3 M Fe(NO3)3 with 5.00 mL 2.00 x 10‐3 M KSCN. She finds that in the equilibrium mixture the concentration of FeSCN+2 is 1.40 x 10‐4 M. What is the initial concentration in solution of the Fe+3 and SCN‐? What is the equilibrium constant for the reaction? What happened to the K+ and the NO3‐ ions in this solution?
A 2.90 μFμF capacitor is charged to 470 VV and a 4.00 μFμF capacitor is charged...
A 2.90 μFμF capacitor is charged to 470 VV and a 4.00 μFμF capacitor is charged to 535 V. a). These capacitors are then disconnected from their batteries, and the positive plates are now connected to each other and the negative plates are connected to each other. What will be the potential difference across each capacitor? Enter your answers numerically separated by a comma. V1, V2= b). What will be the charge on each capacitor? Enter your answers numerically separated...
Lot Price Data Lot Price is lot price in $1000s Lot Size is lot size in...
Lot Price Data Lot Price is lot price in $1000s Lot Size is lot size in 1000s of square feet Mature Trees is the number of mature trees on the property Distance from Water is the distance from the edge of property to the water in feet Distance from Road is the distance from the main road to the center of the property in miles Lot Price Lot Size Mature Trees Distance from Water Distance from Road 105.4 41.2 24...
4.00 moles of HI are placed in an evacuated 5.00 L flask and then heated to...
4.00 moles of HI are placed in an evacuated 5.00 L flask and then heated to 800 K. The system is allowed to reach equilibrium. What will be the equilibrium concentration of each species
A proton with a velocity V = (2.00 m / s) i - (4.00 m /...
A proton with a velocity V = (2.00 m / s) i - (4.00 m / s) j - (1.00 m / s) k, a B = (1.00 T) i + (2.00 T) j- (1.00 T) k it moves within the magnetic field. What is the magnitude of the magnetic force (Fe) acting on the particle? (Qproton = 1.6x10-19 C)
Develop an estimated regression equation with annual income and household size as the independent variables. Discuss...
Develop an estimated regression equation with annual income and household size as the independent variables. Discuss your findings - Income ($1000s) Household Size Amount Charged ($) 54 3 4,016 30 2 3,159 32 4 5,100 50 5 4,742 31 2 1,864 55 2 4,070 37 1 2,731 40 2 3,348 66 4 4,764 51 3 4,110 25 3 4,208 48 4 4,219 27 1 2,477 33 2 2,514 65 3 4,214 63 4 4,965 42 6 4,412 21 2 2,448...
a 5.00 kg ball, moving to the right at a velocity of 2.00 m/s on the...
a 5.00 kg ball, moving to the right at a velocity of 2.00 m/s on the frictionless table, collide head on with a stationary 7.50 kg ball. find the final velocities of the balls if the collision is elastic, completely inelastic the balls stick together
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT