Question

A multiple regression model is to be constructed to model the time spent using the internet per week among internet users. The explanatory variables are age, hours spent working per week and annual income.

Data has been collected on 30 randomly selected individuals:

Time using internet (minutes)   Age   Hours working per week   Annual income ('000)
140                             56    39                       28
257                             35    31                       79
163                             35    35                       34
115                             33    52                       27
182                             45    36                       37
214                             51    57                       80
187                             44    37                       50
142                             26    55                       41
251                             19    47                       35
203                             21    42                       36
243                             28    25                       26
244                             23    26                       28
131                             48    56                       46
174                             24    54                       63
131                             51    52                       78
178                             38    39                       79
135                             22    50                       36
124                             31    57                       27
173                             57    44                       60
189                             33    35                       58
179                             59    30                       35
230                             37    27                       51
121                             59    46                       53
150                             36    40                       49
151                             42    47                       80
147                             38    56                       35
195                             54    32                       59
134                             58    52                       44
190                             39    28                       27
197                             58    40                       72

a.) Find the multiple regression equation using all three explanatory variables. Assume that x1 is age, x2 is hours working per week and x3 is annual income. Give your answers to 3 decimal places.

ŷ = ________ + ________ age + ________ hours working + ________ annual income

b.) At a level of significance of 0.05, the result of the F test for this model is that the null hypothesis (is) (is not) rejected.

c.) The value of R² for this model, to 3 decimal places, is equal to __________

d.) The value of s (the standard error of the estimate) for this model, to 3 decimal places, is equal to _________

e.) The least significant explanatory variable in this model is:

a.) age
b.) hours working per week
c.) annual income

f.) Construct a new multiple regression model by removing the variable annual income. Give your answers to 3 decimal places.

The new regression model equation is:

ŷ = ________ + ________ age + ________ hours working

g.) In the new model compared to the previous one, the value of R² (to 3 decimal places) is:

a.) increased
b.) decreased
c.) unchanged

h.) In the new model compared to the previous one, the value of s (to 3 decimal places) is:

a.) increased
b.) decreased
c.) unchanged

i.) The better model is the:

a.) original model
b.) reduced model

Solutions
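
One way to work through parts a.) to i.) is ordinary least squares in Python with NumPy. The sketch below is offered as an illustration, not as the page's expert solution: the helper name fit_ols, its dictionary keys, and the hard-coded critical value (the upper 5% point of F with 3 and 26 degrees of freedom, about 2.975 in standard tables) are assumptions introduced here. It fits the full model and the model without annual income, then prints the quantities each part asks for. No numerical answers are quoted; run it to obtain them.

```python
import numpy as np

# Columns: minutes online, age, hours worked per week, annual income ('000),
# copied row by row from the data table in the question.
DATA = np.array([
    [140, 56, 39, 28], [257, 35, 31, 79], [163, 35, 35, 34],
    [115, 33, 52, 27], [182, 45, 36, 37], [214, 51, 57, 80],
    [187, 44, 37, 50], [142, 26, 55, 41], [251, 19, 47, 35],
    [203, 21, 42, 36], [243, 28, 25, 26], [244, 23, 26, 28],
    [131, 48, 56, 46], [174, 24, 54, 63], [131, 51, 52, 78],
    [178, 38, 39, 79], [135, 22, 50, 36], [124, 31, 57, 27],
    [173, 57, 44, 60], [189, 33, 35, 58], [179, 59, 30, 35],
    [230, 37, 27, 51], [121, 59, 46, 53], [150, 36, 40, 49],
    [151, 42, 47, 80], [147, 38, 56, 35], [195, 54, 32, 59],
    [134, 58, 52, 44], [190, 39, 28, 27], [197, 58, 40, 72],
], dtype=float)
NAMES = ["age", "hours working", "annual income"]


def fit_ols(y, X):
    """Least-squares fit with an intercept (hypothetical helper for this sketch)."""
    n, k = X.shape
    A = np.column_stack([np.ones(n), X])           # design matrix with intercept
    beta, *_ = np.linalg.lstsq(A, y, rcond=None)   # [b0, b1, ..., bk]
    resid = y - A @ beta
    sse = float(resid @ resid)                     # residual sum of squares
    sst = float(((y - y.mean()) ** 2).sum())       # total sum of squares
    df_resid = n - k - 1
    mse = sse / df_resid
    se = np.sqrt(mse * np.diag(np.linalg.inv(A.T @ A)))  # coefficient std. errors
    return {
        "beta": beta,
        "r2": 1.0 - sse / sst,
        "s": float(np.sqrt(mse)),                  # standard error of the estimate
        "F": ((sst - sse) / k) / mse,              # overall F statistic
        "t": beta / se,                            # t statistic per coefficient
        "df": (k, df_resid),
    }


y = DATA[:, 0]
full = fit_ols(y, DATA[:, 1:4])      # age, hours working, annual income
reduced = fit_ols(y, DATA[:, 1:3])   # age, hours working only

F_CRIT_3_26 = 2.975  # assumption: tabulated upper 5% point of F(3, 26)

print("a) coefficients:", np.round(full["beta"], 3))
print("b) F =", round(full["F"], 3), "reject H0:", full["F"] > F_CRIT_3_26)
print("c) R^2 =", round(full["r2"], 3))
print("d) s =", round(full["s"], 3))
# Least significant slope = smallest absolute t statistic (intercept excluded).
print("e) least significant:", NAMES[int(np.argmin(np.abs(full["t"][1:])))])
print("f) reduced coefficients:", np.round(reduced["beta"], 3))
print("g) R^2 reduced vs full:", round(reduced["r2"], 3), round(full["r2"], 3))
print("h) s reduced vs full:", round(reduced["s"], 3), round(full["s"], 3))
```

For part i.), a common rule of thumb is to prefer the model with the smaller s (equivalently, the larger adjusted R²). R² on its own cannot increase when a variable is removed, so it does not settle the comparison.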
