Question

In: Statistics and Probability

1. A researcher would like to predict the dependent variable YY from the two independent variables...

1. A researcher would like to predict the dependent variable YY from the two independent variables X1X1 and X2X2 for a sample of N=15N=15 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test the significance of the overall regression model. Use a significance level α=0.05α=0.05.

X1X1 X2X2 YY
66.4 76.4 58
34.6 39 65.5
32.7 23.1 65.8
44.4 71.2 73.3
57.3 50.8 57.9
32.7 48 74.6
53.3 51.4 64.4
48.3 51.1 59.2
66.9 81.4 59.4
31.2 40.1 59.1
44.6 33.1 54.8
45.5 49.7 65.7
30.5 9.9 44.4
62.8 75.3 66.7
56.1 61 51.2



SSreg=
SSres=
R2=
F=
P-value =

2.

Complete the missing information for this regression model. Note:  N=23N=23.

ˆYY^ = 29.692 + 2.311X12.311X1 −- 1.374X21.374X2 + X3X3
2.195 1.005 1.016 1.415 Standard Errors
13.527 2.3   t-ratios
0.059   P-values

(Except for P-values, report all values accurate to 3 decimal places. For P-values, report accurate to 4 decimal places.)

Solutions

Expert Solution

In order to solve this question I used R software.

R codes and output:

> d=read.table('data.csv',header=T,sep=',')
> head(d)
    X1   X2    Y
1 66.4 76.4 58.0
2 34.6 39.0 65.5
3 32.7 23.1 65.8
4 44.4 71.2 73.3
5 57.3 50.8 57.9
6 32.7 48.0 74.6
> attach(d)
The following objects are masked from d (pos = 3):

    X1, X2, Y

> fit=lm(Y~X1+X2)
> summary(fit)

Call:
lm(formula = Y ~ X1 + X2)

Residuals:
    Min      1Q  Median      3Q     Max 
-8.9976 -3.0325  0.4088  3.9166  8.1187 

Coefficients:
            Estimate Std. Error t value Pr(>|t|)    
(Intercept)  68.8802     6.0662  11.355 8.93e-08 ***
X1           -0.6906     0.2119  -3.259  0.00684 ** 
X2            0.4928     0.1358   3.630  0.00345 ** 
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1

Residual standard error: 5.89 on 12 degrees of freedom
Multiple R-squared:  0.528,     Adjusted R-squared:  0.4493 
F-statistic: 6.711 on 2 and 12 DF,  p-value: 0.01106

> anova(fit)
Analysis of Variance Table

Response: Y
          Df Sum Sq Mean Sq F value   Pr(>F)   
X1         1   8.57    8.57  0.2469 0.628242   
X2         1 457.15  457.15 13.1757 0.003452 **
Residuals 12 416.36   34.70                    
---
Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
> 

SSReg =SS(X1) +SS(X2) = 8.57 + 457.15 = 465.72

SSres = 416.36

R2 = 0.528

F = 6.711 [ It is in the output of summary(fit) command ]

P-value = 0.01106


Related Solutions

A researcher would like to predict the dependent variable YY from the two independent variables X1X1...
A researcher would like to predict the dependent variable YY from the two independent variables X1X1 and X2X2 for a sample of N=11N=11 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test statistics to assess the significance of the regression model and partial slopes. Use a significance level α=0.05α=0.05. X1X1 X2X2 YY 55.3 51.1 56.2 72.1 51.6 76.6 35.2 41.7 51.8 70.4 58 47.9 51 71.6 39.8 66.6 60.4 61.9 61.9 48.9 63.4 46.8 54.3...
A researcher would like to predict the dependent variable YY from the two independent variables X1X1...
A researcher would like to predict the dependent variable YY from the two independent variables X1X1 and X2X2 for a sample of N=18N=18 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test the significance of the overall regression model. Use a significance level α=0.05α=0.05. X1X1 X2X2 YY 48.6 52.9 39.2 40.8 58.8 45.5 43.5 64.3 50.1 45.3 32.7 40.8 50.4 47.4 42.9 46.9 44.1 38.4 90.6 46.6 49.3 50.2 33.6 37.3 54.2 28.2 38.8 24.9...
A researcher would like to predict the dependent variable Y from the two independent variables X1...
A researcher would like to predict the dependent variable Y from the two independent variables X1 and X2 for a sample of N=13 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test statistics to assess the significance of the regression model and partial slopes. Use a significance level α=0.02. X1X1 X2X2 YY 58 64.7 35.9 45 35.4 78.8 69.4 45.5 64.1 63.2 71.9 9 58 42.8 84.3 24.5 55.5 63.5 27.5 49.3 77.3 41.8 50.5...
A researcher would like to predict the dependent variable Y from the two independent variables X1...
A researcher would like to predict the dependent variable Y from the two independent variables X1 and X2 for a sample of N=16 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test the significance of the overall regression model. Use a significance level α=0.05. X1             X2             Y 48 42.3 47.1 36.3 58.7 65.4 43.4 40.2 63.6 49.5 37.9 45.6 45.5 37.2 50.8 40.6 64.7 42.4 42.5 46.7 63.1 42.7 40 35.8 55.8 10.6 52.1 40.9...
A researcher would like to predict the dependent variable Y from the two independent variables X1...
A researcher would like to predict the dependent variable Y from the two independent variables X1 and X2 for a sample of N=12 subjects. Use multiple linear regression to calculate the coefficient of multiple determination and test statistics to assess the significance of the regression model and partial slopes. Use a significance level α=0.05.    X1          X2         Y 34.4 26.4 59.4 53.7 38.3 90.4 72.8 43.2 71.3 25.4 21.2 64.5 75.9 46.5 71.1 60.4 27.9 72.6 28 56.4 29.9 40.1...
Given here are the data from a dependent variable and two independent variables. The second independent...
Given here are the data from a dependent variable and two independent variables. The second independent variable is an indicator variable with several categories. Hence, this variable is represented by x2, x3, and x4. How many categories are there for this independent variable? Use a computer to perform a multiple regression analysis on this data to predict y from the x values. Discuss the output and pay particular attention to the dummy variables. y x1 x2 x3 x4 11 1.9...
1- In discriminant analysis, the dependent variable is ________ and the independent variables are ________ in...
1- In discriminant analysis, the dependent variable is ________ and the independent variables are ________ in nature. a interval; categorical b ordinal; interval c categorical; interval d ordinal; categorica 2- s one of the statistics associated with discriminant analysis, ________ are obtained when the unstandardized coefficients are multiplied by the values of the variables and the products are summed and added to the constant term. a Wilks' λ b discriminant scores c discriminant function coefficients d eigenvalue
1- In discriminant analysis, the dependent variable is ________ and the independent variables are ________ in...
1- In discriminant analysis, the dependent variable is ________ and the independent variables are ________ in nature. a interval; categorical b ordinal; interval c categorical; interval d ordinal; categorica 2- s one of the statistics associated with discriminant analysis, ________ are obtained when the unstandardized coefficients are multiplied by the values of the variables and the products are summed and added to the constant term. a Wilks' λ b discriminant scores c discriminant function coefficients d eigenvalue
Refresher on Variables: Classifying variables as independent or dependent: The independent variable is the one being...
Refresher on Variables: Classifying variables as independent or dependent: The independent variable is the one being manipulated or grouped in the study for comparison. A tip to remember this is to think that independent starts with “I” and I, the researcher, manipulate that variable or have control over who is grouped in the study. The way we will be using independent variables in this course is in terms of between group analysis, so our focus will be on groups being...
Identify the dependent variable and the independent variable (or variables) and comment on the validity of...
Identify the dependent variable and the independent variable (or variables) and comment on the validity of the implied causal relationship: A study that looks at students’ college GPAs and how they are affected by the students’ number of weekly study hours, high school GPA, and number of weekly hours of extra-curricular activities. A young pitcher wants to understand how she can improve her pitches. She collects data on the number of strikes thrown in a pitched game, the average number...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT