Question

In: Statistics and Probability

Question No. 01: Linear Regression Analysis in SPSS Statistics a. Assume a case study to use...

Question No. 01: Linear Regression Analysis in SPSS Statistics

a. Assume a case study to use simple linear regression for analysis and precisely interpret the results of your
study. Also, use Y=aX + b to predict the results.
b. Suppose another case study to use multiple linear regression, Interpret the results tactfully. Also, use
Z=aX+bY+c to predict the results. (Use screenshots as required).

Solutions

Expert Solution

Simple linear regression model in SPSS:

Let Y : Price (dependent variable)

X: income ( independent variable)

There are 5 steps to analyse your data using linear regression in SPSS Statistics,

1.Click Analyze > Regression > Linear... on the top menu, as shown below:

You will be presented with the Linear Regression dialogue box:

2.Transfer the independent variable, Income, into the Independent(s):box and the dependent variable, Price, into the Dependent: box. You can do this by either drag-and-dropping the variables or by using the appropriate buttons. You will end up with the following screen.

3. You now need to check four of the assumptions discussed below

a) no significant outliers .

b) independence of observations.

c)Homoscedasticityassumption

d)and normal distribution of errors/residuals .

You can do this by using the statistics button and plots features, and then selecting the appropriate options within these two dialogue boxes. In our enhanced linear regression guide, we show you which options to select in order to test whether your data meets these four assumptions.

4.Click on the ok button. This will generate the results.

Output:

The first table of interest is the Model Summary table, as shown below:

This table provides the R and R2 values. The R value represents the simple correlation and is 0.873 , which indicates a high degree of correlation. The R2 value indicates how much of the total variation in the dependent variable, Price, can be explained by the independent variable, Income. In this case, 76.2% can be explained, which is very large.

The next table is the ANOVA table, which reports how well the regression equation fits the data (i.e., predicts the dependent variable) and is shown below:

Look at the "Regression" row and go to the "Sig." column. This indicates the statistical significance of the regression model that was run. Here, p < 0.0005, which is less than 0.05, and indicates that, overall, the regression model statistically significantly predicts the outcome variable (i.e., it is a good fit for the data).

The Coefficients table provides us with the necessary information to predict price from income, as well as determine whether income contributes statistically significantly to the model (by looking at the "Sig." column). Furthermore, we can use the values in the "B" column under the "Unstandardized Coefficients" column, as shown below:

The fitted simple linear regression equation as:

Price = 8287 + 0.564(Income)

Multiple linear regression analysis in SPSS :

Open data file.

In our example,

Y: Murder rate ( dependent variable) and population, burglary, larceny, and vehicle theft are independent variable.

In our example, we need to enter the variable “murder rate” as the dependent variable and the population, burglary, larceny, and vehicle theft variables as independent variables.

In the field “Options…” we can set the stepwise criteria. We want to include variables in our multiple linear regression model that increase the probability of F by at least 0.05 and we want to exclude them if the increase F by less than 0.1.


The “Statistics…” menu allows us to include additional statistics that we need to assess the validity of our linear regression analysis.


It is advisable to include the collinearity diagnostics and the Durbin-Watson test for auto-correlation. To test the assumption of homoscedasticity and normality of residuals we will also include a special plot from the “Plots…” menu.

The next table shows the multiple linear regression model summary and overall fit statistics. We find that the adjusted R² of our model is .398 with the R² = .407. This means that the linear regression explains 40.7% of the variance in the data. The Durbin-Watson d = 2.074, which is between the two critical values of 1.5 < d < 2.5. Therefore, we can assume that there is no first order linear auto-correlation in our multiple linear regression data.

The F-test is highly significant, thus we can assume that the model explains a significant amount of the variance in murder rate.

The next table shows the multiple linear regression estimates including the intercept and the significance levels.

we find that only burglary and motor vehicle theft are significant predictors. We can also see that motor vehicle theft has a higher impact than burglary by comparing the standardized coefficients (beta = .507 versus beta = .333).

The information in the table above also allows us to check for multicollinearity in our multiple linear regression model. Tolerance should be > 0.1 (or VIF < 10) for all variables, which they are.


Related Solutions

(20 pts) Use the “Distance.sav” (SPSS) data set (located below) to perform a linear regression analysis....
(20 pts) Use the “Distance.sav” (SPSS) data set (located below) to perform a linear regression analysis. This dataset shows how far on average a person in Illinois drives each year. Write your findings using the format presented in the class slides. (2 pts) How much of the variation in the dependent variable is explained by the variation in the independent variable? What statistic did you use? (2 pts) Is the linear model significantly different than zero? Why or why not?...
Case Study Chapter 18 Forecasting Use simple linear regression analysis with seasonality to forecast demand. Rebar...
Case Study Chapter 18 Forecasting Use simple linear regression analysis with seasonality to forecast demand. Rebar Sizes    The standard Rebar sizes that the Company manufactures and sells are as follows: 10mm, 12mm, 16mm, 20mm, 25mm, 28mm, 32mm, and 36mm. For special orders, it can also make 40mm and 50mm bars. Case 1 Introduction Over the past years, the demand for 25mm rebars have fluctuated with a seasonal pattern being observed.   Since rebars are manufactured on a “made to order” basis,...
In this problem, we will use linear regression and residual analysis to study the relationship between...
In this problem, we will use linear regression and residual analysis to study the relationship between square footage of a house and the home sales price. (a) Go to the course webpage and under Datasets, download the CSV file “homes.csv” and follow the accompanying Minitab instructions. Copy and paste the Fitted Line Plots and the Residual Plots in a blank document. Print these out and attach them to your homework. (b) Based on the fitted line and residual plots for...
Use SPSS to follow the steps below and conduct a simple linear regression of the following...
Use SPSS to follow the steps below and conduct a simple linear regression of the following data: Calories (Xi) Sodium (Yi) 186 495 181 477 176 425 149 322 184 482 190 587 158 370 139 322 175 479 148 375 State your hypotheses (e.g. HA: “calories will significantly predict sodium”) Create a scatterplot of the data. State if the scatterplot appears to contain a linear relationship. Conduct the analysis in SPSS. Include all of the important outputs (e.g. ANOVA...
In a multi linear regression case study, the dependent variable is house_value, the independent variables are...
In a multi linear regression case study, the dependent variable is house_value, the independent variables are house_age, crime_rate, tax_rate, trying to build a model to predict the house value, how to state model assumptions? What's the assumption in this case? Thanks!
Importance of linear regression in research analysis.
Importance of linear regression in research analysis.
Run a linear regression using Excel’s Data Analysis regression tool. Construct the linear regression equation and...
Run a linear regression using Excel’s Data Analysis regression tool. Construct the linear regression equation and determine the predicted total sales value if the number of promotions is 6. Is there a significant relationship? Clearly explain your reasoning using the regression results. Number of Promotions Total Sales 3 2554 2 1746 11 2755 14 1935 15 2461 4 2727 5 2231 14 2791 12 2557 4 1897 2 2022 7 2673 11 2947 11 1573 14 2980
This question involves the use of simple linear regression on the fat dataset that can be...
This question involves the use of simple linear regression on the fat dataset that can be found in the faraway library. data set. Use the lm() function to perform a simple linear regression with brozek (percent body fat using the reference method) on abdom (abdomen circumference in cm) as the predictor. Print the results of the summary(function) and submit along with your answers to the following questions. Is there a relationship between the predictor and the response? How strong is...
In a simple linear regression analysis, will the estimate of the regression line be the same...
In a simple linear regression analysis, will the estimate of the regression line be the same if you exchange X and Y? Why or why not?
Use Excel to prepare a Linear Regression Analysis. Use data samples below for populations and determine...
Use Excel to prepare a Linear Regression Analysis. Use data samples below for populations and determine if the selected independent variable is affecting the dependent variable. Use an alpha of 5% for ANOVA and Correlation Coefficient. Explain the results. Data samples Group A 104,103,101,99,97,101,101 Group B 101,100,95,99,101,103,97 Group C 100,96,99,95,99,102,106 Group D 97,99,99,101,105,100,99
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT