Question

In: Statistics and Probability

I’m working on an ANOVA homework problem, I have 5 group’s individual income based on their...

I’m working on an ANOVA homework problem, I have 5 group’s individual income based on their years of education. (<12, 12, 13-15, 16 and 16+). Here is the problem:

Use an extra sum of squares F-test (BYOA: Build Your Own ANOVA!) to use all the data (to increase the degrees of freedom and thus the power of the test!) to compare only the bachelor’s degree group (16) income to the more than bachelor’s degree group (>16) income.Show your final ANOVA table and your 6-step complete analysis.You will need to assume that the standard deviations of the log-transformed data are again equal to proceed here.A two-sample t-test between these two groups (assuming equal standard deviations on logged data) yields a p-value of .1648 (try it!), but it only uses 778 degrees of freedom (from a pooled t-test).Make note again of how many degrees of freedom were used to estimate the pooled standard deviation in your extra sum of squares test.You may use SAS or R.

I need help!I’m using SAS.For the first step, I ran the ANOVA on the logged data to determine if there was a difference in any of the means.Test concluded there was.

Dependent Variable: logincome2005

Source

DF

Sum of Squares

Mean Square

F Value

Pr > F

Model

4

217.653784

54.413446

62.87

<.0001

Error

2579

2232.120383

0.865498

Corrected Total

2583

2449.774168

R-Square

Coeff Var

Root MSE

logincome2005 Mean

0.088846

8.913094

0.930322

10.43770

Then, I put all the non-16 year subjects in one group and ran the ANOVA to compare the 16 year group to the combined group of the others to see if there was a difference.Test concluded there was.

Dependent Variable: logincome2005

Source

DF

Sum of Squares

Mean Square

F Value

Pr > F

Model

1

62.214640

62.214640

67.28

<.0001

Error

2582

2387.559527

0.924694

Corrected Total

2583

2449.774168

R-Square

Coeff Var

Root MSE

logincome2005 Mean

0.025396

9.212857

0.961610

10.43770

Next, I grouped the non-16+ subjects together and ran the ANOVA to compare the 16+ group to the combined group of the others.Test concluded there was a difference.

Dependent Variable: logincome2005

Source

DF

Sum of Squares

Mean Square

F Value

Pr > F

Model

1

92.614028

92.614028

101.45

<.0001

Error

2582

2357.160140

0.912920

Corrected Total

2583

2449.774168

R-Square

Coeff Var

Root MSE

logincome2005 Mean

0.037805

9.154018

0.955469

10.43770

I then “built” my own ANOVA tables:The first comparing the 16 group comparison run against the original and then the 16+ group comparison run against the original.

16 years educ different (comparing to original result)

df

SS

MS

F

Pr > F

Model (Full)

3

155.44

51.8133333

59.8653238

0

Error (from Full)

2579

2232.12

0.86549826

Total (From Reduced)

2582

2387.56

16+ years educ different (comparing to original result)

df

SS

MS

F

Pr > F

Model (Full)

3

125.04

41.68

48.1572317

0

Error (from Full)

2579

2232.12

0.86549826

Total (From Reduced)

2582

2357.16

I’m stuck on what to do next in order to compare only the 16 yr group against the 16+ year.Guidance would be appreciated.

Solutions

Expert Solution

After you ran the first ANOVA

Dependent Variable: logincome2005

Source

DF

Sum of Squares

Mean Square

F Value

Pr > F

Model

4

217.653784

54.413446

62.87

<.0001

Error

2579

2232.120383

0.865498

Corrected Total

2583

2449.774168

R-Square

Coeff Var

Root MSE

logincome2005 Mean

0.088846

8.913094

0.930322

10.43770

You should have run Post-hoc tests. Post-hoc tests are used to check where the difference exists

between groups if there is an overall significance. Since there exists a difference in any of the means of 5 groups, a post-hoc test will provide the evidence where this difference occurred. A Bonferroni post-hoc test, Tukey HSD, or any other test is generally used to do so. This will compare each group with all the other groups as :

<12 with <12 , <12 with 12 , <12 with 13-15, <12 with 16, <12 with 16+

and so on.

In this way you will get a comparison of 16 with 16+ as well.


Related Solutions

Java homework problem: I need the code to be able to have a message if I...
Java homework problem: I need the code to be able to have a message if I type in a letter instead of a number. For example, " Please input only numbers". Then, I should be able to go back and type a number. import java.awt.event.ActionEvent; import java.awt.event.ActionListener; import javax.swing.JButton; import javax.swing.JFrame; import javax.swing.JLabel; import javax.swing.JPanel; import javax.swing.JTextField; public class LoginGui {    static JFrame frame = new JFrame("JFrame Example");    public static void main(String s[]) {        JPanel panel...
Attached is the problem I am working on I have to use phantoms, and i have...
Attached is the problem I am working on I have to use phantoms, and i have already completed steps p and H, I need help help with step A , which is to "state and check the assumptions for the hypothesis test", I think the correct hypothesis test to use would be the 2 sample t test, but im not sure. The number of cell phones per 100 residents in countries in Europe is given in table #9.3.9 for the...
Solution for problem one (P1) below: ME 311 Thermodynamics I 1 Homework 5 Problem from Cengel...
Solution for problem one (P1) below: ME 311 Thermodynamics I 1 Homework 5 Problem from Cengel and Boles 8th Edition P1) Carbon dioxide flows steadily in a pipe at 3000 kPa, 31 °C, and at a rate of 1.5 kg/s. Determine the density of carbon dioxide using (a) the ideal gas law and (b) the compressibility chart.
I have this homework and i have to prepare a case study for it  Develop...
I have this homework and i have to prepare a case study for it  Develop a case study or scenario on a business/economics/related area problem.  Collect and define a set of data on this scenario.  Summarize and analyze the data set by using R-program, or Excel or SPSS.  Apply at least five statistical data description techniques (descriptive measures),  Then solve your case problems by using confidence intervals, determining sample size, hypothesis testing ( single population,...
Based on your group’s personal experiences, have they ever read a review on a web site...
Based on your group’s personal experiences, have they ever read a review on a web site like TripAdvisor and not believed that it was in fact written by a customer? Take a look at a popular travel destination on Trip Advisor and see if your group agrees with what is written about a resort or tour and if they would rely on some of the information posted there. Why or why not? Reference the resort or tour and provide a...
Lili was working on her mathematic homework, and suddenly, she found another classic problem. The problem...
Lili was working on her mathematic homework, and suddenly, she found another classic problem. The problem is: “Given three light bulbs X , Y , and Z. Bulb X light up every A seconds, bulbs Y light up every B seconds, and bulb Z light up every C seconds. If three bulbs light up together for the 1-st time at 0-th second, the 2-nd time this three bulbs will light up together at the same time will be at K1-th...
You’re working on a team-based homework assignment with a partner, Deidre, that consists of an essay...
You’re working on a team-based homework assignment with a partner, Deidre, that consists of an essay and graphing questions. You can write an essay answer in 15 minutes while Deidre takes 20 minutes to write an essay of similar quality. You can answer a graphing question in 30 minutes and it also takes Deidre 30 minutes. What are you and your partner’s opportunity cost of answering essay questions and of finishing graphing questions? Use the opportunity cost principle to determine...
More than anything I need 5 - 7 of this homework. You have been asked by...
More than anything I need 5 - 7 of this homework. You have been asked by your supervisors at A&L Engineering to design a roller coaster for a new theme park. Because this design is in the initial stages, you have been asked to create a track for the ride. Your coaster should have at least two peaks and two valleys, and launch from an initial height of 75 meters. Each peak and valley should represent a vertical change of...
I’m having a very hard time. I have to do a poster pensebtation which I have...
I’m having a very hard time. I have to do a poster pensebtation which I have no clue how to do on Identify the key mental health disorders that affect the elderly. Determine how these key mental disorders differ across racial, ethic, gender, and socioeconomic lines. Thank you,
I have a homework class, but I don't really understand anything and I have to submit...
I have a homework class, but I don't really understand anything and I have to submit my homework next week. Homework must be written in C ++ program language. Can someone help me please... Working with classes (everything written below is one task): Define a class Date that contains integer variables for day, month, and year. 1.1. Create the necessary methods for the class: set, get, default constructor, constructor with arguments. 1.2. Create a method that calculates the number of...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT