Question

In: Statistics and Probability

Assignment Chapter 2 The dataset below contains information on the 50 states in the US, including...

Assignment Chapter 2

The dataset below contains information on the 50 states in the US, including 2 categorical and 13 quantitative variables. In the questions that follow, I ask you to use technology to do some analysis of this dataset.

  1. What are the cases? What is the sample size?
  2. Choose one of the two categorical variables and create a frequency table and a relative frequency table of the values..
  3. Choose one of the quantitative variables and use technology to create a histogram. Describe the shape of the histogram. For the same variable, create a boxplot. Are there any outliers? Finally, for the same variable, give summary statistics: mean, standard deviation, and the five number summary.
  4. Choose any quantitative variable and any categorical variable and create a sideby-side boxplot. Describe what you see in the graph, and discuss any association that might exist between variables, as evidenced by the graph.
  5. Create a two-way table of the two categorical variables. Find appropriate proportions to help you determine if there is an association between the two variables, and explain your reasoning.
  6. Choose any two quantitative variables and use technology to create a scatterplot. Describe the scatterplot: Is there an obvious positive or negative linear trend? Are there any outliers? Use technology to find the correlation and the least squares line to predict one variable from the other. Interpret the slope of the line in context.

State

HouseholdI ncome

IQ

McCain

Vote

Region

Obama

McCain

Population

HighSchool

GSP

Alabama

38160

95.7

0.604

S

M

4.525375

82.4

33264

Alaska

57071

99

0.602

W

M

0.657755

90.2

59238

Arizona

46693

97.4

0.538

W

M

5.739879

84.4

36457

Arkansas

37458

97.5

0.588

S

M

2.75

79.2

31215

California

54385

95.5

0.372

W

O

35.842038

81.3

44894

Colorado

53900

101.6

0.449

W

O

4.601821

88.3

46416

Connecticut

60551

103.1

0.383

NE

O

3.498966

88.8

55193

Delaware

52676

100.4

0.37

NE

O

0.830069

86.5

66961

Florida

45038

98.4

0.484

S

O

17.38543

85.9

37846

Georgia

48388

98

0.522

S

M

8.918129

85.2

40103

Hawaii

61005

95.6

0.266

W

O

1.262124

88.0

42361

Idaho

45919

101.4

0.615

W

M

1.39514

87.9

33020

Illinois

49328

99.9

0.369

MW

O

12.712016

86.8

43878

Indiana

44618

101.7

0.49

MW

O

6.226537

87.2

38037

Iowa

48075

103.2

0.447

MW

O

2.952904

89.8

38280

Kansas

44478

102.8

0.568

MW

M

2.733697

89.6

38465

Kentucky

38694

99.4

0.575

MW

M

4.141835

81.8

33666

Louisiana

37472

95.3

0.586

S

M

4.506685

78.7

37183

Maine

45503

103.4

0.405

NE

O

1.314985

87.1

34030

Maryland

63082

99.7

0.368

NE

O

5.561332

87.4

43967

Massachusetts

56592

104.3

0.362

NE

O

6.407382

86.9

50935

Michigan

48043

100.5

0.409

MW

O

10.104206

87.9

37175

Minnesota

56102

103.7

0.44

MW

O

5.096546

92.3

45697

Mississippi

34343

94.2

0.564

S

M

2.900768

83.0

27829

Missouri

44487

101

0.494

MW

M

5.759532

87.9

37251

Montana

39821

103.4

0.497

W

M

0.92692

91.9

31940

Nebraska

48820

102.3

0.57

MW

M

1.747704

91.3

40185

Nevada

51036

96.5

0.427

W

O

2.332898

86.3

46108

New Hampshire

60441

104.2

0.448

NE

O

1.299169

90.8

42033

New Jersey

66752

102.8

0.421

NE

O

8.685166

87.6

49447

New Mexico

40126

95.7

0.42

W

O

1.903006

82.9

35714

New York

48472

100.7

0.367

NE

O

19.280727

85.4

49748

North Carolina

41616

100.2

0.495

S

O

8.540468

80.9

39921

North Dakota

42311

103.8

0.533

MW

M

0.636308

89.5

38319

Ohio

45776

101.8

0.472

MW

O

11.450143

88.1

38461

Oklahoma

38859

99.3

0.656

S

M

3.523546

85.2

34243

Oregon

46349

101.2

0.408

W

O

3.591363

87.4

39625

Pennsylvania

48148

101.5

0.443

NE

O

12.394471

86.5

39344

Rhode Island

52421

99.5

0.353

NE

O

1.079916

81.1

40687

South Carolina

40583

98.4

0.538

S

M

4.197892

83.6

32906

South Dakota

44996

102.8

0.532

MW

M

0.770621

87.5

39848

Tennessee

40696

97.7

0.569

S

M

5.893298

82.9

38440

Texas

43044

100

0.555

S

M

22.471549

78.3

43283

Utah

55619

101.1

0.629

W

M

2.420708

91.0

36758

Solutions

Expert Solution


Related Solutions

R Assignment 2    Below are the average starting teacher salaries for the 50 states in...
R Assignment 2    Below are the average starting teacher salaries for the 50 states in the US along with District of Columbia and Federal Education Association for year 2016-2017. Construct a histogram. Label the horizontal axis and give histogram a heading. Describe the shape of the distribution for your histogram. State Avg. Starting Salary Alabama $38,477 Alaska $46,785 Arkansas $33,973 Arizona $34,068 California* $44,782 Colorado $32,980 Connecticut $45,280 District of Columbia* $51,359 Delaware $41,415 Federal Education Association $49,120 Florida...
Accounting 2 chapter 13 . this is only the information for us to answer
Accounting 2 chapter 13 . this is only the information for us to answer
The dataset for this assignment contains house prices as well as 19 other features for each...
The dataset for this assignment contains house prices as well as 19 other features for each property. Those features are detailed below and include information about the house (number of bedrooms, bathrooms…), the lot (square footage…) and the sale conditions (period of the year…) The overall goal of the assignment is to predict the sale price of a house by using a linear regression. For this assignment, the training set is in the file "house_prices_train.csv" and the test set is...
The dataset ‘diamondpricesbyrater’ (available in Canvas) contains information on the prices of samples of diamonds rated...
The dataset ‘diamondpricesbyrater’ (available in Canvas) contains information on the prices of samples of diamonds rated by agencies IGI and by HRD. Use R to conduct a hypothesis test to determine if there is a difference in the mean price of diamonds rated by the two agencies. State your hypotheses and conclusions. diamondpricesbyrater.txt IGI HRD 823 3778 765 3432 803 3851 803 3346 705 3130 725 3995 967 3701 1050 3529 967 3667 863 3202 800 3256 842 3415 800...
Problem 1 (50 pts). This problem will involve the nycflights13 dataset (including tables airlines, airports, planes...
Problem 1 (50 pts). This problem will involve the nycflights13 dataset (including tables airlines, airports, planes and weather), which we saw in class. It is available in both R and Python, however R is recommended for at least the visualization portion of the question. Start by installing and importing the dataset to your chosen platform. We will first use joins to search and manipulate the dataset, then we will produce a flightpath visualization. Question e) Produce a map that colors...
Question 2 chapter 15 Handout Assignment
Millet Sales Corp., a public company, is planning to acquire new computers with a total value of $ 60,000 on January 1, 2021. They have a choice of leasing the computers for a three-year period, or purchasing them and financing the purchase by issuing a note payable. Details of the two alternative arrangements are as follows: 1. Lease option: Three annual lease payments of $ 22,446 due on December 31 of each year. Millet would purchase the computers at the end...
This chapter and assignment further explore standards for interoperability. The HIMSS definition of interoperability states, “The...
This chapter and assignment further explore standards for interoperability. The HIMSS definition of interoperability states, “The ability of two or more systems or components to exchange information and to use the information that has been exchanged.” HIM professionals often are the subject matter experts when a question about standards arises and need to help guide the IT analysts and workers at their organization. An example from a recent conference I attended: “The IT professionals will say, we can make that...
INSTRUCTIONS: Read the information below, including Parts a and b of Question 2. Create a new...
INSTRUCTIONS: Read the information below, including Parts a and b of Question 2. Create a new Excel spreadsheet. Record your answers to all parts of the question into the spreadsheet. Use bold text to clearly label your responses to Part a and Part b. Save your work regularly. Perfect Binding Ltd provides specialist binding services to the printing industry. The company’s production manager is investigating whether to replace an old burst binding machine and has provided you with the following...
The dataset starbucks in the open intro package contains nutritional information on 77 Starbucks food items....
The dataset starbucks in the open intro package contains nutritional information on 77 Starbucks food items. Spend some time reading the help file of this dataset. For this problem, you will explore the relationship between the calories and carbohydrate grams in these items. Please complete in R Studio showing all steps. Create a scatterplot of this data with calories on the x-axis and carbohydrate grams on the y-axis, and describe the relationship you see. In the scatterplot you made, what...
The therm dataset contains information on survey respondents’ opinions about various public figures. These are “feeling...
The therm dataset contains information on survey respondents’ opinions about various public figures. These are “feeling thermometer” scores, which range from 0 (total dislike of the person) to 100 (total like). The relevant variables for this question are: • white: a dummy variable indicating whether the respondent is white (ie, 1 for white and 0 for non-white) • ideology: the respondent’s ideology on a scale of 1 (most liberal) to 7 (most conservative) • obama: the respondent’s “feeling thermometer” score...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT