Questions
The percentage of cotton in material used to manufacture men's shirts follows. Construct a stem-and-leaf display...

The percentage of cotton in material used to manufacture men's shirts follows. Construct a stem-and-leaf display for the data. Calculate the median and quartiles of these data. Choose the correct answer.

33.6 37.0 34.0 32.1 33.5 35.6 34.9 35.4 32.2 36.1 34.7 32.4 33.9 36.8 32.3 33.7 34.8 36.9 34.4 33.5 33.5 37.5 32.6 34.9 35.8 32.9 35.0 37.2 35.3 35.2 32.7 33.2 36.9 34.7 36.9 33.9 34.7 36.0 34.0 35.1 34.0 36.5 36.8 32.1 35.9 32.6 36.0 36.7 34.1 34.2 35.9 32.9 34.8 32.3 36.8 35.0 32.0 36.0 33.4 33.8 34.6 34.7 36.2 34.1

In: Statistics and Probability

Write the regression equation. Discuss the statistical significance of the model using the appropriate regression statistic...

  1. Write the regression equation.

  1. Discuss the statistical significance of the model using the appropriate regression statistic at a 95% level of confidence.
  2. Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.
  3. Interpret the coefficient for the independent variable.
  4. What percentage of the observed variation in income is explained by the model?
  5. Predict the value of a person’s income with 3 children, using this regression model.

SUMMARY OUTPUT

Regression Statistics

Multiple R

0.039342

R Square

0.001548

Adjusted R Square

0.000672

Standard Error

1.554036

Observations

1142

ANOVA

df

SS

MS

F

Significance F

Regression

1

4.267905

4.267905

1.767229

0.183991

Residual

1140

2753.131

2.415027

Total

1141

2757.398

Coefficients

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

Lower 95.0%

Upper 95.0%

Intercept

1.621275

0.067736

23.93504

6.6E-103

1.488373

1.754177

1.488373

1.754177

X Variable 1

1.64E-06

1.23E-06

1.329372

0.183991

-7.8E-07

4.06E-06

-7.8E-07

4.06E-06

In: Statistics and Probability

Study 1: Unemployment Rates in Areas Where Minimum Wages Are Higher in the Cities Than in...

Study 1: Unemployment Rates in Areas Where Minimum Wages Are Higher in the Cities Than in Surrounding Suburbs City Unemployment Rates Suburb Unemployment Rates P1 P 1 = 0.069 P2 P 2 = 0.062 N1 N 1 = 52 cities N2 N 2 = 56 suburbs The Z(obtained) test statistic is 1.99. Using a significance level of .05, the Z(critical) is +1.645. Which of the following is the appropriate conclusion to your hypothesis test? The difference between the unemployment rates in the cities and the suburbs is statistically significant. The difference between the unemployment rates in the cities and the suburbs is not statistically significant. Suppose you conduct a second study and ask half as many people the same question. Suppose the proportions remain approximately the same. The new results are shown in the following table: Study 2: Unemployment Rates in Areas Where Minimum Wages Are Higher in the Cities Than in Surrounding Suburbs City Unemployment Rates Suburb Unemployment Rates P1 P 1 = 0.068 P2 P 2 = 0.061 N1 N 1 = 26 cities N2 N 2 = 28 suburbs When compared with the first study, you would expect the Z(obtained) test statistic to and the Z(critical) to . Without computing the test statistic for the second study, you your conclusion will be the same as your conclusion for the first study. Now that you have a sense of how changing the sample size affects the statistical significance of the statistical finding, what about whether the test is one-tailed or two-tailed? How does moving from a one-tailed test to a two-tailed test change the probability of rejecting the null hypothesis? The probability of rejecting the null hypothesis does not change. The probability of rejecting the null hypothesis decreases. The probability of rejecting the null hypothesis increases. How does changing the sample size, or changing from one-tailed to two, affect the importance of the statistical finding? Check all that apply. Sample size does not affect the importance of a statistical finding. Changing from two-tailed to one is more likely to produce a statistically significant finding, but that doesn't mean it will be more important. A small sample is more likely to result in an important statistical finding. A large sample is more likely to result in an important statistical finding.

In: Statistics and Probability

In an effort to determine whether any correlation exists between the price of stocks of airlines,...

In an effort to determine whether any correlation exists between the price of stocks of airlines, an analyst sampled six days of activity of the stock market. Using the following prices of Delta stock and Southwest stock, compute the coefficient of correlation. Stock prices have been rounded off to the nearest tenth for ease of computation.

Delta Southwest
47.6 15.1
46.4 15.4
50.6 16.1
52.6 15.6
52.4 16.4
53.2 18.1

In: Statistics and Probability

A researcher conducts a hypothesis test and concludes that his hypothesis is correct. Explain why this...

A researcher conducts a hypothesis test and concludes that his hypothesis is correct. Explain why this conclusion is never an appropriate decision in hypothesis testing.

In: Statistics and Probability

Answer & show work andswer and show work answer n show work Using a sample of...

Answer & show work

andswer and show work

answer n show work

Using a sample of 20 people, the testing agency found that 14 of them had better protection than that provided by the competitor. Do you have enough evidence to say/claom that your suncreen lotion provides better protection than the competitiors in a majority of cases? Use alpha = 0.01 to answer.

1. What are the apporiate hypotheses for situation?

2. the appropriate rejection rule is?

3. the calculated value of the appropriate test statistic is?

4. pretend that your test statistic is +4. what can you conclude?

5. WHat is the assumption of this test?

In: Statistics and Probability

The Centers for Disease Control and Prevention Office on Smoking and Health (OSH) is the lead...

The Centers for Disease Control and Prevention Office on Smoking and Health (OSH) is the lead federal agency responsible for comprehensive tobacco prevention and control. OSH was established in 1965 to reduce the death and disease caused by tobacco use and exposure to secondhand smoke. One of the many responsibilities of the OSH is to collect data on tobacco use. The following data show the percentage of U.S. adults who were users of tobacco for a recent 11-year period

Year Percentage of Adults Who Smoke
1 22.7
2 21.9
3 21.2
4 20.8
5 20.8
6 20.2
7 19.3
8 20.6
9 20.6
10 19.8
11 18.9
  1. Use simple linear regression analysis to find the parameters for the line that minimizes MSE for this time series. Do not round your interim computations and round your final answers to three decimal places. For subtractive or negative numbers use a minus sign. (Example: -300)

    y-intercept, b0 =

    Slope, b1 =

    MSE =
  2. One of OSH’s goals is to cut the percentage of U.S. adults who were users of tobacco to 12% or less within nine years of the last year of these data. Does your regression model from part (b) suggest that OSH is on target to meet this goal?



    Use your model from part (b) to estimate the number of years that must pass after these data have been collected before OSH will achieve this goal. Round your answer to the nearest whole number. Yes or No

    years=

In: Statistics and Probability

For Exercises 49–52, identify the independent and dependent variables for each study. 49.A study was conducted...

For Exercises 49–52, identify the independent and dependent variables for each study.

49.A study was conducted to determine if crocodiles raised in captivity (i.e., in a zoo) grew faster than crocodiles living in the wild. Identify the explanatory variable and the outcome variable.

50.People who walk at least 3 miles a day are randomly selected, and their blood triglyceride levels are measured in order to determine if the number of miles that they walk has any influence on these levels.

51.In an article in the British Journal of Nutrition, two types of mice were randomly selected. One group received a thyme supplement for a specific time, while another group was used as a control group and received no supplements. The brains of the mice were then analyzed, and it was found that the brains of the group of mice that received the thyme supplements had antioxidant levels similar to those of younger mice. It was concluded that the thyme supplement increased the antioxidants in the brains of the mice.

52.A study was conducted to determine if workers who had a flexible work schedule had greater job satisfaction than those workers who worked a regular nine-to-five work schedule. Independent variable—type of work schedule.

In: Statistics and Probability

An apartment complex surveyed its tenants and asked a variety of questions about their living arrangements....

An apartment complex surveyed its tenants and asked a variety of questions about their living arrangements. One question asked for the renter’s marital status using categorical responses of single, married, divorced, and widowed. Another question asked for the renter’s pet ownership with the categorical Dog, Cat, both (cat and dog) and none (neither cat nor dog).

  1. Methods: Explicitly state and conduct the appropriate hypothesis test to determine if there is any relationship between pet ownership and marital status? (All steps should be included)
  2. Discussion: Interpret the statistical decision you arrived at in terms that would be useful for apartment policies and procedures. This section should also include suggestions for further research and/or discussion of additional variables that may provide more precise results.
    Marital Status Pets
    Divorced none
    Widowed both
    Single cat
    Divorced dog
    Single Dog
    Married cat
    Single both
    Married cat
    Single cat
    Widowed none
    Divorced cat
    Married Dog
    Single none
    Married none
    Divorced Dog
    Married cat
    Widowed both
    Married none
    Married none
    Widowed cat
    Divorced both
    Single cat
    Single Dog
    Single Dog
    Married none
    Married Dog
    Single Dog
    Married cat
    Single cat
    Single both
    Married cat
    Divorced Dog
    Divorced none
    Married both
    Married both
    Widowed dog
    Married both
    Married cat
    Single cat
    Widowed none
    Divorced cat
    Married Dog
    Divorced both
    Single cat
    Single Dog

In: Statistics and Probability

SUMMARY OUTPUT Regression Statistics Multiple R 0.396235 R Square 0.157002 Adjusted R Square 0.156262 Standard Error...

SUMMARY OUTPUT

Regression Statistics

Multiple R

0.396235

R Square

0.157002

Adjusted R Square

0.156262

Standard Error

18.42647

Observations

1142

ANOVA

df

SS

MS

F

Significance F

Regression

1

72088.71

72088.71

212.3161

3.12E-44

Residual

1140

387069.6

339.5348

Total

1141

459158.4

Coefficients

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

Lower 95.0%

Upper 95.0%

Intercept

26.35917

0.803163

32.8192

7.4E-167

24.78333

27.93501

24.78333

27.93501

X Variable 1

0.000213

1.46E-05

14.57107

3.12E-44

0.000184

0.000242

0.000184

0.000242

a. Write the reqression equation.

  1. Discuss the statistical significance of the model using the appropriate regression statistic at a 95% level of confidence.
  2. Discuss the statistical significance of the coefficient for the independent variable using the appropriate regression statistic at a 95% level of confidence.
  3. Interpret the coefficient for the independent variable.
  4. What percentage of the observed variation in income is explained by the model?
  5. Predict the value of a person’s income who works 50 hours a week, using this regression model.

In: Statistics and Probability

A survey was conducted to determine whether differences in health care expenditure exist between men and...

A survey was conducted to determine whether differences in health care expenditure exist between men and women. Dataset contains total amounts spent on health care. a. Formulate the hypothesis that can be used to determine whether the sample data support the hypothesis that men and women spend different amounts on health care. b. What is the point estimate of the difference between the means for the two populations? c. Using the p-value and critical value approaches, what is your conclusion at α = 0.05? Make sure to report the p-value and critical values you used.

Men Women
75.45 155.84
1869.44 1420.88
487.22 267.56
1529.57 1843.48
423.12 338.49
279.68 757.35
794.43 442.36
1.13 329.31
56.78 305.57
699.41 514.4
1278.74 1048.98
395.54 526.7
2217.96 2404.58
996.27 622.56
640.77 459.78
1866.03 1777.64
587.89 807.67
520.63 726.32
1477.49 1609.63
392.41 1298.86
1724.05 1350.07
506.07 608.41
1357.56 1155.45
259.9 406.43
432.8 570.8
3033.65 3450
978.01 792.47
1953.09 1828.4
722.98 1069.65
1806.8 2237.03
1031.63 1184.15
822.6 319.43
1828.28 1845.77
2408.31 2734.14
2676.72 2523.3
1870.92 1235.51
2751.37 3183.55
1405.73 1746.62
1530.83 1114.66
1796.1 2091.21
2537.42 2844.19
1291.7 1216.35
1013.79
1443.57
1822.24

In: Statistics and Probability

Please answer a-b A clinical study shows the results of several people who have been analyzed...

Please answer a-b

A clinical study shows the results of several people who have been analyzed to see if they have the gene that increases the risk of getting cancer. The study reveals that the probability that a person has the cancerous gene is 0.1. Experts estimate that when they find that five people who have the cancer gene is that a medicine called ABX should be applied.

a) What is the probability that 7 or more people have to be analyzed to determine that the ABX medicine should be applied?

b) On average how many people should be tested before applying the ABX medication

In: Statistics and Probability

The building specifications in a certain city require that the sewer pipe used in residential areas...

The building specifications in a certain city require that the sewer pipe used in residential areas have a mean breaking strength of more than 2500 pounds per lineal foot. A manufacturer who would like to supply the city with sewer pipe has submitted a bid and provided the following information: An independent contractor randomly selected seven sections of the manufacturer’s pipe and tested each for breaking strength. The results (pounds per lineal foot) are as follows: 2610, 2750, 2420, 2510, 2540, 2490, and 2680.

1. What type of test should be conducted?

Dependent t test

Independent t test

One sample t test


2. State the null hypothesis in equation format.

3. State the alternative hypothesis in equation format.

4. What is the calculated t-value (to 3 significant digits)?

5. What is the critical t-value (to 3 significant digits)? Use alpha = 0.05.

6. Is the null hypothesis accepted or rejected? Use alpha = 0.05.

7. Is there sufficient evidence to conclude that there is a difference between the mean rate of increase of total phosphorus of the control algal and the water hyacinth? Use alpha = 0.05. Explain in one sentence.

In: Statistics and Probability

On each turn, bonnie is tossing a fair coin and Clyde is rolling a fair die....

On each turn, bonnie is tossing a fair coin and Clyde is rolling a fair die. They stop once Clyde rolls an odd number for the first time. Let X be the number of "Heads" that Bonnie's coin showed.

a) Compute E[X]

b) Compute var(X)

In: Statistics and Probability

Please answer these questions using SPSS. Thank you Note: For all assignments, you must show the...

Please answer these questions using SPSS. Thank you

Note: For all assignments, you must show the requested output from SPSS.

Example. Determine the descriptive statistics for three quantitative variables. Which variable has the highest mean? The most variability?

Answer: Output should include 3 boxes of descriptive statistics, one for each variable. There should also be one page that gives the answer to the other two questions.

The following sample data are used. We are interested in the descriptive statistics from these variables: sleep duration of undergrads, the number of NBA wins and the number of words in a résumé). Assume that they are all samples from populations. Use the instructions from the first two recitation worksheets to help you.

Sleep duration for undergraduates (minutes)

Wins2

(NBA teams)

Words3

(in résumé)

405

388

60

190

367

475

53

339

368

425

50

220

373

425

49

295

376

535

46

180

410

580

41

214

379

383

40

257

383

440

38

201

385

488

38

242

387

523

22

240

400

28

223

390

32

301

425

25

267

488

53

284

523

48

238

600

67

251

420

56

278

488

56

294

488

51

266

523

55

227

328

55

281

367

50

312

368

45

332

390

45

376

39

377

38

379

30

383

29

385

21

400

16

N = 40

N = 30

N = 23

1 This is a survey of the amount of sleep for one night for 40 undergraduates.

2 These are the numbers of wins for the 30 National Basketball Association teams for the 2014-2015 NBA season.

3 A large university offers training on résumé construction. This lists the number of words in each of 23 résumés.

Note: Here are the 5% Trimmed Means for these variables. This will help you make sure you’ve input the data correctly: Amount of sleep: 419.94 minutes. Wins: 42.69. Words: 257.72.

  1. Input the data the way you learned in recitation. You can input all of the data (3 data sets) at the same time, but you must run the analysis of each variable individually, otherwise SPSS won’t read it correctly, because the sample sizes are different. [Compare your descriptive statistics with the means and standard deviations on your graphs; this will help you make sure100405405 your numbers are correct.] In addition, I want you to add another variable next to the words variable which will divide the variable “Words” categorically. I want you to call this variable “wordscat.” (Note: this is the variable name. The variable labelcan be more descriptive, e.g., Words by Category). Create this variable using the Transform procedure you used in lab. Remember that you will have to give this variable Value Labels in the Variable View.

The scores should be divided into categories like this:

Low: 100-200              Moderate: 201-300           High: 301-400

  1. For all the quantitative variables (wordscat is categorical) find descriptive statistics (using Explore). Print out “Descriptives” and Stemplot. Remember to do this for each variable individually. If you try to do them at the same time your answers will be incorrect.
  1. For all the quantitative variables (wordscat is categorical) create and print out frequency distributions and histograms (using Frequencies). Remember to do this for each variable individually. If you try to do them at the same time your answers will be incorrect. Note: When doing the histogram, if you check the box “Show Normal Curve” this does not mean your variable is normal. It just shows what normal would look like and you can compare the shape of your variable to what is normal.
  1. For the categorical variable (wordscat) create and print out a bar chart. Make sure the variable is categorical on your chart and remember to label the chart.
  1. Create a variable (call it “sleeptwo”) that multiplies each student’s amount of sleep by 2 (do this by using Transform > Compute like in Lab 2, but instead of adding, we are multiplying by 2. Use this sign * for multiplication). Repeat steps 2 and 3 above for the new variable.

On your separate page please answer the following questions (worth 8 points):

  1. Calculate the SS for each of the three main quantitative variables using the variance you obtained in SPSS (this should be on a sheet separate from the SPSS output). Note: Remember the relationship between variance and SS to help you do this.
  1. Write 1-2 sentences (on your separate page) for each of the three main quantitative variables including the following: values of the mean, median, standard deviation and approximate shape of the distribution (look at the histogram). The shape may not be obvious; this is ok. Just do your best to describe what you think is going on with the data (e.g., mention whether you think there is skew, are there are any outliers, is it bimodal?).
  1. Answer the following questions:
    1. What happened to the mean and standard deviation of sleeptwo compared to the original mean and standard deviation of that variable?
    2. What happened to the shape of the distribution of the sleep variable after multiplying by two (compared to the original variable)? What is the explanation for this?

In: Statistics and Probability