Questions
Refer to the Lincolnville School District bus data. Conduct a test of hypothesis to reveal whether...

Refer to the Lincolnville School District bus data.

Conduct a test of hypothesis to reveal whether the mean maintenance cost is equal for each of the bus manufacturers. Use the .01 significance level.

Conduct a test of hypothesis to determine whether the mean miles traveled since the last maintenance is equal for each bus manufacturer. Use the .05 significance level.

Show work in Excel.

ID Manufacturer Engine Type Engine Type (0=diesel) Capacity Maintenance cost Age Odometer Miles Miles
122 Bluebird Gasoline 1 55 9394 10 116580 11967
279 Bluebird Diesel 0 55 1008 2 22672 11925
500 Bluebird Gasoline 1 55 5329 5 50765 11922
520 Bluebird Diesel 0 55 4794 10 119130 11896
714 Bluebird Diesel 0 42 3742 7 73703 11837
875 Bluebird Diesel 0 55 4376 9 97947 11814
600 Bluebird Diesel 0 55 4832 10 119860 11800
953 Bluebird Diesel 0 55 5160 10 117700 11798
101 Bluebird Diesel 0 55 1955 4 41096 11789
358 Bluebird Diesel 0 55 2775 6 70086 11782
29 Bluebird Gasoline 1 55 5352 6 69438 11781
686 Bluebird Diesel 0 55 1569 3 34674 11757
887 Bluebird Diesel 0 55 3743 8 93672 11704
464 Bluebird Gasoline 1 55 2540 3 34530 11698
43 Bluebird Gasoline 1 55 8263 9 102969 11615
704 Bluebird Diesel 0 55 4218 8 83424 11610
814 Bluebird Diesel 0 55 2028 4 40824 11576
39 Bluebird Gasoline 1 55 5821 6 69444 11533
699 Bluebird Gasoline 1 55 9069 9 98307 11518
75 Bluebird Diesel 0 55 3011 6 71970 11462
982 Bluebird Diesel 0 55 505 1 10276 11359
321 Bluebird Diesel 0 42 2732 6 70122 11358
884 Bluebird Diesel 0 55 4364 9 92457 11231
57 Bluebird Diesel 0 55 3190 7 79240 11222
731 Bluebird Diesel 0 42 3213 6 68526 11168
135 Bluebird Diesel 0 55 3560 7 76426 11127
692 Bluebird Diesel 0 55 3770 8 93248 11048
200 Bluebird Diesel 0 55 5168 10 103700 11018
540 Bluebird Gasoline 1 55 3656 4 45284 10945
660 Bluebird Gasoline 1 55 6213 6 64434 10911
482 Bluebird Gasoline 1 55 10575 10 116534 10802
984 Bluebird Diesel 0 55 3809 8 87664 10760
977 Bluebird Diesel 0 55 3769 7 79422 10759
326 Bluebird Diesel 0 55 4563 9 107343 10724
554 Bluebird Diesel 0 42 1826 4 44604 10662
695 Bluebird Diesel 0 55 1061 2 23152 10633
861 Bluebird Gasoline 1 55 9669 10 106040 10551
883 Bluebird Gasoline 1 55 1881 2 20742 10344
954 Bluebird Diesel 0 42 5284 10 101000 10235
768 Bluebird Diesel 0 42 3173 7 71778 10227
490 Bluebird Gasoline 1 55 10133 10 106240 10210
725 Bluebird Diesel 0 55 2356 5 57065 10209
507 Bluebird Diesel 0 55 3690 7 72849 10095
40 Bluebird Gasoline 1 55 9573 10 118470 10081
918 Bluebird Diesel 0 55 2470 5 53620 10075
387 Bluebird Gasoline 1 55 6863 8 89960 10055
418 Bluebird Diesel 0 55 4513 9 104715 10000
10 Keiser Gasoline 1 14 4646 5 54375 11973
751 Keiser Diesel 0 14 1078 2 22444 11948
759 Keiser Diesel 0 55 3952 8 87872 11883
365 Keiser Diesel 0 55 3065 6 63384 11778
162 Keiser Gasoline 1 55 3143 3 31266 11758
370 Keiser Gasoline 1 55 7766 8 86528 11707
948 Keiser Diesel 0 42 4342 9 97956 11691
678 Keiser Diesel 0 55 3361 7 75229 11668
481 Keiser Gasoline 1 6 3097 3 34362 11662
693 Keiser Gasoline 1 55 9193 9 101889 11461
989 Keiser Diesel 0 55 4795 9 106605 11418
724 Keiser Diesel 0 42 3754 8 91968 11344
732 Keiser Diesel 0 42 4640 9 101196 11342
880 Keiser Gasoline 1 55 8410 9 97065 11336
61 Keiser Diesel 0 55 4139 9 103536 11148
754 Keiser Diesel 0 14 7380 14 146860 11003
353 Keiser Gasoline 1 55 4279 4 45744 10902
705 Keiser Diesel 0 42 2152 4 47596 10755
767 Keiser Diesel 0 55 2985 6 71538 10726
120 Keiser Diesel 0 42 4723 10 110320 10674
9 Keiser Gasoline 1 55 3527 4 46848 10591
603 Keiser Diesel 0 14 2116 4 44384 10518
427 Keiser Gasoline 1 55 6927 7 73423 10355
45 Keiser Diesel 0 55 3124 6 60102 10167
38 Keiser Gasoline 1 14 5976 6 61662 10140
396 Thompson Diesel 0 14 1072 2 21858 11969
193 Thompson Diesel 0 14 5922 11 128711 11248
833 Thompson Diesel 0 14 3920 8 90968 11112
671 Thompson Gasoline 1 14 6733 8 89792 11100
398 Thompson Diesel 0 6 4752 9 95922 10802
156 Thompson Diesel 0 14 6212 12 140460 10473
168 Thompson Gasoline 1 14 7004 7 83006 10315
314 Thompson Diesel 0 6 5408 11 128117 10128

In: Math

The dataset TrafficFlow gives the delay time in seconds for 24 simulation runs in Dresden, Germany,...

The dataset TrafficFlow gives the delay time in seconds for 24 simulation runs in Dresden, Germany, comparing the current timed traffic light system on each run to a proposed flexible traffic light system in which lights communicate traffic flow information to neighboring lights. On average, public transportation was delayed 105 seconds under the timed system and 44 seconds under the flexible system. Since this is a matched pairs experiment, we are interested in the difference in times between the two methods for each of the 24 simulations. For the n=24 differences D, we were given that x¯D=61 seconds with sD=15.19 seconds. We wish to estimate the average time savings for public transportation on this stretch of road if the city of Dresden moves to the new system.

what parameter are we estimating? give correct notation

suppose that we write the 24 differences on 24 slips of paper. describe how to physically use the paper slips to create a bootstrap sample.

what statistic do we for this one bootstrap sample?

if we create a bootstrap distribution using many of these bootstrap statistics what shape do we expect it to be centered?

how can we use the values in the bootstrap distribution to find the standard error?

the standard error 3.1 for one set of 10,000 bootstrap samples. find and interpret a 95% confidence interval for the average time savings.

In: Math

You may need to use the appropriate appendix table or technology to answer this question. Consider...

You may need to use the appropriate appendix table or technology to answer this question.

Consider the following hypothesis test.

H0: μ = 30

Ha: μ ≠ 30

The population standard deviation is 14. Use  α = 0.05. How large a sample should be taken if the researcher is willing to accept a 0.10 probability of making a type II error when the actual population mean is 34?

In: Math

A researcher selects a sample of 49 participants from a population with a mean of 12...

A researcher selects a sample of 49 participants from a population with a mean of 12 and a standard deviation of 3.5. What is the probability of selecting a sample mean that is at least equal to the population mean? 0.50 equal to the probability of selecting a sample mean that is at most equal to the population mean all of the above none of the above

In: Math

Have you ever tried to get out of jury duty? About 25% of those called will...

Have you ever tried to get out of jury duty? About 25% of those called will find an excuse (work, poor health, travel out of town, etc.) to avoid jury duty.†

(a) If 11 people are called for jury duty, what is the probability that all 11 will be available to serve on the jury? (Round your answer to three decimal places.)


(b) If 11 people are called for jury duty, what is the probability that 5 or more will not be available to serve on the jury? (Round your answer to three decimal places.)


(c) Find the expected number of those available to serve on the jury. What is the standard deviation? (Round your answers to two decimal places.)

μ = people
σ = people


(d) How many people n must the jury commissioner contact to be 95.9% sure of finding at least 12 people who are available to serve? (Enter your answer as a whole number.)
people

In: Math

For patients who have been given a diabetes test, blood-glucose readings are approximately normally distributed with...

For patients who have been given a diabetes test, blood-glucose readings are approximately normally distributed with mean 128 mg/dl and a standard deviation 8 mg/dl. Suppose that a sample of 3 patients will be selected and the sample mean blood-glucose level will be computed. Enter answers rounded to three decimal places.

According to the empirical rule, in 95 percent of samples the SAMPLE MEAN blood-glucose level will be between the lower-bound of _________ and the upper-bound of ______

In: Math

Part 1 Binomial Distribution [Mark 20%/cancer type, 40% total mark] Five year survival chance from any...

Part 1 Binomial Distribution [Mark 20%/cancer type, 40% total mark]

Five year survival chance from any cancer depends on many factors like availability of treatment options, expertise of attending medical team and more. Five year survival rate is also an important measure and it is used by medical practitioners to report prognosis to patients and family. We will be analyzing five year survival rate of two types of cancer, very aggressive and very treatable cancer and to have comparative analysis of cancer in Norway.

  1. Five year survival rate of breast cancer is 87.7%
  2. Five year survival rate of esophageal cancer is 16.5%

(NOTE: due to limitation imposed by our available probability distribution table assume survival rate for breast cancer is 90% and for esophageal cancer is 20%)

To simplify our comparative analysis, we will assume 480 patients were admitted in January 2018. For each type of cancer:

  1. Calculate the number of patients that are expected to survive at least 5 years, calculate variance and standard deviation expected for 5 year survival.
  2. For a selected 15 patients in each category, calculate probability of at least 7 patients surviving past 5 years, compare your calculated value with estimated number from table in page 329. How accurate is our estimation? Do you prefer estimation or calculation methods?
  3. Use the table on page 329 of your textbook to determine survival probability for 0 to 15 patients and complete table below.

Selected number of patient will survive 5 years

Probability of breast cancer patient will survive 5 years

Probability of esophageal cancer patient will survive 5 years

0

1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

  1. Draw frequency distribution bar graph for each type of cancer
  2. What is the minimum/maximum number of survivors expected for given cancer type at 5 years. [HINT: use mean and 3 standard deviations to report outcome].

Part 2 Normal distribution [Mark 30%]

Daily discharge from phosphate mine is normally distributed with a mean daily discharge of 38 mg/L and a standard deviation of 12 mg/L. What proportion of days will the daily discharge exceed 58 mg/L?

Part 3 Normal approximation of binomial Probability Distribution [Mark 30%]

Airlines and hotels often grant reservation in excess to their available capacity, to minimize loss and maximize profitability due to no shows. Suppose that the records of Air Georgian shows that on average, 10% of their prospective passengers will not show up at departure gates. If Air Georgian sells 215 tickets  and their plane has capacity for 200 passengers.

  1. Use binomial probability distribution to calculate, the mean of passengers showing up at the gate out of the 215 reservations made
  2. Calculate the standard deviation.
  3. Calculate standard z score for 200 passengers showing up at the gates.
  4. Using normal probability distribution, determine probability of at least 200 passengers will show up!
  5. Determine probability of more than 200 passengers showing up at the gate?
  6. Determine probability of all 215 passengers showing up at the gate!

In: Math

n 1998, the Nabisco Company launched a “1000 Chips Challenge” advertising campaign in which it was...

n 1998, the Nabisco Company launched a “1000 Chips Challenge” advertising campaign in which it was claimed that every 18-ounce bag of their Chips Ahoy cookies contains 1000 chips (on average). A curious statistics student purchased 8 randomly selected bags of cookies and counted the chocolate chips. The data is given below:

1200 1019 1214 1087 1214 900 1200 825

a) The student concluded that the data was not normally distributed and wanted to use a Wilcoxon Signed-Rank test to test the company’s claim. What assumption is needed in this case?

b) Assuming the assumption in part a. is met, at the 1% significance level, do the data provide sufficient evidence to conclude that the average number of chocolate chips in a bag of Chips Ahoy cookies differs from 1000? Carry out the Wilcoxon Signed-Rank Test by hand.

In: Math

( PLEASE SHOW ALL YOUR WORK) You will need your ticker code (company abbreviation) for stock...

( PLEASE SHOW ALL YOUR WORK)

You will need your ticker code (company abbreviation) for stock prices for this question. Use your ticker code to obtain the closing prices for the following two time periods to obtain two data sets:

March 2, 2019 to March 16, 2019

Data set A

February 16, 2019 to February 28, 2019

Data set B

Take the closing prices from data set B and add 0.5 to each one of them. Treat data sets A and B as hypothetical sample level data on the weights of newborns whose parents smoke cigarettes (data set A), and those whose parents do not (data set B).

a) Conduct a hypothesis test to compare the variances between the two data sets.

b) Conduct a hypothesis to compare the means between the two data sets. Selecting the assumption of equal variance or unequal variance for the calculations should be based on the results of the previous test.

c) Calculate a 95% confidence interval for the difference between means

A B
84.09 83.74
83.22 84.45
82.35 84.37
83.39 83.77
82.65 84.66
82.09 85.5
82.49 85.35
82.19 86.29
82.32
82.4
83.06
Mean 82.75 84.76625
SD 0.617770184 0.887612166

In: Math

8) What proportion of a normal distribution is located between each of the following z-score boundaries?...

8) What proportion of a normal distribution is located between each of the following z-score boundaries?

a. z = –0.25 and z = +0.25

b. z = –0.67 and z = +0.67

c. z = –1.20 and z = +1.20

13) A normal distribution has a mean of μ = 30 and a standard deviation of σ = 12. For each of the following scores, indicate whether the body is to the right or left of the score and find the proportion of the distribution located in the body.

a. X = 33

b. X = 18

c. X = 24

d. X = 39

19) A report in 2010 indicates that Americans between the ages of 8 and 18 spend an average of μ = 7.5 hours per day using some sort of electronic device such as smart phones, computers, or tablets. Assume that the distribution of times is normal with a standard deviation of σ = 2.5 hours and find the following values.

a. What is the probability of selecting an individual who uses electronic devices more than 12 hours a day?

b. What proportion of 8- to 18-year-old Americans spend between 5 and 10 hours per day using electronic devices? In symbols, p (5 < X < 10) = ?

In: Math

A political poll immediately prior to a local election revealed the following result. ​ Female Voters...


A political poll immediately prior to a local election revealed the following result.

Female Voters

Male voters

Vote Democrat

1200

1150

Vote Republican

2100

950

Total

    n1 = 3300

n2 = 2100

Please show Excel work:

A. Use α = .05. Test to determine whether the proportions of female and male voters who intend to vote for the Democrat candidate differ? Report the test statistic and the p-value.

B. Provide a 99% confidence interval for the difference in the proportion of female and male voters who intend to vote for the Democrat candidate

In: Math

1. What demographic variables were measured at the nominal level of measurement in the Oh et...

1. What demographic variables were measured at the nominal level of measurement in the Oh et al. (2014) study? Provide a rationale for your answer. 2. What statistics were calculated to describe body mass index (BMI) in this study? Were these appropriate? Provide a rationale for your answer. 3. Were the distributions of scores for BMI similar for the intervention and control groups? Provide a rationale for your answer. 4. Was there a signifi cant difference in BMI between the intervention and control groups? Provide a rationale for your answer.

In: Math

We are interested in whether math score (math – a continuous variable) is a significant predictor...

We are interested in whether math score (math – a continuous variable) is a significant predictor of science score (science – a continuous variable) using the High School and Beyond (hsb2) data.

State the null and alternative hypotheses and the level of significance you intend to use.

Ho:β=0

H1:β≠0

Alph:0.05

Write the equation for the appropriate test statistic.

t =b/SE(b)

What is your decision rule? Be sure to include the degrees of freedom.

If our t value is greater than the critical value of 1.96 we reject the null hypothesis.

FD= n-2=200-2= 198=1.96

Using SAS, estimate the means, variances and covariances for math and science scores. Copy and paste the relevant SAS output below.

variable label DF Peramieter Estimate Standered Error tvalue Pr>\t\ 95% CI
intercept intercept 1 21.7 2.75 7.88 <0.001 16.26,27.13
science science score 1 0.596

0.052

11.44 <0.001 0.49,0.69

Using the output from (d), calculate by hand the slope. Be sure to show your work.

Using the output from (d), calculate by hand the intercept. Be sure to show your work

In: Math

Baseball's World Series is a maximum of seven games, with the winner being the first team...

Baseball's World Series is a maximum of seven games, with the winner being the first team to win four games. Assume that the Atlanta Braves and the Minnesota Twins are playing in the World Series and that the first two games are to be played in Atlanta, the next three games at the Twins' ballpark, and the last two games, if necessary, back in Atlanta. Taking into account the projected starting pitchers for each game and the home field advantage, the probabilities of Atlanta winning each game are as follows:

Game 1 2 3 4 5 6 7
Probability of Win 0.65 0.4 0.45 0.55 0.47 0.42 0.6

a. Set up a spreadsheet simulation model for which whether Atlanta wins or loses each game is a random variable. What is the probability that the Atlanta Braves win the World Series? If required, round your answer to two decimal places.

b. What is the average number of games played regardless of winner? If required, round your answer to one decimal place.

In: Math

Assume that a sample is used to estimate a population proportion p. Find the 95% confidence...

Assume that a sample is used to estimate a population proportion p. Find the 95% confidence interval for a sample of size 380 with 125 successes. Enter your answer as a tri-linear inequality using decimals (not percents) accurate to three decimal places.

___ < p < ____

Answer should be obtained without any preliminary rounding. However, the critical value may be rounded to 3 decimal places.

In: Math