Questions
Crescent Oil has developed three new blends of gasoline – Blend X, Blend Y and Blend...

Crescent Oil has developed three new blends of gasoline – Blend X, Blend Y and Blend Z, and must decide which blend or blends to produce and distribute. A study of the miles per gallon ratings of the three blends is being conducted to determine if the mean ratings are the same for the three blends.

Five automobiles- 1, 2 3, 4 and 5, have been tested using each of the three gasoline blends and the miles per gallon ratings are shown on the accompanying Excel spreadsheet.

Based on the sample data we would like to determine whether the different blends of gasoline, produce significant differences in the average mpg. We would like to use the methods we have learnt so far in 361A to see if our result is statically significant. (Statistical significance refers to a result that is not likely to occur randomly but rather is likely to be attributable to a specific cause – in this case the different gasoline blends and different cars.)

Carry out the following tests and make preliminary findings:

  1. For the sample data, calculate the means and standard deviations for the mpg for each of the three blends of gasoline – Blend X, Blend Y and Blend Z.
  2. Draw three boxplots using Excel for mpg, one for each blend of gasoline.
  3. Run three two sample t-tests between the different blends of gasoline i.e. compare the means of Blend X and Blend Y, then Blend X and Blend Z and finally Blend Y and Blend Z. Are they the same or are they different?
  4. Armed with this information above determine whether or not your results show that the three different gasoline blends produce the same average mpg or not. Your answers should specifically site the information you re using to make your determination.
  5. Re-run parts 1 - 4 above but this time your focus is on the mpg of each car for the 5 cars (car 1 through car 5) not for the three gasoline blends.

Your report should have the following sections, arranged sequentially:

1. Introduction and problem background

2. Data description and the business questions to be answered

3. Initial data exploration – descriptive statistics/graphs

4. Analyses

5. Interpretation of results, deficiencies in methods, final conclusions and recommendations for decision-making

Automobile

Blend X

Blend Y

Blend Z

1

31

30

30

2

30

29

29

3

29

29

28

4

33

31

29

5

26

25

26

In: Statistics and Probability

An experiment was conducted to determine the effect of a high salt mean on the systolic...

An experiment was conducted to determine the effect of a high salt mean on the systolic blood pressure (SBP) of subjects. Blood pressure was determined in 12 subjects before and after ingestion of a test meal containing 10.0 gms of salt. The data obtained were:

Subject

SBP before meal

SBP after meal

1

120

147

2

130

140

3

139

148

4

120

115

5

123

122

6

140

157

7

131

144

8

123

134

9

125

140

10

130

165

11

131

133

12

142

153

  1. Is a one-sided or two sided test needed here?
  2. What is the mean SBP for each time period?
  3. What is the standard deviation for each time period?
  4. Which statistical test is appropriate to use on these data?
  5. Carry out the hypothesis test(s) in question in above d. Use α=0.01
  6. Are the means statistically different?
  7. Find the 99% confidence interval for the difference of the two means on SBP. Interpret your finding                                                                                     

In: Statistics and Probability

Media periodically discuss the issue of heights of winning presidential candidates and heights of their main...

Media periodically discuss the issue of heights of winning presidential candidates and heights of their main opponents. The accompanying table lists the heights​ (cm) from several recent presidential elections. Construct a​ scatterplot, find the value of the linear correlation coefficient​ r, and find the​ P-value of r. Determine whether there is sufficient evidence to support a claim of linear correlation between the two variables. Should we expect that there would be a​ correlation? Use a significance level of alphaequals0.01

PRESIDENT: 180 177 194 170 181 178 195 178 172 187 190 187 178 194

OPPONENT: 180 176 184 171 178 184 176 183 180 181 168 182 189 175

1. The linear correlation coefficient r is __________. ​(Round to three decimal places as​ needed.)

2. Determine the null and alternative hypotheses. (Type integers or decimals. Do not​ round.)

H 0​: p ______ _______

H1: p ______ _______

3. The​ P-value is ________. ​(Round to three decimal places as​ needed.)

4. Because the​ P-value of the linear correlation coefficient is ▼ less than or equal to, greater than the significance​ level, there ▼ is, is not sufficient evidence to support the claim that there is a linear correlation between the heights of winning presidential candiates and the heights of their opponents.

5. Should we expect that there would be a​ correlation?

A. ​No, because presidential candidates are nominated for reasons other than height.

B. ​No, because height is the main reason presidential candidates are nominated.

C. ​Yes, because presidential candidates are nominated for reasons other than height.

D. ​Yes, because height is the main reason presidential candidates are nominated. Click to select your answer(s) and then click Check Answer.

In: Statistics and Probability

You wish to test the following claim (HaHa) at a significance level of α=0.001 For the...

You wish to test the following claim (HaHa) at a significance level of α=0.001 For the context of this problem, μd=μ2-μ1 where the first data set represents a pre-test and the second data set represents a post-test.

      Ho:μd=0
      Ha:μd>0

You believe the population of difference scores is normally distributed, but you do not know the standard deviation. You obtain pre-test and post-test samples for n=16 subjects. The average difference (post - pre) is ¯d=19.3 with a standard deviation of the differences of sd=28.6

What is the test statistic for this sample? (Report answer accurate to three decimal places.)
test statistic =

What is the p-value for this sample? (Report answer accurate to four decimal places.)
p-value =

The p-value is...

  • less than (or equal to) αα
  • greater than αα



This test statistic leads to a decision to...

  • reject the null
  • accept the null
  • fail to reject the null



As such, the final conclusion is that...

  • There is sufficient evidence to warrant rejection of the claim that the mean difference of post-test from pre-test is greater than 0.
  • There is not sufficient evidence to warrant rejection of the claim that the mean difference of post-test from pre-test is greater than 0.
  • The sample data support the claim that the mean difference of post-test from pre-test is greater than 0.
  • There is not sufficient sample evidence to support the claim that the mean difference of post-test from pre-test is greater than 0.

In: Statistics and Probability

1. The results of a recent poll on the preference of shoppers regarding two products are...

1. The results of a recent poll on the preference of shoppers regarding two products are shown below.

Product

Shoppers Surveyed

Shoppers Favoring
This Product

A

800

560

B

900

612

At 95% confidence, the margin of error is

a.

.0225.

b.

.044.

c.

.025.

d.

.064.

2. The following information was obtained from independent random samples taken of two populations.
Assume normally distributed populations with equal variances.

Sample 1

Sample 2

Sample Mean

45

42

Sample Variance

85

90

Sample Size

10

12

The degrees of freedom for the t distribution are

a.

24.

b.

20.

c.

21.

d.

22.

3. ​ For a sample size of 30, changing from using the standard normal distribution to using the t distribution in a hypothesis test,

a.

​will result in the area corresponding to the critical value being larger.

b.

​will have no effect on the area corresponding to the critical value.

c.

​will result in the area corresponding to the critical value being smaller.

d.

​Not enough information is given to answer this question.

In: Statistics and Probability

Find the range for the population mean value with 95% and 65% confidence intervals for each...

Find the range for the population mean value with 95% and 65% confidence intervals for each set of data. mean1=3.611, Standard Deviation1=0.02c m, n=24, Mean2=3.632, Standard Deviation2=0.06 cm, n2=17

In: Statistics and Probability

Use the data and Excel to answer this question. It contains the United States Census Bureau’s...

Use the data and Excel to answer this question. It contains the United States Census Bureau’s estimates for World Population from 1950 to 2014. You will find a column of dates and a column of data on the World Population for these years. Generate the time variable t. Then run a regression with the Population data as a dependent variable and time as the dependent variable. Have Excel report the residuals.

(a) Based on the ANOVA table and t-statistics, does the regression appear significant?

(b) Calculate the Durbin-Watson Test statistic. Is there a serial correlation problem with the data? Explain.

(d) What affect might your answer in part (b) have on your conclusions in part (a)?

Year Population
1950 2,557,628,654
1951 2,594,939,877
1952 2,636,772,306
1953 2,682,053,389
1954 2,730,228,104
1955 2,782,098,943
1956 2,835,299,673
1957 2,891,349,717
1958 2,948,137,248
1959 3,000,716,593
1960 3,043,001,508
1961 3,083,966,929
1962 3,140,093,217
1963 3,209,827,882
1964 3,281,201,306
1965 3,350,425,793
1966 3,420,677,923
1967 3,490,333,715
1968 3,562,313,822
1969 3,637,159,050
1970 3,712,697,742
1971 3,790,326,948
1972 3,866,568,653
1973 3,942,096,442
1974 4,016,608,813
1975 4,089,083,233
1976 4,160,185,010
1977 4,232,084,578
1978 4,304,105,753
1979 4,379,013,942
1980 4,451,362,735
1981 4,534,410,125
1982 4,614,566,561
1983 4,695,736,743
1984 4,774,569,391
1985 4,856,462,699
1986 4,940,571,232
1987 5,027,200,492
1988 5,114,557,167
1989 5,201,440,110
1990 5,288,955,934
1991 5,371,585,922
1992 5,456,136,278
1993 5,538,268,316
1994 5,618,682,132
1995 5,699,202,985
1996 5,779,440,593
1997 5,857,972,543
1998 5,935,213,248
1999 6,012,074,922
2000 6,088,571,383
2001 6,165,219,247
2002 6,242,016,348
2003 6,318,590,956
2004 6,395,699,509
2005 6,473,044,732
2006 6,551,263,534
2007 6,629,913,759
2008 6,709,049,780
2009 6,788,214,394
2010 6,858,584,755
2011 6,935,999,491
2012 7,013,871,313
2013 7,092,128,094
2014 7,169,968,185

Can you please give detailed steps to do on excel?

In: Statistics and Probability

Write an accurate concluding statement for the following hypothesis tests. (a) You claim that the mean...

Write an accurate concluding statement for the following hypothesis tests.

(a) You claim that the mean volume of all 12 ounce cans of Fizzy Pop is less than 12 ounces. After analyzing the data and performing a hypothesis test, you fail to reject the null hypothesis.

(b) Fizzy Pop claims that most 12 ounce cans of Fizzy Pop contain more than 12 ounces. After analyzing the data and performing a hypothesis test, you reject the null hypothesis.

(c) You claim that the average speed of cars going down a certain stretch of highway is 72 mph. After analyzing the data and performing a hypothesis test, you fail to reject the null hypothesis.

In: Statistics and Probability

Could you please explain how you would find(estimate, not calculate) the 95% confidence interval for a...

Could you please explain how you would find(estimate, not calculate) the 95% confidence interval for a set of categorical data. example: 30 students pick what their favorite food is

pizza=13

hamburgers=7

fries=5

icecream=5

In: Statistics and Probability

Account Balance Income Years of Education Size of Household 8976 63 12 2 8308 37 14...

Account Balance Income Years of Education Size of Household
8976 63 12 2
8308 37 14 2
10028 52 16 2
11256 64 15 4
9869 47 17 2
10194 74 15 2
8706 49 12 2
9557 58 14 2
10565 70 16 3
9434 69 11 3
9687 25 18 3
9490 57 15 1
8806 46 14 3
9561 48 16 2
11757 80 15 3
9406 66 14 2
11150 46 15 3
7671 28 12 2
8803 53 13 1
9571 52 15 2
9566 77 12 3
7885 32 14 3
9773 55 11 1
9121 52 15 2
9298 43 14 3
10285 65 15 2
7801 38 12 1
9323 52 14 2
8643 36 16 3
12466 85 15 2
9447 64 14 2
10727 86 15 2
9243 57 15 3
9311 68 12 2
11033 74 14 3
11721 82 16 2
8727 24 15 3
8438 37 15 3
8317 55 12 2
8617 50 14 1
9052 39 16 3
10889 73 15 3
7766 26 14 1
9189 47 15 2
  1. Use data in BUSI1013 Credit Card Balance.xlsx to complete the following. You will need to use a statistical package such as StatTools or the Regression program within Excel’s Data Analysis Add-in to generate the estimated regression equation and the ANOVA etc. (12 points)
    1. What is the estimated regression equation using Account Balance as the dependent variable, and Income, Years of Education, as well as Size of Household as the independent variable?
    2. Comment on the goodness of fit of the model using the coefficient of determination.
    3. Conduct an F test with the critical value approach to see whether the overall model is significant. Use α = 0.01.
    4. Perform a t test with p-value approach for the significance of the Income variable. Use α = 0.05.
    5. Perform a t test with the p-value approach for the significance of the Size of Household variable. Use α = 0.05.
    6. Estimate the Account Balance of a customer who has an income of $62 thousand, 14 years of education, and a household size of 3.

In: Statistics and Probability

How many distinct arrangements can be formed from all the letters of ʺstudentsʺ?

How many distinct arrangements can be formed from all the letters of ʺstudentsʺ?

In: Statistics and Probability

Account Balance Income Years of Education Size of Household 8976 63 12 2 8308 37 14...

Account Balance Income Years of Education Size of Household
8976 63 12 2
8308 37 14 2
10028 52 16 2
11256 64 15 4
9869 47 17 2
10194 74 15 2
8706 49 12 2
9557 58 14 2
10565 70 16 3
9434 69 11 3
9687 25 18 3
9490 57 15 1
8806 46 14 3
9561 48 16 2
11757 80 15 3
9406 66 14 2
11150 46 15 3
7671 28 12 2
8803 53 13 1
9571 52 15 2
9566 77 12 3
7885 32 14 3
9773 55 11 1
9121 52 15 2
9298 43 14 3
10285 65 15 2
7801 38 12 1
9323 52 14 2
8643 36 16 3
12466 85 15 2
9447 64 14 2
10727 86 15 2
9243 57 15 3
9311 68 12 2
11033 74 14 3
11721 82 16 2
8727 24 15 3
8438 37 15 3
8317 55 12 2
8617 50 14 1
9052 39 16 3
10889 73 15 3
7766 26 14 1
9189 47 15 2
  1. The data in BUSI1013 Credit Card Balance.xlsx is collected for building a regression model to predict credit card balance of retail banking customers in a Canadian bank. Use this data to perform a simple regression analysis between Account balance and Income (in thousands). (12 points)
    1. Develop a scatter diagram using Account Balance as the dependent variable y and Income as the independent variable x.
    2. Develop the estimated regression equation.
    3. Use the estimated regression equation to predict the Account Balance of a customer with Income of $58 thousands.
    4. Use the critical-value approach to perform an F test for the significance of the linear relationship between account balance and Income at the 0.05 level of significance.
    5. What percentage of the variability of Account Balance can be explained by its linear relationship with Income?
    6. Use the p-value approach to perform a t test for the significance of the linear relationship between Account Balance and Income at the 0.05 level of significance.

In: Statistics and Probability

1 The monthly sales of mufflers follow the normal distribution with a mean of 1250 and...

1 The monthly sales of mufflers follow the normal distribution with a mean of 1250 and a standard deviation of 255. The manufacturer would like to establish inventory levels such that there is only a 2% chance of running out of stock. Refer to the table in Appendix B.1.  

Where should the manufacturer set the inventory levels?

part B

1 The manufacturer of a laser printer reports the mean number of pages a cartridge will print before it needs replacing is 11,300. The distribution of pages printed per cartridge closely follows the normal probability distribution and the standard deviation is 680 pages. The manufacturer wants to provide guidelines to potential customers as to how long they can expect a cartridge to last. Refer to the table in Appendix B.1.  

How many pages should the manufacturer advertise for each cartridge if it wants to be correct 95 percent of the time? (Round z-value to 2 decimal places and the final answer to the nearest whole number.)

Pages           

In: Statistics and Probability

Test whether p 1 greater than p 2. The sample data are x 1 equals 124​,...

Test whether p 1 greater than p 2. The sample data are x 1 equals 124​, n 1 equals 259​, x 2 equals 139​, and n 2 equals 305. Determine test statistic and P value

In: Statistics and Probability

According to a government energy agency, the mean monthly household electricity bill in the United States...

According to a government energy agency, the mean monthly household electricity bill in the United States in 2011 was $109.54. Assume the amounts are normally distributed with standard deviation $25.00. Use the TI-84 Plus calculator to answer the following.

(a) What proportion of bills are greater than $132?

(b) What proportion of bills are between $90 and $145?

(c) What is the probability that a randomly selected household had a monthly bill less than

$129? Round the answers to at least four decimal places.

In: Statistics and Probability