Questions
Engineers concerned about a tower's stability have done extensive studies of its increasing tilt. Measurements of...

Engineers concerned about a tower's stability have done extensive studies of its increasing tilt. Measurements of the lean of the tower over time provide much useful information. The following table gives measurements for the years 1975 to 1987. The variable "lean" represents the difference between where a point on the tower would be if the tower were straight and where it actually is. The data are coded as tenths of a millimeter in excess of 2.9 meters, so that the 1975 lean, which was 2.9642 meters, appears in the table as 642. Only the last two digits of the year were entered into the computer.

Year 75 76 77 78 79 80 81 82 83 84 85 86 87
Lean 642 645 656 667 674 689 696 698 714 717 726 743 758

(a) Plot the data. Consider whether or not the trend in lean over time appears to be linear. (Do this on paper. Your instructor may ask you to turn in this graph.)

(b) What is the equation of the least-squares line? (Round your answers to three decimal places.)
y =  +  x

What percent of the variation in lean is explained by this line? (Round your answer to one decimal place.)
%

(c) Give a 99% confidence interval for the average rate of change (tenths of a millimeter per year) of the lean. (Round your answers to two decimal places.)
(  ,  )

In: Statistics and Probability

Serum glutamic oxaloacetic transaminase (SGOT) is an enzyme that elevates when the heart muscle is damaged....

Serum glutamic oxaloacetic transaminase (SGOT) is an enzyme that elevates when the heart muscle is damaged. Assume that the population distribution of SGOT levels in adults with healthy hearts is known to be normally distributed with ? = 18.5 units/L and ? = 13.2 units/L.

a. What proportion of adults with a healthy heart have an SGOT level of above 25 units/L? (0.5pts)

b. For a sample size of ? = 40 from this population, what would be the sampling distribution of the mean SGOT level? Include the type of distribution and the values of the defining parameters. (0.5pts)

c. Now suppose you collected a sample of ? = 40 adults who recently underwent heart surgery. If the sample mean SGOT level you got was above 25 units/L, do you think that the SGOT level of people who have recently undergone heart surgery actually has a distribution with ? = 18.5 units/L? Why or why not?

For each of the following scenarios, state the appropriate null hypothesis and indicate whether the result corresponds to a Type-I error, a Type-II error, or no error. Use α=0.05. (1pt each)

a. A new method of relaxation training, which includes mediation and biofeedback, is reported to be successful at reducing high blood pressure, even though it doesn’t actually have any effect on blood pressure.

b. A study that tested if there was any relationship between a person’s heart rate and consuming a small amount of caffeine resulted in a p-value of 0.21. In truth, there is no relationship between heart rate and caffeine in this population.

c. Park officials tested if new design of bear-proof trash bins is more successful at keeping bears out than the bins they currently use. Fact is that the new design is actually more successful than the current one. Their test resulted in a p-value of 0.08.

In: Statistics and Probability

Suppose a coin is tossed 100 times and the number of heads are recorded. We want...

Suppose a coin is tossed 100 times and the number of heads are recorded. We want to test whether the coin is fair. Again, a coin is called fair if there is a fifty-fifty chance that the outcome is a head or a tail. We reject the null hypothesis if the number of heads is larger than 55 or smaller than 45.

Write your H_0 and H_A in terms of the probability of heads, say p.

Find the Type I error rate if the null hypothesis is true.

Calculate the power of the test if the true chance of head is 0.4

Find the p-value if the observed number of heads is 65.

In: Statistics and Probability

•Example: A manager of a liquor store believes that the proportions of customers preferring CURES Beer,...

Example: A manager of a liquor store believes that the proportions of customers preferring CURES Beer, HAMMERKILL Beer, and MULER Beer are 40%, 30% and 30% respectively. The following results were obtained from a random sample of 200 customers:

Cures Hammerkill Muler
#choosing this brand 70 63 67

•a) What type of data do we have here?

•b) Do the data provide sufficient evidence to indicate that the distribution of customer preferences is significantly different from the manager’s belief? Use a 1% level of significance.

In: Statistics and Probability

Question 4 Researchers studied four different blends of gasoline to determine their effect on miles per...

Question 4 Researchers studied four different blends of gasoline to determine their effect on miles per gallon (MPG) of a car. An experiment was conducted with a total of 28 cars of the same type, model, and engine size, with 7 cars randomly assigned to each treatment group. The gasoline blends are referred to as A,B,C, and D.The MPGs are shown below in the table Gasoline Miles Per Blend Gallon A 26 28 29 23 24 25 26 B 27 29 31 32 25 24 28 C 29 31 32 34 24 28 27 D 30 31 37 38 36 35 29 We want to test the null hypothesis that the four treatment groups have the same mean MPG vs. the alternative hypothesis that not all of the means are equal. a) Before carrying out the analysis, check the validity of any assumptions necessary for the analysis you will be doing. Write a brief statement of your findings b) Test the null hypothesis that the four gasoline blends have the same mean MPGs, i.e., Test Ho: ua=ub=uc=ud vs. the alternative hypothesis Ha: not all the means are equal. c) If your hypothesis test in (b) indicates a significant difference among the treatment groups, conduct pairwise multiple comparison tests on the treatment group means. Underline groups of homogeneous means. d) Briefly state your conclusions. ( Use IBM SPSS for all calculations)

In: Statistics and Probability

3. At Phil & Jims steak Shop in Parkside [best cheese steak on the planet] they...

3. At Phil & Jims steak Shop in Parkside [best cheese steak on the planet] they sell about 750 cheese steak sandwiches per day.
They expect that the mix of sandwiches will be 360 with fried onions only, 165 pizza steask, 120 mushroom steaks & 105 with no cheese.
Last Wednesday they sold 370 with fried onions, 175 pizza steaks, 110 mushroom steaks & 95 with no cheese.  
At a 95% level of confidence, did their observations fit with their expectations?
Ho: fofits fe   (observed frequencies are consistent with those expected)
H1: fo doe not fit fe   (observed frequencies are not consistent with those expected)
χ2 =
χ2.05, 3 =
Ho is accepted/rejected
Conclusion:

In: Statistics and Probability

You are given the returns for the following three stocks: Return Year Stock A Stock B...

You are given the returns for the following three stocks:

Return

Year

Stock A

Stock B

Stock C

1

8%

7%

-22%

2

8%

15%

35%

3

8%

3%

15%

4

8%

12%

3%

5

8%

3%

9%

Calculate the arithmetic return, geometric return, and standard deviation for each stock. Do you notice anything about the relationship between an asset’s arithmetic return, standard deviation, and geometric return? Do you think this relationship will always hold?

In: Statistics and Probability

In Lesson Seven you've learned to convert raw scores to standard scores and used the empirical...

In Lesson Seven you've learned to convert raw scores to standard scores and used the empirical rule to determine probabilities associated with those standard scores.

Random Decimal Fraction Generator

Here are your random numbers:

0.1613
0.8590
0.7911
  1. Use the random decimal fraction generator at Random.org, linked here, to generate a list of three fractions with four decimal places(LIST random decimal fraction generator  IS ON TOP). Assume those decimal fractions represent probability values associated with z-scores. Then use the standard normal table to look up the z-score that is closest to matching with that probability. List them below. (3 points)
    1. probability value (generated fraction) =
    2. associated z-score =
    3. probability value (generated fraction) =
    4. associated z-score =
    5. probability value (generated fraction) =
    6. associated z-score =
  2. Use another random decimal fraction generator at Random.org, linked here, to generate a list of ten two-digit random numbers between 10 and 30. Calculate the z-score of the median of the data set. (3 points)
  3. What does the z-score of the data set median just above tell you about the shape of the distribution? How do you know this? (3 points)
  4. If you were to take repeated random samples of n = 5 from the data set just above, what would be the expected value of the mean of the sampling distribution of sample means? (3 points)
  5. Considering the set of ten two-digit random numbers above as a population, calculate the standard error of the mean for the samples in question 4. (3 points)

In: Statistics and Probability

Outcome Outcome Met/Not Met/In Process Evidence 1.Statistically significant difference between treatment and comparison groups in mathematics...

Outcome

Outcome Met/Not Met/In Process

Evidence

1.Statistically significant difference between treatment and comparison groups in mathematics grades 3–8

T=3.626 P<3.0533536280097256E-4

2.Statistically significant difference between treatment and comparison groups in science grades 4 & 8

T=1.77

P<0.07857488293853984

3a.Statistically significant difference between treatment and comparison groups in math Regents exams

T=-2.315

P<.015

3b.Statistically significant difference between treatment and comparison groups in science Regents exams

T=2.227

P<0.022816361682797652

4.Positive trend data in percentage of students enrolling in secondary math and science courses

Analysis of enrollment data for high school math and science courses reveal an overall increase of 684 students or 12.4% increase in enrollment

My question is based on the chart above : In the comments section following Goal #1, based upon your analysis of the evidence, summarize the findings for Goal #1. My comments should include any suggestions for further research or dat gathering based on questions arising out of these data that I feel still need to be addressed.

The problem is I am not good with this at all. Any help would be greatly appreciated.

In: Statistics and Probability

Expected number of time intervals until the Markov Chain below is in state 2 again after...

Expected number of time intervals until the Markov Chain below is in state 2 again after starting in state 2?

Matrix:

[.4,.2,.4]

[.6,0,.4]

[.2,.5,.3]

In: Statistics and Probability

It is rare that you will find a gas station these days that only sells gas....

It is rare that you will find a gas station these days that only sells gas. It has become more common to find a convenient store that also sells gas. The data named “Convenient Shopping data” the sales over time at a franchise outlet of the major US oil company. Each row summarize sales for one day. This particular station sells gas and has a convenient store and car awash. The column labeled Sales gives the dollar sales of the convenient store and the column Volume gives the number of gallons of gas sold.

Sales (Dollars) Volume (Gallons)
1756 2933
2203 3329
1848 3043
2016 3043
2346 3450
2410 3478
2050 3347
2097 3708
2311 3467
2419 4114
2523 3721
2061 3448
2247 3230
3479 3557
2135 3060
2102 3619
2536 3256
1227 1757
1966 2891
2219 3381
2226 2970
1969 3301
2044 3178
2360 3426
1907 3118
2156 3037
1816 3537
1897 3808
2051 3145
2079 3766
2328 2916
1841 3957
2104 3980
1973 3675
2089 3516
2266 4149
2327 3733
2032 3738
2137 4012
2186 4114
2369 3795
2087 3543
2273 3681
2113 3618
2181 4452
2776 4346
2652 4073
2250 4260
2548 4113
2678 3829
2878 4137
2220 4269
2303 3989
2718 4238
2317 3658
2338 4005
2143 3996
2402 4077
2401 3610
2051 3701
2468 3844
2398 3904
2106 3879
2461 3266
2466 3513
2745 4052
1994 4052
2020 2874
2241 3526
2648 3487
2022 3499
2524 3236
1919 2422
2164 2876
2074 2883
2310 2771
2062 2362
1807 2564
1976 2708
2171 2519
1745 2638
2108 3448
2057 1993
1679 2560
2014 2777
2109 3097
2274 2750
2640 3260
1664 2050
1913 2921
2331 2970
1920 2624
2074 3496
2272 3729
1651 2302
1996 2672
2093 3150
1995 2948
2337 3520
2433 3195
1731 2232
2183 2979
1795 3178
1689 2618
2040 3117
2076 2847
1483 2150
930 1528
1674 2309
1934 2805
2011 2721
2172 2812
1612 2173
1780 2767
2116 2544
1937 2805
1866 2131
2099 3292
2082 2221
1788 2816
2004 2686
1868 3207
2038 2925
2596 3603
1700 2165
1815 3338
1917 3107
2143 2906
2420 3448
2486 3433
1812 2104
2463 3283
2222 3750
2324 3494
2219 3154
2505 3465
2047 2216
2231 3236
2067 3425
2293 3667
2152 3618
1366 2257
2210 3606
2029 3460
2742 2336
2161 3113
2223 3058
2186 2429
2306 3501
1933 3183
2485 3337
2817 3566
2491 3398
1896 2519
2382 3716
2552 3856
2094 3488
2447 3457
2440 3831
2041 2280
2261 2411
2114 3208
2866 3539
2752 3719
2502 4150
1786 2927
2157 3044
2025 3390
2327 3840
2502 3697
2552 4104
2017 3749
2019 3511
2302 3972
2419 3413
2921 3882
2273 3950
2183 3292
2428 3979
2489 4668
2037 3832
2324 3930
2591 3853
2362 4014
3001 4759
1801 2661
1744 4165
2428 4139
2409 3664
2819 3851
1897 2522
1536 1208
2475 3844
2484 3766
2117 3535
2488 3900
2553 3900
2251 3814
2435 3387
2446 4009
2063 1951
2582 3779
1663 2368
2302 3379
2248 3549
2712 3807
2307 4009
2576 3759
1978 2378
2116 4090
2292 3241
2373 3874
2444 4142
2578 3645
1953 2419
2151 3289
2901 3872
2514 4136
2078 3626
2492 4240
1897 2415
2072 3028
2538 3731
2422 3851
2415 3818
2969 4268
1775 2514
2082 3708
2121 3367
2471 3685
2467 3415
2671 4226
1876 2061
1976 3805
2156 3427
2339 3670
2258 3939
2776 3798
2084 2668
2346 3945
2320 3787
2539 3854
2393 3598
2629 3717
2044 2536
2018 401
2350 2361
2452 4005
2041 2391
2038 3129
2181 3874
2516 4072
2181 3603
2427 4173
2111 3993
2182 3153
2794 3812

1) Draw a scatter plot for Sales on Volume where Sales is dependent on Volume of gas sold. Does there appear to be a linear pattern that relates to these two sequences?

2) Estimate the linear regression model using excel analysis tool I showed you in class. Write the linear model and interpret the slope (b1).

3) Interpret the R2 and tell if your linear model is a good fit or not.

4) Estimate the difference in sales at the convenient store (on average) between a day with 3,500 gallons sold and a day with 4,000 gallons sold.

5) With regard to inference statistics, formulate a hypothesis test for the slope (b1) and decide if it is statistically significant or not.

6) Construct a 95% confidence interval for the slope.

In: Statistics and Probability

The excel data are flash durations, in milliseconds, of a sample of 35 male fireflies of...

The excel data are flash durations, in milliseconds, of a sample of 35 male fireflies of the species Photinus(Cratsley and Lewis 2003). You must use excel tools data analysis and descriptive statistics to find the confidence interval. Use the video as a guide to complete this excel project. If you do not see the data analysis, I have steps on how to get the data analysis tool on this document.

  1. Estimate the sample mean flash duration. What does this quantity estimate?
  2. Is the estimate in part(a) likely to equal the population parameter? Why or why not?
  3. Paste the descriptive statistics output from excel
  4. Use the descriptive statistics output to record the standard error for your estimate.
  5. What does the quantity in part(d) measure?
  6. Use excel output to calculate the 95% confidence interval for the population mean.
  7. Provide an interpretation for the interval you calculated in part(f)

PLEASE SHOW ALL WORK WHEN POSSIBLE!

Data:

94
96
95
95
95
96
98
98
98
101
103
106
108
109
112
113
118
116
119

In: Statistics and Probability

A school psychologist wishes to determine whether a new anti-smoking film actually reduces the daily consumption...

A school psychologist wishes to determine whether a new anti-smoking film actually reduces the daily consumption of cigarettes by teenage smokers. The mean daily cigarette consumption is calculated for each of eight teenage smokers during the month before and the month after the film presentation, with the following results: MEAN DAILY CIGARETTE CONSUMPTION

SMOKER NUMBER BEFORE FILM (X1) AFTER FILM (X2)

1 28 26

2 29 27

3 31 32

4 44    44

5 35 35

6 20 16

7 50 47

8 25 23

A) Is there a significant difference in the number of cigarettes smoked before the film as compared to the number of cigarettes smoked after the film?

B) What does this NOT necessarily mean?

C) What might be done to improve the design of this experiment?

In: Statistics and Probability

COMPUTER CALCULATIONS: I need to know how to code in R for the solutions, not by...

COMPUTER CALCULATIONS:

I need to know how to code in R for the solutions, not by hand.

2. Look at the data in Table 7.18 on page 368 of the textbook. These data are also

given in the SAS code labeled “SAS_basketball_goal_data” and R code labeled basketball goal data .

The dependent variable is goals and the independent variable is height of basketball players.

Complete a SAS /R program and answer the following questions about the data set:

(a) Does a scatter plot indicate a linear relationship between the two variables?

Is there anything disconcerting about the scatter plot? Explain.

(b) Fit the least-squares regression line (using SAS / R) and interpret the estimated slope

in the context of this data set. Does it make sense to interpret the estimated intercept? Explain.

(c) For these data, what is the unbiased estimate of the error variance? (Give a number.)

(d) Using the SAS / R output, test the hypothesis that the true slope of the regression line

is zero (as opposed to nonzero). State the appropriate null and alternative hypotheses,

give the value of the test statistic and give the appropriate P-value. (Use significance

level of 0.05.) Explain what this means in terms of the relationship between the two

variables.

(e) Using SAS / R, find a 95% confidence interval for the mean basketball goal for

a player with a height of 77 inches. In addition find a 95% prediction interval for

basketball goal for a player with a height of 77 inches.

Data:

Height Goals
71 15
74 19
70 11
71 15
69 12
73 17
72 15
75 19
72 16
74 18
71 13
72 15
73 17
72 16
71 15
75 20
71 15
75 19
78 22
79 23
72 16
75 20
76 21
74 19
70 13

In: Statistics and Probability

A sample of 1000 customers was selected in Rhode Island to determine various information concerning consumer...

A sample of 1000 customers was selected in Rhode Island to determine various information concerning consumer behavior. Among the questions asked was “Do you enjoy shopping at Shopping Center A?” The results are summarized in the following table.

Enjoy shopping at Shopping Center A

Gender

Total

Male

Female

Yes

272

448

720

No

208

72

280

Total

480

520

1000

  1. Is there evidence of a significant difference between males and females in the proportion who enjoy shopping at Shopping Center A at the 0.01 level of significance?
  2. Construct and interpret a 95% confidence interval estimate for the difference between the proportion of males and females who enjoy shopping at Shopping Center A.

In: Statistics and Probability