Compare ‘all data’ of Maradi and Zinder using a histogram. Do both regions have the same distribution? If they are different, how are they different? Explain your answer using modality and skewness.Can you answer using R.
GW Maradi
XCoor |
YCoor |
Water Depth (m) |
7.009722 |
14.63611 |
44.19 |
8.110833 |
13.83306 |
38.12 |
7.674445 |
13.40667 |
83.29 |
7.686389 |
13.36861 |
90.58 |
7.705555 |
13.38583 |
73.65 |
7.723889 |
13.40278 |
59.33 |
7.724444 |
13.39583 |
55.87 |
7.741667 |
13.50806 |
46.87 |
7.509444 |
13.84056 |
63.4 |
7.617778 |
13.34333 |
87.3 |
7.628056 |
13.39444 |
61.39 |
7.694167 |
13.31083 |
65.67 |
7.710556 |
13.32056 |
69.35 |
7.755556 |
13.47111 |
50.95 |
7.842778 |
13.50667 |
49.63 |
7.566667 |
13.35333 |
56.59 |
7.588611 |
13.36778 |
67.65 |
7.59 |
13.39972 |
58.43 |
7.592778 |
13.30917 |
65.43 |
7.683889 |
13.27306 |
66.05 |
7.459167 |
13.50667 |
46.95 |
7.460278 |
13.44361 |
40.3 |
7.467778 |
13.41583 |
70.15 |
7.503889 |
13.40056 |
70.72 |
7.509722 |
13.44306 |
35.73 |
7.521667 |
13.44583 |
37.3 |
7.537222 |
13.43917 |
42.55 |
7.540555 |
13.41833 |
43.95 |
7.540833 |
13.4075 |
46.83 |
7.423056 |
13.39917 |
55.47 |
7.443056 |
13.38917 |
60.44 |
GW ZInder
XCoor |
YCoor |
GW Depth (m) |
8.84 |
14.15361 |
52.75 |
8.898611 |
14.15889 |
53.7 |
8.942778 |
14.16833 |
54 |
8.876389 |
14.19417 |
57.85 |
8.907222 |
14.09028 |
61.4 |
8.9225 |
14.11444 |
61.75 |
8.898611 |
14.15889 |
54 |
8.942778 |
14.16833 |
55.3 |
8.84 |
14.15361 |
57.7 |
8.876389 |
14.19417 |
57.9 |
8.9225 |
14.11444 |
59.25 |
8.907222 |
14.09028 |
61.5 |
8.84 |
14.15361 |
52.3 |
8.898611 |
14.15889 |
54.9 |
8.876389 |
14.19417 |
56.9 |
8.942778 |
14.16833 |
56.9 |
8.9225 |
14.11444 |
58.3 |
8.907222 |
14.09028 |
60.8 |
In: Statistics and Probability
Please Answer both parts of the question below
Answer the flowing questions that summarize the chapter.
Part A-
How do confidence intervals and significance tests relate?
Part B-
What are the conditions (or assumptions) that must be checked when performing a significance test for the slope of a regression line?
In: Statistics and Probability
The state of California has a mean annual rainfall of 23 inches, whereas the state of New York has a mean annual rainfall of 53 inches ( Current Results website, October 27,2012). Assume that the standard deviation for both states is 2 inches. A sample of 30 years of rainfall for California and a sample of 45 years of rainfall for New York has been taken.
a. Show the probability distribution of the sample mean annual rainfall for California.
b. What is the probability that the sample mean is 1 inch of the population mean for California?
c. What is the probability that the sample mean is within 1 inch of the population mean for New York?
In: Statistics and Probability
Thirty-three small communities in Connecticut (population near 10,000 each) gave an average of x = 138.5 reported cases of larceny per year. Assume that σ is known to be 43.9 cases per year.
(a) Find a 90% confidence interval for the population mean annual number of reported larceny cases in such communities. What is the margin of error? (Round your answers to one decimal place.)
lower limit | |
upper limit | |
margin of error |
(b) Find a 95% confidence interval for the population mean annual
number of reported larceny cases in such communities. What is the
margin of error? (Round your answers to one decimal place.)
lower limit | |
upper limit | |
margin of error |
(c) Find a 99% confidence interval for the population mean annual
number of reported larceny cases in such communities. What is the
margin of error? (Round your answers to one decimal place.)
lower limit | |
upper limit | |
margin of error |
(d) Compare the margins of error for parts (a) through (c). As the
confidence levels increase, do the margins of error increase?
As the confidence level increases, the margin of error decreases.As the confidence level increases, the margin of error remains the same. As the confidence level increases, the margin of error increases.
(e) Compare the lengths of the confidence intervals for parts (a)
through (c). As the confidence levels increase, do the confidence
intervals increase in length?
As the confidence level increases, the confidence interval decreases in length.As the confidence level increases, the confidence interval increases in length. As the confidence level increases, the confidence interval remains the same length.
How much does a sleeping bag cost? Let's say you want a sleeping bag that should keep you warm in temperatures from 20°F to 45°F. A random sample of prices ($) for sleeping bags in this temperature range is given below. Assume that the population of x values has an approximately normal distribution.
70 | 55 | 105 | 105 | 100 | 90 | 30 | 23 | 100 | 110 |
105 | 95 | 105 | 60 | 110 | 120 | 95 | 90 | 60 | 70 |
(a) Use a calculator with mean and sample standard deviation keys to find the sample mean price x and sample standard deviation s. (Round your answers to two decimal places.)
x = | $ |
s = | $ |
(b) Using the given data as representative of the population of
prices of all summer sleeping bags, find a 90% confidence interval
for the mean price μ of all summer sleeping bags. (Round
your answers to two decimal places.)
lower limit | $ |
upper limit | $ |
Do you want to own your own candy store? Wow! With some interest in running your own business and a decent credit rating, you can probably get a bank loan on startup costs for franchises such as Candy Express, The Fudge Company, Karmel Corn, and Rocky Mountain Chocolate Factory. Startup costs (in thousands of dollars) for a random sample of candy stores are given below. Assume that the population of x values has an approximately normal distribution.
98 | 170 | 128 | 97 | 75 | 94 | 116 | 100 | 85 |
(a) Use a calculator with mean and sample standard deviation keys to find the sample mean startup cost x and sample standard deviation s. (Round your answers to one decimal place.)
x = | thousand dollars |
s = | thousand dollars |
(b) Find a 90% confidence interval for the population average
startup costs μ for candy store franchises. (Round your
answers to one decimal place.)
lower limit | thousand dollars |
upper limit | thousand dollars |
In: Statistics and Probability
Need it in Excel format in detail.
Quality Associates, Inc., a consulting firm, advises its clients about sampling and statistical procedures that can be used to control their manufacturing processes. In one particular application, a client gave Quality Associates a sample of 800 observations taken during a time in which that client’s process was operating satisfactorily. The sample standard deviation for these data was .21; hence, with so much data, the population standard deviation was assumed to be .21. Quality Associates then suggested that random samples of size 30 be taken periodically to monitor the process on an ongoing basis. By analyzing the new samples, the client could quickly learn whether the process was operating satisfactorily. When the process was not operating satisfactorily, corrective action could be taken to eliminate the problem. The design specification indicated the mean for the process should be 12. The hypothesis test suggested by Quality Associates follows.
Corrective action will be taken any time is rejected.
The following samples were collected at hourly intervals during the first day of operation of the new statistical process control procedure. These data are available in the data set Quality.
Sample 1 |
Sample 2 |
Sample 3 |
Sample 4 |
---|---|---|---|
11.55 |
11.62 |
11.91 |
12.02 |
11.62 |
11.69 |
11.36 |
12.02 |
11.52 |
11.59 |
11.75 |
12.05 |
11.75 |
11.82 |
11.95 |
12.18 |
11.90 |
11.97 |
12.14 |
12.11 |
11.64 |
11.71 |
11.72 |
12.07 |
11.80 |
11.87 |
11.61 |
12.05 |
12.03 |
12.10 |
11.85 |
11.64 |
11.94 |
12.01 |
12.16 |
12.39 |
11.92 |
11.99 |
11.91 |
11.65 |
12.13 |
12.20 |
12.12 |
12.11 |
12.09 |
12.16 |
11.61 |
11.90 |
11.93 |
12.00 |
12.21 |
12.22 |
12.21 |
12.28 |
11.56 |
11.88 |
12.32 |
12.39 |
11.95 |
12.03 |
11.93 |
12.00 |
12.01 |
12.35 |
11.85 |
11.92 |
12.06 |
12.09 |
11.76 |
11.83 |
11.76 |
11.77 |
12.16 |
12.23 |
11.82 |
12.20 |
11.77 |
11.84 |
12.12 |
11.79 |
12.00 |
12.07 |
11.60 |
12.30 |
12.04 |
12.11 |
11.95 |
12.27 |
11.98 |
12.05 |
11.96 |
12.29 |
12.30 |
12.37 |
12.22 |
12.47 |
12.18 |
12.25 |
11.75 |
12.03 |
11.97 |
12.04 |
11.96 |
12.17 |
12.17 |
12.24 |
11.95 |
11.94 |
11.85 |
11.92 |
11.89 |
11.97 |
12.30 |
12.37 |
11.88 |
12.23 |
12.15 |
12.22 |
11.93 |
12.25 |
Managerial Report
Conduct a hypothesis test for each sample at the .01 level of significance and determine what action, if any, should be taken. Provide the test statistic and p-value for each test.
Compute the standard deviation for each of the four samples. Does the assumption of .21 for the population standard deviation appear reasonable?
Compute limits for the sample mean around such that, as long as a new sample mean is within those limits, the process will be considered to be operating satisfactorily. If exceeds the upper limit or if is below the lower limit, corrective action will be taken. These limits are referred to as upper and lower control limits for quality control purposes.
Discuss the implications of changing the level of significance to a larger value. What mistake or error could increase if the level of significance is increased?
In: Statistics and Probability
An economist wants to estimate the variance of employee test scores. A random sample of 38 scores had a sample standard deviation of 10.4. Find a 95% confidence interval for the population variance.
In: Statistics and Probability
Data collected over a long period of time showed that 1 in 1000 high school students like mathematics. A random sample of 30,000 high school students was surveyed. Let X be the number of students in the sample who like mathematics
a) What is the probability distribution of X?
b) What distribution can be used to approximate the distribution of X? Explain.
c) Find the approximate probability of observing a value of X equal to 40 or more?
d) Find the approximate probability of observing a value of X between 35 and 40 inclusive ?
In: Statistics and Probability
Suppose that the probability that a passenger will miss a flight is 0.09420. Airlines do not like flights with empty seats, but it is also not desirable to have overbooked flights because passengers must be "bumped" from the flight. Suppose that an airplane has a seating capacity of 58 passengers.
(a) If 60 tickets are sold, what is the probability that 59 or 60 passengers show up for the flight resulting in an overbooked flight?
(b) Suppose that 64 tickets are sold. What is the probability that a passenger will have to be "bumped"?
(c) For a plane with a seating capacity of 53 passengers, how many tickets may be sold to keep the probability of a passenger being "bumped" below 5%?
In: Statistics and Probability
Jobs for the homeless! A philanthropic foundation bought a used school bus that stops at homeless shelters early every weekday morning. The bus picks up people looking for temporary, unskilled day jobs. The bus delivers these people to a work center and later picks them up after work. The bus can hold 139 people, and it fills up every morning. Not everyone finds work, so at 11 A.M. the bus goes to a soup kitchen where those not finding work that day volunteer their time. Let us view each person on the bus looking for work as a binomial trial. Success means he or she got a day job. The random variable r represents the number who got jobs. The foundation requested a P-Chart for the success ratios. For the past 3 weeks, we have the following data.
Day | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 |
r | 60 | 53 | 61 | 66 | 67 | 55 | 53 | 58 |
p̂ = r / 139 | 0.43 | 0.38 | 0.44 | 0.47 | 0.48 | 0.40 | 0.38 | 0.42 |
Day | 9 | 10 | 11 | 12 | 13 | 14 | 15 |
r | 60 | 52 | 46 | 52 | 61 | 70 | 58 |
p̂ = r / 139 | 0.43 | 0.37 | 0.33 | 0.37 | 0.44 | 0.50 | 0.42 |
Make a P-Chart. (Use 4 decimal places.)
Center line | = |
–2.0 SL | = |
2.0 SL | = |
–3.0 SL | = |
3.0 SL | = |
List any out-of-control signals by type (I, II, or III). (Select all that apply.)
Out-of-control signal I occurs on day 11.Out-of-control signal I occurs on day 14.Out-of-control signal III occurs on days 4 and 5.Out-of-control signal III occurs on days 14 and 15.There are no out-of-control signals.
Interpret the results.
In: Statistics and Probability
Policy Holder # | Life Expectancy at 65 |
1 | 20.4 |
2 | 22.2 |
3 | 17.6 |
4 | 27.2 |
5 | 24.5 |
6 | 20.3 |
7 | 21.3 |
8 | 22.5 |
9 | 26.7 |
10 | 18.3 |
11 | 23.5 |
12 | 25.6 |
13 | 22.1 |
14 | 24.2 |
15 | 15.4 |
16 | 23.4 |
17 | 25.3 |
18 | 18.5 |
19 | 24.2 |
20 | 20.3 |
21 | 26.8 |
22 | 28.1 |
23 | 19.9 |
24 | 25.5 |
25 | 22.3 |
26 | 23.9 |
27 | 31.7 |
28 | 26.0 |
29 | 22.8 |
30 | 23.3 |
31 | 25.9 |
32 | 17.7 |
33 | 19.6 |
34 | 21.8 |
35 | 23.3 |
36 | 21.9 |
37 | 21.9 |
38 | 28.7 |
39 | 19.9 |
40 | 27.8 |
41 | 26.6 |
42 | 21.1 |
43 | 23.3 |
44 | 25.5 |
45 | 23.8 |
46 | 21.4 |
47 | 23.3 |
48 | 23.6 |
49 | 23.1 |
50 | 23.9 |
1. Insurance companies track life expectancy information to assist in determining the cost of life insurance policies. Life expectancy is a statistical measure of average time a person is expected to live, based on a number of demographic factors. Mathematically, life expectancy is the mean number of years of life remaining at a given age, assuming age-specific mortality rates remain at their most recently measured levels. Last year the average life expectancy of all the Life Insurance policyholders in Ontario at age 65 was 22.3 years (meaning that a person reaching 65 last year was expected to live, on average, until 87.3). The insurance company wants to determine if their clients now have a longer average life expectancy, so they randomly sample some of their recently paid policies. The insurance company will only change their premium structure if there is evidence that people who buy their policies are living longer than before. The sample data is provided in the excel file. Answer the following questions. Results should be support by excel output.
a. Construct a 95% and 99% confidence intervals for the true average life expectancy. Use t-distribution and Descriptive Statistics function from Data Analysis. Interpret each Confidence interval and comment on the difference between the 95% and 99% interval.
b. Write the null and alternative hypotheses for this test:
c. In this context, describe a Type I error possible. How might such an error impact Life Insurance company’s decision regarding the premium structure?
d. What is the value of the t-test statistic?
e. What is the associated P-value?
f. State the conclusion using α = 0.05. Do it using both P-value and critical value.
In: Statistics and Probability
A balanced die with six sides is rolled 60 times. For the binomial distubtion of X = Numbers of 6’s, what is n, what is p, and what is q Find the mean and standard deviation of the distribution of X. What do these values tell you? If you observe X = 2, would you be skeptical that the die is balanced? Explained why, based on the mean and standard deviation of . How long would the value of X Have to be in order to prove evidence that the die is unbalanced? Hint a binomial distribution with a high number of trails approximates the normal distribution.
In: Statistics and Probability
Provide one example each of both a Type I and Type II error that
could occur
when running predictive engines relevant to sports betting. Which
is likely to
be more costly and why?
In: Statistics and Probability
1. Music seems to be everywhere in society, and psychologists have investigated its effects on a variety of performances. For example, suppose an experimenter created three equivalent groups of eight people each and asked each person to perform a proofreading task on a short research paper. Subjects in group A1 performed the task with no music playing in the background, subjects in group A2 performed the task with a selection of oldies playing in the background, and subjects in groups A3 performed the task while listening to hard rock. The dependent variable was the errors detected out of a possible 50. Suppose the following scores were obtained:
No Music (A1) |
Oldies (A2) |
Hard Rock (A3) |
40 |
34 |
26 |
41 |
39 |
24 |
39 |
38 |
19 |
36 |
40 |
23 |
35 |
34 |
18 |
32 |
35 |
21 |
31 |
29 |
23 |
34 |
36 |
29 |
1.a.Analyze the scores with a one-factor between-subjects ANOVA to answer the question: Did the music condition affect the number of errors detected by a subject? Use α= .05. If needed, use the Tukey HSD test for multiple (pairwise) comparisons.
1.b.Report the results of this experiment as it would appear in a research paper.
In: Statistics and Probability
In a survey of 865 registered voters in Washington, 408 people
are in favor of making Pi Day an official holiday.
1. Which of the following would be the appropriate format for a
confidence interval for the proportion of Washington voters who are
in favor of making Pi Day an official holiday?
2. Using the format you selected above, find the 95% confidence
interval for Pi Day support. (Round to four decimal places.)
a=? b=?
In: Statistics and Probability
The cholesterol level in children aged 10-15 is assumed to follow a normal distribution with a mean of 175 and a standard deviation of 20.
What is the probability of selecting a child at 10-15 years with cholesterol level higher than 190?
What is the probability of selecting a child at 10-15 years with cholesterol level between 165 and 190?
What is the interval of cholesterol level of a child 10-15 years higher than 75% and lower than top 10% of children?
In: Statistics and Probability