Questions
Question 2: Describe the three methods of sample selection and list the advantages and disadvantages of...

Question 2:
Describe the three methods of sample selection and list the advantages and disadvantages of each method

In: Statistics and Probability

Collect data on one response (dependent or y) variable and two different explanatory (independent or x)...

Collect data on one response (dependent or y) variable and two different explanatory (independent or x) variables. This will require a survey with three questions. For example: To predict a student’s GPA (y), you might collect data on two x variables: SAT score and age. So we would be trying to determine if there was a linear correlation between someone’s SAT score and their GPA, as well as their age and their GPA. (Note: students may not choose GPA as their dependent variable, must pick a different topic.)

• This data must be quantitative, not qualitative.

• Collect data from at least 15 people. Each person must answer all three questions for their data to count.

• Prepare a brief report that shares the questions used, as to why they are important to be studied.

• Present data in table form and as a scatter plot. You can create your tables and graphs in Excel, but they will need to be copy and pasted into your Word document. Do NOT submit an Excel file as it will not be graded.

• Model the data with two linear regressions (one for each x & y pair.)

• Interpret each linear model.

• Use each of your models to make a prediction.

In: Statistics and Probability

Solve the question below and show all steps: In a random sample of 28 female students...

Solve the question below and show all steps: In a random sample of 28 female students at a college, it was discovered that each of them possessed multiple hand bags. The number of hand bags possessed by each of the 28 female students is listed as follows: 4 4 3 3 3 6 4 2 2 2 1 3 3 3 3 4 4 3 2 8 2 2 3 4 3 3 4 2

a. Find the mean and standard deviation for the sample data set.

b. Form the interval, ?̅±2s

c. According to Chebychev’s rule, what proportion of sample observations will fall within the interval in part b?

d. According to the Empirical Rule, what proportion of sample observations will fall within the interval in part b?

e. Determine the actual proportion of sample observations that fall within the interval in part b. Does the Empirical Rule provide a good estimate of the proportion?

In: Statistics and Probability

A consumer preference study compares the effects of three different bottle designs (A, B, and C)...


A consumer preference study compares the effects of three different bottle designs (A, B, and C) on sales of a popular fabric softener. A completely randomized design is employed. Specifically, 15 supermarkets of equal sales potential are selected, and 5 of these supermarkets are randomly assigned to each bottle design. The number of bottles sold in 24 hours at each supermarket is recorded. The data obtained are displayed in the following table.

Bottle Design Study Data
A B C
17 34 21
16 30 21
13 34 28
13 30 26
16 34 25

  

The Excel output of a one-way ANOVA of the Bottle Design Study Data is shown below.

SUMMARY
Groups Count Sum Average Variance
Design A 5 75 15.0 3.5
Design B 5 162 32.4 4.8
Design C 5 121 24.2 9.7
ANOVA
Source of Variation SS df MS F P-Value F crit
Between Groups 757.7333 2 378.8667 63.14 3.23E-06 3.88529
Within Groups 72.0 12.0 6.0000
Total 829.7333 14

(a) Test the null hypothesis that μA, μB, and μC are equal by setting α = .05. Based on this test, can we conclude that bottle designs A, B, and C have different effects on mean daily sales? (Round your answers to 2 decimal places. Leave no cells blank - be certain to enter "0" wherever required.)

F
p-value

(Click to select)Do not rejectReject H0: bottle design (Click to select)doesdoes not have an impact on sales.

(b) Consider the pairwise differences μBμA, μCμA , and μCμB. Find a point estimate of and a Tukey simultaneous 95 percent confidence interval for each pairwise difference. Interpret the results in practical terms. Which bottle design maximizes mean daily sales? (Round your answers to 2 decimal places. Negative amounts should be indicated by a minus sign.)

Point estimate Confidence interval
μBμA: , [, ]
μCμA: , [, ]
μCμB: , [, ]

Bottle design (Click to select)ACB maximizes sales.

(c) Find a 95 percent confidence interval for each of the treatment means μA, μB, and μC. Interpret these intervals. (Round your answers to 2 decimal places. Negative amounts should be indicated by a minus sign.)

Confidence interval
μA: [, ]
μB: [, ]
μC: [, ]

In: Statistics and Probability

Create a snowflake schema diagram (you do not have to mark every possible level) and include...

Create a snowflake schema diagram (you do not have to mark every possible level) and include your implicit assumptions for each level of the dimension.

Starting with the base cuboid [customer, date, product, store], what are the specific OLAP operations (e.g., roll-up student to the department (level)) that one should perform in order to list the average sales of each cosmetic product since January 2005?

In: Statistics and Probability

. IQ is normally distributed with a mean of 100 and a standard deviation of 15....

. IQ is normally distributed with a mean of 100 and a standard deviation of 15. Suppose an individual is randomly chosen. a) (3pt) Find the probability that the person has an IQ greater than 125. b) (4pt) Find the probability that the person has an IQ score between 105 and 118. c) (4pt) What is the IQ score of a person whose percentile rank is at the 75th percentile, ?75? d) (3pt) Use the information from part (c) to fill in the blanks and circle the correct choice in the following statement. ________% of the individuals (persons) have IQ score less than/more than __________ e) (4pt) “MENSA” is an organization whose members have the top 2% of all IQs. Find the minimum IQ needed to qualify for the “MENSA” organization

In: Statistics and Probability

In the 1920s, about 97% of U.S. colleges and universities required a physical education course for...

In the 1920s, about 97% of U.S. colleges and universities required a physical education course for graduation. Today, about 40% require such a course. A recent study of physical education requirements included 352 institutions: 125 private and 227 public. Among the private institutions, 101 required a physical education course, while among the public institutions, 60 required a course

(a) What are the statistics ( ±0.0001)? X1 = , n1 = , 1p^1 = X2 = , n2 = , 2p^2 =

(b) Use a 95% confidence interval to compare the private and the public institutions with regard to the physical education requirement ( ±±0.0001) The 95% confidence interval if from to Conclusion

We have no evidence to conclude that private institutions are more likely to require physical education

We have evidence to conclude that private institutions are more likely to require physical education

(c) Use a significance test to compare the private and the public institutions with regard to the physical education requirement ( ±0.0001) p^ = z = P-value = Conclusion

We have evidence to conclude that public institutions are more likely to require physical education

We have no evidence to conclude that public institutions are more likely to require physical education Check Syntax

In: Statistics and Probability

1. Create the following items in R. Show the R output provided by each object. b...

1. Create the following items in R. Show the R output provided by each object.

b The matrix     1 2 3 4 5 6 7 8 9 10 11 12     stored as object mat1.

c The matrix     1 5 9 2 6 10 3 7 11 4 8 12     stored as object mat2.

d A data frame datfr such that the vector 1 2 3 4 5 6 is in the first column, green green red blue yellow red is in the second column, and 4 7 2 8 2 10 is in the third column.

e A list objs consisting of the objects in parts a, b, c, and d, in that order.

In: Statistics and Probability

What’s the probability that a fair coin results in 12 or fewer heads from 40 flips?...

What’s the probability that a fair coin results in 12 or fewer heads from 40 flips? And What does that probability say about the fairness of the coin?

In: Statistics and Probability

The Salem Board of Education wants to evaluate the efficiency of the town’s four elementary schools....

The Salem Board of Education wants to evaluate the efficiency of the town’s four elementary schools. The three outputs of the schools are

■   output 1 = average reading score
■   output 2 = average mathematics score

■   output 3 = average self-esteem score


The three inputs to the schools are

■   input 1 = average educational level of mothers
(defined by highest grade completed: 12 = high
school graduate; 16 = college graduate, and so on)

■   input 2 = number of parent visits to school (per child)

■   input 3 = teacher-to-student ratio

The relevant information for the four schools is given in the file P04_42.xlsx. Determine which (if any) schools are inefficient.

Inputs
School Input 1 Input 2 Input 3
1 13 4 0.05
2 14 5 0.05
3 11 6 0.06
4 15 8 0.08
Outputs
School Output 1 Output 2 Output 3
1 9 7 6
2 10 8 7
3 11 7 8
4 9 9 9
Input 1 Input 2 Input 3
Unit costs of inputs
Output 1 Output 2 Output 3
Unit value of outputs
Costs/prices of inputs/outputs used
School Input costs Output values
1
2
3
4
Selected school input cost
Selected school output value
School Efficient or not?
1
2
3
4
Selected DEA school

In: Statistics and Probability

1. In a recent campus survey, 75 Indiana students were asked if they felt that their...

1. In a recent campus survey, 75 Indiana students were asked if they felt that their education at Cleary was preparing them for their future careers and 83% of students responded “Extremely well prepared.” Construct a 95% confidence interval (use z= 1.96) for the true proportion of Cleary students who feel the same way. Round standard error to 4 decimal places.

2.   Is there anything you could do to get a narrower range of values in the previous problem?

3. Is the sample size of 75 from the previous problem enough to get a 3% margin of error and 95% confidence? (To get full credit find the minimum sample size and then compare to 600 to see if large enough). Again use z=1.96. Also use a p-hat value of 0.83.

In: Statistics and Probability

Discussion Question 5.2: One of the assumptions of the F test for comparing two variances is...

Discussion Question 5.2: One of the assumptions of the F test for comparing two variances is that both populations are normally distributed. This is one of the assumptions in a long line of assumptions that we have discussed in this course. In your opinion, why is validating these assumptions a matter of statistical ethics in a sense? Do you feel validating the assumptions is necessary before you use any of these statistical techniques for making decisions?

In: Statistics and Probability

A marketing organization wishes to study the effects of four sales methods on weekly sales of...

A marketing organization wishes to study the effects of four sales methods on weekly sales of a product. The organization employs a randomized block design in which three salesman use each sales method. The results obtained are given in the following table, along with the Excel output of a randomized block ANOVA of these data.

Salesman, j
Sales Method, i A B C
1 39 31 23
2 41 28 28
3 31 23 15
4 33 19 16
ANOVA: Two-Factor without Replication
SUMMARY Count Sum Average Variance
Method 1 3 93 31.0000 64.0000
Method 2 3 97 32.3333 56.3333
Method 3 3 69 23.0000 64.0000
Method 4 3 68 22.6667 82.3333
Salesman A 4 144 36.00 22.6667
Salesman B 4 101 25.25 28.2500
Salesman C 4 82 20.50 37.6667
ANOVA
Source of Variation SS df MS F P-Value F crit
Rows 236.9167 3 78.9722 16.43 .0027 4.7571
Columns 504.5000 2 252.2500 52.49 .0002 5.1433
Error 28.8333 6 4.80556
Total 770.2500 11

(a) Test the null hypothesis H0 that no differences exist between the effects of the sales methods (treatments) on mean weekly sales. Set α = .05. Can we conclude that the different sales methods have different effects on mean weekly sales?

F = 16.43, p-value = .0027; (Click to select)RejectDo not reject H0: there is (Click to select)a differenceno difference in effects of the sales methods (treatments) on mean weekly sales.

(b) Test the null hypothesis H0 that no differences exist between the effects of the salesmen (blocks) on mean weekly sales. Set α = .05. Can we conclude that the different salesmen have different effects on mean weekly sales?

F = 52.49, p-value = .0002; (Click to select)Do not rejectReject H0: salesman (Click to select)do notdo have an effect on sales

(c) Use Tukey simultaneous 95 percent confidence intervals to make pairwise comparisons of the sales method effects on mean weekly sales. Which sales method(s) maximize mean weekly sales? (Round your answers to 2 decimal places. Negative amounts should be indicated by a minus sign.)

Method 1 – Method 2: [, ]
Method 1 – Method 3: [, ]
Method 1 – Method 4: [, ]
Method 2 – Method 3: [, ]
Method 2 – Method 4: [, ]
Method 3 – Method 4: [, ]

In: Statistics and Probability

Question5. Please answer the following questions as either "TRUE" or "FALSE" (Write out either “True” or...

Question5. Please answer the following questions as either "TRUE" or "FALSE" (Write out either “True” or “False”)

A. A normal distribution is characterized by its mean and its degrees of freedom. ______________

B. A paired t-test is considered a nonparametric test. ______________

C. A 93% confidence interval for µ is that numerical interval constructed to contain 93% of the values of µ. ____________

D. A 94% confidence interval for a mean is narrower than a 93% confidence interval. ______________

E. A regression analysis involving 93 patients would have 92 degrees of freedom. ______________

F. A negative correlation coefficient indicates a poor linear relationship. ______________

G. For a two-tailed hypothesis test comparing two group means, assuming a normal distribution and standard deviation σ known, a p-value < 0.05 is analogous to | z-value | > 1.96. ______________

H. An increase in the Type I error will decrease Power, everything else held constant. ______________

I. If the Type II error of a hypothesis test is equal to 0.06, the Power of the test is 0.94. __________

Multiple Choice: (Please circle the correct answer!)

J. The normal approximation may be used as an approximation to the binomial distribution when: (n=sample size)

a) the sample size is at least 30

b) np and nq are both less than 5 [p=P(success), q = 1-p]

c) np and nq are both greater or equal to 5 [p=P(success), q = 1-p]

K. For the normal distribution, with continuous data and the population variance known, the standard error of the mean is determined by the:

a) The population mean and the sample size

b) The population standard deviation and the sample size

c) The population mean and the population standard deviation

d) Sample mean and the population standard deviation

d) np is greater than or equal to 30

L. A 95% confidence interval for a population mean birth weight was determined to be 2100 to 2700 grams. Assume birth weights are normally distributed. The p-value for a test of the null hypothesis H0: Uo= 3000 grams versus Ha: Uo # 3000 grams is approximately.

a) 0.10

b) 0.06 e) 0.25

c) < 0.05

d) > 0.05

In: Statistics and Probability

Please write in BOLD Thanks :) In Lesson Eight you've learned how to construct confidence intervals...

Please write in BOLD Thanks :)

In Lesson Eight you've learned how to construct confidence intervals for population parameters and proportions, based on data from samples.

  1. In a short paragraph, distinguish between Sampling Error and the Margin of Error. Explain what each represents and the relationship between them.
  2. In a short paragraph, distinguish between a Point Estimate and an Interval Estimate. Explain what each represents and the relationship between them.
  3. In a short paragraph, distinguish between a Confidence Interval and the Level of Confidence. Explain what each represents and the relationship between them.
  4. Identify the three components of a Confidence Interval. Explain what each represents and the relationship between them.
  5. Describe a potential issue with public opinion polls that report interval estimates, but not confidence intervals. Discuss this in a short paragraph.

In: Statistics and Probability