Questions
Use R for this problem(Code in R). A firm’s personnel officer sampled 36 male and 24...

Use R for this problem(Code in R). A firm’s personnel officer sampled 36 male and 24 female employees investigate allegations that the men in the organization tend to receive hire annual bonuses than the women. Their bonuses (as percentages of their annual salaries) are below.

Men

10.4 8.9 11.7 12.0 8.7 9.4 9.8 9.0 9.2 9.7
9.1 8.8 7.9 9.9 10.0 10.1 9.0 11.4 8.7 9.6
9.2 9.7 8.9 9.3 10.4 11.9 9.0 12.0 9.6 9.2
9.9 9.0 9.2 9.4 9.7 8.9

Women

8.4 9.0 9.0 7.7 9.6 8.4 9.1 9.2 8.7 9.1
9.3 8.4 6.7 9.9 8.0 9.2 7.7 11.9 6.2 8.4
9.0 6.9 7.6 7.4


a) Check all necessary assumptions for running a t-test for the difference between the two populations of bonus percentages. If you need to, check normality with boxplots, normal probability plots, ad.test(), and shapiro.test().
b) Can you pool in this situation Why or why not
c) Write down the hypotheses to test given the personnel officer wants to know if there is evidence to conclude the men receive higher bonus percentages. Use R to run a t-test using the t.test() function. Provide your code, output, and conclusion based on the p-value.

Question!

I tested the Anderson-Darling and Shapiro tests. According to the two tests, in case of men, we reject to the null hypothesis since the p-value is too small. So, the population of men don't follow the normal distribution. However, the answers from this website are saying the men is normal distribution. And, do you think is can not be pooled? They are almost same standard deviation. I don't want to get same answer that is from here.

In: Statistics and Probability

Business bankruptcies in Canada are monitored by the Office of the Superintendent of Bankruptcy Canada (OSB)....

Business bankruptcies in Canada are monitored by the Office of the Superintendent of Bankruptcy Canada (OSB). Included in each report are the assets and liabilities the company declared at the time of the bankruptcy filing. OSB is interested in finding out if the mean debt (liabilities minus assets) across all bankruptcy cases is different from $100,000. A study is based on a random sample of 75 reports from the current year. The average debt is $92,172. With a standard deviation of $11,153.8. Set up the hypotheses.

Conduct the hypothesis testing at a level of significance of 5%. Use both critical value approach and p-value approach.

In: Statistics and Probability

Sample annual salaries​ (in thousands of​ dollars) for employees at a company are listed. 42   51 ...

Sample annual salaries​ (in thousands of​ dollars) for employees at a company are listed.

42  

51 

57 

47  

41 

41  

42 

51  

57  

31 

47 

42 

46

  

​(a) Find the sample mean and sample standard deviation.

​(b) Each employee in the sample is given a

55​%

raise. Find the sample mean and sample standard deviation for the revised data set.

​(c) To calculate the monthly​ salary, divide each original salary by 12. Find the sample mean and sample standard deviation for the revised data set.

​(d) What can you conclude from the results of​ (a), (b), and​ (c)?

In: Statistics and Probability

What are the different types of sampling, and how do they affect the statistical data analysis...

What are the different types of sampling, and how do they affect the statistical data analysis decisions?

In: Statistics and Probability

A) Fill in the blanks. (Enter an exact positive number as an integer, fraction, or decimal.)...

A) Fill in the blanks. (Enter an exact positive number as an integer, fraction, or decimal.)

In a normal distribution,

x = 3 and z = −1.19. This tells you that x = 3 is ________ standard deviations to the _____ (left or right) of the mean

B) Fill in the blanks. (Enter an exact positive number as an integer, fraction, or decimal.)

In a normal distribution,

x = −2 and z = 6. This tells you that x = −2 is ______standard deviations to the _____ (left or right) of the mean

C) Fill in the blanks. (Enter an exact positive number as an integer, fraction, or decimal.)

In a normal distribution,

x = 9 and z = −1.4. This tells you that x = 9 is ______standard deviations to the _____ (left or right) of the mean

D) About what percent of the x values from a normal distribution lie within two standard deviations (left and right) of the mean of that distribution? (Enter an exact number as an integer, fraction, or decimal.)

E) About what percent of x values lie between the mean and one standard deviation (one sided)? (Enter an exact number as an integer, fraction, or decimal.)

F) About what percent of x values lie between the first and third standard deviations (both sides)? (Enter an exact number as an integer, fraction, or decimal.)

G) If the area to the left of x in a normal distribution is 0.163, what is the area to the right of x? (Enter an exact number as an integer, fraction, or decimal.)

H) Use the following information to answer the next exercise.

X ~ N(54, 8)

Find the 90th percentile. (Round your answer to two decimal places.)

I) Find the probability that x is between four and 12. (Round your answer to four decimal places.)

X ~ N(5, 3)

In: Statistics and Probability

Listed below are the thorax lengths​ (in millimeters) of a sample of male fruit flies. Find...

Listed below are the thorax lengths​ (in millimeters) of a sample of male fruit flies. Find the range and standard deviation for the given sample data.

0.84 0.83 0.66 0.87 0.88 0.91 0.65 0.71 0.71 0.87 0.660.84   0.83   0.66   0.87   0.88   0.91   0.65   0.71   0.71   0.87   0.66

  

In: Statistics and Probability

The following data give the number of hours 55 students spent studying and their corresponding grades...

The following data give the number of hours 55 students spent studying and their corresponding grades on their midterm exams.

Hours Studying 2 5 5 5 5

Midterm Grades

68 74 81 92 99

Step 5 of 5 :  

Construct the 99% confidence interval for the slope. Round your answers to three decimal places.

In: Statistics and Probability

How was the third model done. I don't understand how to get a regression with 3...

How was the third model done. I don't understand how to get a regression with 3 variables

In: Statistics and Probability

1. A researcher is interested in finding a 98% confidence interval for the mean number of...

1. A researcher is interested in finding a 98% confidence interval for the mean number of times per day that college students text. The study included 144 students who averaged 44.7 texts per day. The standard deviation was 16.5 texts.

a. To compute the confidence interval use a ? z t  distribution.

b. With 98% confidence the population mean number of texts per day is between  and   texts.

c. If many groups of 144 randomly selected members are studied, then a different confidence interval would be produced from each group. About  percent of these confidence intervals will contain the true population number of texts per day and about  percent will not contain the true population mean number of texts per day.

2. You want to obtain a sample to estimate how much parents spend on their kids birthday parties. Based on previous study, you believe the population standard deviation is approximately σ=40.4 dollars. You would like to be 90% confident that your estimate is within 1.5 dollar(s) of average spending on the birthday parties. How many parents do you have to sample? n =

3. You want to obtain a sample to estimate a population mean. Based on previous evidence, you believe the population standard deviation is approximately σ=57.5. You would like to be 95% confident that your estimate is within 0.1 of the true population mean. How large of a sample size is required?

n =

In: Statistics and Probability

Answer each question using essay format (that is, use sentences and paragraphs, not point form). For...

Answer each question using essay format (that is, use sentences and paragraphs, not point form). For each question, answers are limited to a maximum of 150-200 words

1. What is chi square?

In: Statistics and Probability

Total plasma volume is important in determining the required plasma component in blood replacement therapy for...

Total plasma volume is important in determining the required plasma component in blood replacement therapy for a person undergoing surgery. Plasma volume is influenced by the overall health and physical activity of an individual. Suppose that a random sample of 44 male firefighters are tested and that they have a plasma volume sample mean of x = 37.5 ml/kg (milliliters plasma per kilogram body weight). Assume that σ = 7.80 ml/kg for the distribution of blood plasma.

(a) Find a 99% confidence interval for the population mean blood plasma volume in male firefighters. What is the margin of error? (Round your answers to two decimal places.)

lower limit?   
upper limit?
margin of error?   


(b) What conditions are necessary for your calculations? (Select all that apply.)

n is largethe distribution of weights is uniformσ is unknownσ is knownthe distribution of weights is normal



(c) Interpret your results in the context of this problem.

The probability that this interval contains the true average blood plasma volume in male firefighters is 0.01.99% of the intervals created using this method will contain the true average blood plasma volume in male firefighters.    1% of the intervals created using this method will contain the true average blood plasma volume in male firefighters.The probability that this interval contains the true average blood plasma volume in male firefighters is 0.99.


(d) Find the sample size necessary for a 99% confidence level with maximal margin of error E = 2.90 for the mean plasma volume in male firefighters. (Round up to the nearest whole number.)
____male firefighters?

In: Statistics and Probability

Answer each question using essay format (that is, use sentences and paragraphs, not point form). For...

Answer each question using essay format (that is, use sentences and paragraphs, not point form). For each question, answers are limited to a maximum of 150-200 words.

1. What is ANOVA?

In: Statistics and Probability

Coca-Cola Revenues ($ millions), 2005–2010 Quarter 2005 2006 2007 2008 2009 2010 Qtr1 5,200 5,117 6,075...

Coca-Cola Revenues ($ millions), 2005–2010
Quarter 2005 2006 2007 2008 2009 2010
Qtr1 5,200 5,117 6,075 7,380 7,150 7,800
Qtr2 6,304 6,465 7,705 9,045 8,220 8,659
Qtr3 6,031 6,410 7,662 8,305 8,025 8,411
Qtr4 5,545 5,905 7,303 7,040 7,480 10,479



(a-1) Use MegaStat or Minitab to deseasonalize Coca-Cola’s quarterly data. (Round your answers to 3 decimal places.)

1 2 3 4
2005
2006
2007
2008
2009
2010
mean


(a-2) State the adjusted four quarterly indexes. (Round your answers to 3 decimal places.)

Q1 Q2 Q3 Q4


(a-3) What is the trend model for the deseasonalized time series? (Round your answers to 2 decimal places.)

yt =  xt +

(b) State the model found when performing a regression using seasonal binaries. (A negative value should be indicated by a minus sign. Round your answers to 4 decimal places.)

yt =  +  t +  Q1 +  Q2 +  Q3

(c) Use the regression equation to make a prediction for each quarter in 2011. (Enter your answers in millions rounded to 3 decimal places.)

Quarter Predicted
Q1
Q2
Q3
Q4

In: Statistics and Probability

An ordinary deck of 52 cards is well-shuffled, and then the cards are turned face up...

An ordinary deck of 52 cards is well-shuffled, and then the cards are turned

face up one by one until a face card (K,Q,J) appears. Find the expected

number of cards that are face up. Show and explain plz

In: Statistics and Probability

A clinical trial was conducted to test the effectiveness of a drug for treatment insomnia in...

A clinical trial was conducted to test the effectiveness of a drug for treatment insomnia in older subjects. Before treatment, 13 subjects had a mean wake time of 102.0 min. After treatment, the 13 subjects had a mean wake time of 92.4 min and a standard deviation of 22.1 min. Assume that 13 samples value appear to be from a normally distributed population and construct a 99% confidence interval estimate of the mean wake time for a population with drug treatments. What does the result suggest about the mean wake time of 102.0 min before the treatment? Does the drug appear to be effective?

-----min < u <----- min. What does the result suggest about the mean wake time of 102.0 min before the treatment? Does the drug appear to be effective?

The confidence interval -------------------the mean wake time of 102.0 min before the treatment, so the means before and after the treatment --------------------this result suggests that the drug treatment ------------------a significant effect.

In: Statistics and Probability