Question

In: Statistics and Probability

Question one A researcher in a large supermarket wishes to study sickness absences among its employees....

Question one

A researcher in a large supermarket wishes to study sickness absences among its employees. The
organisation has branches in all the provinces, each branch keeps full records of sickness leave. A random sample of ten such branches produced the following data showing the number of days
of sickness per branch in the year 2017.
18 23 26 30 32 35 39 45 48 54
Required:
a) Using the above data

a). Calculate (manually and using the computer software such as EXCEL, SPSS etc), a
95% confidence interval for the mean amount of sickness days per branch.

b). Estimate the number of branches that should be included in a simple random sample so that a 95% confidence interval for the mean number of days sickness should not have a width greater than 4 days 5 marks
c)) After the sample was collected, it became apparent that the branches fell into three natural groups in terms of sales-small, medium and large. From the data on all of the branches in the provinces, the researcher found that of 210 randomly selected staff, 90 worked in small branches, 36 in medium sized branches, and the rest worked in large branches. In total, 96 of the selected staff had no days off for sickness, of which 52 worked in small branches, and 29 worked in large sized branches.
i) Form a table showing the information clearly. 5 marks
ii) Carry out an appropriate statistical test to investigate whether the size of branch influences the occurrence of sickness absence, interpret your results clearly.

Solutions

Expert Solution

a). Calculate (manually and using the computer software such as EXCEL, SPSS etc), a
95% confidence interval for the mean amount of sickness days per branch.

The required confidence interval for the population mean by using excel is given as below:

Confidence Interval Estimate for the Mean

Data

Sample Standard Deviation

11.5181017

Sample Mean

35

Sample Size

10

Confidence Level

95%

Intermediate Calculations

Standard Error of the Mean

3.642343568

Degrees of Freedom

9

t Value

2.2622

Interval Half Width

8.2396

Confidence Interval

Interval Lower Limit

26.76

Interval Upper Limit

43.24

b). Estimate the number of branches that should be included in a simple random sample so that a 95% confidence interval for the mean number of days sickness should not have a width greater than 4 days

WE are given

Margin of error = 4

Estimate for standard deviation = σ = 11.5181017

Confidence level = 95%

Critical Z value = 1.96

(by using z-table)

Sample size formula is given as below:

n = (Z*σ/E)^2

n = (1.96*11.5181017/4)^2

n = 31.85327

Required sample size = 32

c)) After the sample was collected, it became apparent that the branches fell into three natural groups in terms of sales-small, medium and large. From the data on all of the branches in the provinces, the researcher found that of 210 randomly selected staff, 90 worked in small branches, 36 in medium sized branches, and the rest worked in large branches. In total, 96 of the selected staff had no days off for sickness, of which 52 worked in small branches, and 29 worked in large sized branches.
i) Form a table showing the information clearly.

The required table is given as below:

Sale

Small

Medium

Large

Total

No days off

52

15

29

96

Off

38

21

55

114

Total

90

36

84

210

ii) Carry out an appropriate statistical test to investigate whether the size of branch influences the occurrence of sickness absence, interpret your results clearly.

Here, we have to use Chi square test for independence.

Null hypothesis: H0: The size of branch not influences the occurrence of sickness absence.

Alternative hypothesis: Ha: The size of branch influences the occurrence of sickness absence.

We assume level of significance = α = 0.05

Test statistic formula is given as below:

Chi square = ∑[(O – E)^2/E]

Where, O is observed frequencies and E is expected frequencies.

E = row total * column total / Grand total

We are given

Number of rows = r = 2

Number of columns = c = 3

Degrees of freedom = df = (r – 1)*(c – 1) = 1*2 = 2

α = 0.05

Critical value = 5.991465

(by using Chi square table or excel)

Calculation tables for test statistic are given as below:

Observed Frequencies

Column variable

Row variable

Small

Medium

Large

Total

No off

52

15

29

96

off

38

21

55

114

Total

90

36

84

210

Expected Frequencies

Column variable

Row variable

Small

Medium

Large

Total

No off

41.14286

16.45714

38.4

96

off

48.85714

19.54286

45.6

114

Total

90

36

84

210

Calculations

(O - E)

10.85714

-1.45714

-9.4

-10.8571

1.457143

9.4

(O - E)^2/E

2.865079

0.129018

2.301042

2.412698

0.108647

1.937719

Chi square = ∑[(O – E)^2/E] = 9.754203

P-value = 0.007619

(By using Chi square table or excel)

P-value < α = 0.05

So, we reject the null hypothesis

There is sufficient evidence to conclude that the size of branch influences the occurrence of sickness absence.


Related Solutions

a store manager wishes to see if the number of absences of her employees is the...
a store manager wishes to see if the number of absences of her employees is the same for each weekdayselecteda tandom week and abscencrs find the gollowing number os abscences
the human resource director of a large corporation wishes to study absenteeism among its mid level...
the human resource director of a large corporation wishes to study absenteeism among its mid level managers at its central office during the year a random sample of 25 mid level managers reveals the following Absenteeism x =6.2 days s= 7 days. 13 mid level managers cite stress as a cause of absence. construct a 95% confidence interval estimate for the mean number of absences for mid level managers during the year
Scenario : A researcher wishes to analyze an outbreak of measles among unvaccinated children in their...
Scenario : A researcher wishes to analyze an outbreak of measles among unvaccinated children in their state. They have access to the following data for each case: age at time of infection, duration of infection (in weeks), and disease outcome (death or survival). 1) What kinds of data are these (nominal, ordinal, interval or ratio)? explain your answer. 2) Describe one statistic that you would calculate from these data to help understand the outbreak and explain your answer.
A) A researcher is interested in knowing the level of job satisfaction among employees in a...
A) A researcher is interested in knowing the level of job satisfaction among employees in a large company. He measures job satisfaction on a 5-point scale where 1=” Extremely dissatisfied” and 5 = “Extremely satisfied”. What is the matching description of the variable of interest in this study? Group of answer choices a. Quantitative / Discrete b. Quantitative / Continuous c. Categorical / Ordinal d. Categorical / Nominal B) What is the most suitable measure of centre for a skewed...
1. A researcher wishes to estimate the proportion of left-handers among a certain population. In a...
1. A researcher wishes to estimate the proportion of left-handers among a certain population. In a random sample of 900 people from the population, 74% are left-handed. a. Find the margin of error for the 95% confidence interval for the population proportion of the left-handers. b. Find the 95% confidence interval for the population proportion of the left-handers to four decimal places.
Human Resources department at a manufacturing company is concerned about unauthorized absences of its employees. As...
Human Resources department at a manufacturing company is concerned about unauthorized absences of its employees. As a part of their research, they decided to investigate the relationship between the number of unauthorized days that employees are absent per year and the distance (miles) between home and work for the employees. A sample of 10 employees was chosen, and the following data were collected. Perform an appropriate analysis on the data at a 0.01 level of significance. Answer the following questions...
A researcher wishes to conduct a study on differences in protein consumption by country of origin...
A researcher wishes to conduct a study on differences in protein consumption by country of origin amongst immigrants in NYC. The researcher selects a sample of patients from the population of interest. The data below represents the ages of patients enrolled in the study. 52 52 63 55 33 47 71 48 30 52 45 52 40 55 67 57 45 43 49 45 45 38 44 46 53 58 61 44 !a. Construct the frequency distribution of patient ages....
A researcher wishes to study the effect of drinking alcohol on players' scores in a video...
A researcher wishes to study the effect of drinking alcohol on players' scores in a video game. 180 people (60 females and 120 males) have volunteered to participate in the study. The treatments he would use will be 0, 1, 2 and 3 cans of 5% beer (355ml each) consumed 15 minutes before the start of the game. The participants' scores in the video game will be recorded Select one: a. The researcher should use randomized block design with two...
A researcher wishes to estimate the number of households with two computers. How large a sample...
A researcher wishes to estimate the number of households with two computers. How large a sample is needed in order to be 98% confident that the sample proportion will not differ from the true proportion by more than 4%? A previous study indicates that the proportion of households with two computers is 25%.
A researcher wishes to estimate the number of households with two computers. How large a sample...
A researcher wishes to estimate the number of households with two computers. How large a sample is needed in order to be 99​% confident that the sample proportion will not differ from the true proportion by more than 3​%? A previous study indicates that the proportion of households with two computers is 24%
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT