The Australian Bureau of Statistics has reported that in 2015, 26.78% of the 16.8 million employees in Australia worked part-time in their main job. Suppose that you select a random sample of 250 employees from around Australia.
(a) What is the probability that more than 26.2% of those sampled work part-time in their main job?
(b) What is the probability that the proportion of part-time employees in the sample is between 27% and 29%?
(c) The probability is 77% that the sample proportion of part-time workers will be above what value?
In: Statistics and Probability
In deciding where to invest her retirement fund, an investor recorded the weekly returns of two portfolios for one year, with the results stored in columns 1 and 2. Some of these data are shown below. Can we conclude at the 5% significance level that portfolio 2 is riskier than portfolio 1? Compare both return and risk . Do on Excel .
This is the Data :
Portflio-1 | Portflio-2 |
0.22 | 0.32 |
0.59 | 0.35 |
0.11 | 0.48 |
-0.05 | 0.36 |
0.44 | 0.27 |
0.38 | 0.33 |
0.05 | 0.47 |
0.00 | -0.01 |
0.27 | -0.22 |
0.10 | 0.37 |
0.41 | 0.76 |
0.01 | 0.14 |
0.46 | 0.57 |
0.26 | 0.06 |
0.08 | 0.70 |
0.42 | -0.32 |
0.38 | 0.83 |
0.22 | 0.55 |
0.13 | 0.73 |
0.11 | -0.17 |
0.33 | 0.04 |
0.14 | -0.07 |
0.28 | 0.28 |
0.28 | 0.31 |
0.44 | 0.72 |
0.28 | -0.03 |
0.21 | 0.34 |
0.04 | 0.13 |
0.58 | 0.24 |
0.24 | 0.64 |
0.17 | -0.22 |
0.10 | 0.81 |
-0.03 | 0.33 |
0.44 | 0.66 |
0.12 | 0.80 |
0.00 | 0.09 |
0.11 | 0.24 |
0.11 | 0.22 |
0.29 | 0.51 |
0.14 | 0.60 |
0.15 | 0.42 |
0.43 | -0.10 |
0.15 | 0.26 |
0.55 | 0.57 |
0.18 | 0.24 |
0.22 | 0.37 |
0.22 | 0.41 |
0.11 | 0.78 |
0.03 | 0.24 |
0.10 | 0.32 |
0.15 | 0.56 |
0.23 | -0.02 |
In: Statistics and Probability
A company claims to manufacture paper clips with a standard
deviation of less than 0.12g. A
set of 71 paper clips has a sample standard deviation of 0.10 g. Is
the claim supported by the test?
In: Statistics and Probability
Question: 2 The data below shows the amount of money in millions of shillings paid out suppliers by firm over a certain period of time 7.0 4.1 3.4 5.5 4.5 6.6 7.3 7.7 8.0 3.0 5.0 4.5 7.2 5.0 2.7 7.0 5.5 7.0 8.5 7.0 3.0 5.0 6.0 5.3 4.0 4.5 3.5 5.5 2.0 8.1 2.5 5.1 3.5 6.2 6.0 3.0 4.5 3.5 5.0 8.9 5.3 2.3 2.8 6.5 6.8 5.0 6.5 3.4 3.5 7.6
a) Starting with 2.0-2.9 ,construct a frequency distribution table for the data
b) Using the frequency table;
i. Draw a cumulative curve for the data and hence estimate the median and inter quartile range
ii. Calculate the mean and standard deviation of production
In: Statistics and Probability
Question:5
September 2019
Mean $372.40
Standard deviation $26.10
Sample size 36
In the previous year the average cost of each holiday was $356.20.
The company wishes to adopt a more scientific approach to estimating customer satisfaction.
What sample size would be needed to estimate the proportion of customers’ views to within 2% of the true figure at the 95% confidence limit and Interval
In: Statistics and Probability
You are working with the marketing team for a FMCG firm that produces shaving cream. The team believes that sales of some of the products are closely related to sales of other products. They want you to explore this in a little more depth for two products, SKU 123 and SKU 456. Unfortunately, all of the base sales data for these products has been destroyed. All that you have is the weekly summary data:
|
|
|
Now the marketing team wants to understand the potential weekly sales for these two products. Let the sales price for the two SKUs be 12.50, 7.75, respectively.
1) What is the expected weekly revenue?
2) What is the standard deviation of the weekly revenue?
3) Assuming the marketing team’s correlation of 0.79 is correct. What is the probability that weekly sales will be between 10,000 and 20,000 dollars?
In: Statistics and Probability
Chi-square analysis compares the counts of two categorical variables to tell you if a relationship exists between the variables or not. You can apply chi-square analysis to answer important questions about factors in everyday life, and even about events like elections... or Halloween. If you are a character in a slasher film, is there a connection between your gender and your dying in some horrible manner?
1. Propose a life event or situation and two categorical variables for it. Complete a chi-square analysis of this event or situation and these variables, and share your results.
2. Do you agree with your findings? Why or why not?
In: Statistics and Probability
(a) Using the armspanSpring2020.csv data from class, test the hypothesis that those who identify as female have a shorter armspan than those who do not so identify. Write out the null and alternative hypotheses, give the value of the test statistic and the p-value, and state your conclusion using a 5% significance level. Use R for all computations.
(b) Interpret, in your own words, the meaning of the p-value you got in part (a).
(c) Find a 95% confidence interval for the mean armspan using
the data in armspanSpring2020.csv. Use R.
(d) What assumptions must you make if we wish to interpret this
interval to apply to all UCLA students? Which of these assumptions
do you think are met adn which are not?
(e) Find a 95% confidence interval for the difference between mean
armspan and mean heights. Does it contain 0? Why is this surprising
or not-surprising?
height | armspan | is.female |
67 | NA | 1 |
70 | 40 | 0 |
64 | 67 | 1 |
71 | 70 | 0 |
72 | 49 | 0 |
62 | 61 | 1 |
72 | 74 | 0 |
71 | 68 | 0 |
63 | 60 | 1 |
69 | 69 | 0 |
67 | 68 | 1 |
63 | 63 | 1 |
60 | 60 | 1 |
66 | 66 | 0 |
61 | 61 | 1 |
69 | 68 | 0 |
65 | 65 | 1 |
72 | 72 | 0 |
70 | 70 | 0 |
73 | 77 | 0 |
65 | 61 | 1 |
68 | 72 | 1 |
62 | 55 | NA |
71 | 74 | 0 |
72 | 70 | 0 |
66 | 22 | 1 |
65 | 67 | 1 |
64 | 62 | 0 |
65 | 62 | 1 |
73 | 69 | 0 |
67 | 77 | 0 |
60 | 62 | 1 |
70 | 59 | 0 |
68 | 66 | 1 |
65 | 65 | 1 |
72 | 69 | 0 |
62 | 52 | 1 |
69 | 66 | 0 |
68 | 67 | 0 |
65 | 66 | 1 |
65 | 64 | 0 |
66 | 65 | 1 |
62 | 52 | 1 |
64 | 62 | 1 |
66 | 65 | 1 |
69 | 69 | 0 |
64 | 65 | 1 |
70 | 74 | 0 |
65 | 69 | 0 |
70 | 80 | 0 |
63 | NA | 1 |
67 | 70 | 1 |
64 | 64 | 1 |
64 | 62 | 1 |
6 | 5.7 | 0 |
67 | 67 | 1 |
72 | 71 | 0 |
73 | 75 | 0 |
68 | 68 | 0 |
67 | 63 | 1 |
66 | 67 | 1 |
67 | 36 | 0 |
68 | 72 | 0 |
73 | 70 | 0 |
70 | 70 | 0 |
70 | 72 | 0 |
60 | 58 | 0 |
70 | 68 | 0 |
62 | 63 | 0 |
68 | 68 | 1 |
67 | 67 | NA |
68 | 71 | 0 |
65 | 48 | 1 |
70 | 76 | 0 |
69 | 70 | 0 |
69 | 66 | 0 |
58 | 55 | NA |
64 | 64 | 0 |
Please help with the r codes especially. It is my first time using it and I'm having a hard time. Thanks!
In: Statistics and Probability
A corporation monitors time spent by office workers browsing the web on their computers instead of working. The computer records of a random sample of 101 workers showed that 9 of them spent at least 30 minutes per day browsing. Construct a 90% confidence interval for the proportion of workers who spend at least 30 minutes per day browsing. Do the results support the claim that over 15% of the workers do this
In: Statistics and Probability
4) We're going to test the same hypothesis four ways. Assume the
people in the dataset in armspanSpring2020.csv are a random sample
of all adults. For each test, report the test statistic and the
p-value. With a 5% significance level, give the conclusion of each
test.
a) Test the hypothesis that the mean difference between armspan and
height it not equal to 0, using the data in armspanSpring2020.csv.
Do this by creating a new variable named diff = (armspan - height).
Perform a one-sample t-test.
b) Test the same hypothesis, but use a two-sample t-test with
paired =TRUE.
c) Test the same hypothesis, but use a two-sample t-test with
paired=FALSE and var.equal=FALSE.
d) Test the same hypothesis, but use a two-sample t-test with
paired=FALSE and var.equal=TRUE.
e) Which test(s) do you think are valid for this situation and
why?
hint: We almost never use the var.equal=TRUE test. Why? Because it
is only valid if the population standard deviations of both
populations are equal. You might be in a situation where you know
this to be true. If so, fine, use it. But usually we don't, in
which case (a) the var.equal=FALSE test will provide more accurate
p-values if the standard deviations are not equal and (b) will
provide pretty accurate p-value if they are. So you can't lose,
really, with the var.equal=FALSE test, but you can lose with it the
other way.
g)Data cleaning. Identify by row number which observations seem in need of cleaning and why you think so. Provide a table. (Hint: consider the "which()" and "identify()" functions.) Provide a graph to justify your identifications.
height | armspan | is.female |
67 | NA | 1 |
70 | 40 | 0 |
64 | 67 | 1 |
71 | 70 | 0 |
72 | 49 | 0 |
62 | 61 | 1 |
72 | 74 | 0 |
71 | 68 | 0 |
63 | 60 | 1 |
69 | 69 | 0 |
67 | 68 | 1 |
63 | 63 | 1 |
60 | 60 | 1 |
66 | 66 | 0 |
61 | 61 | 1 |
69 | 68 | 0 |
65 | 65 | 1 |
72 | 72 | 0 |
70 | 70 | 0 |
73 | 77 | 0 |
65 | 61 | 1 |
68 | 72 | 1 |
62 | 55 | NA |
71 | 74 | 0 |
72 | 70 | 0 |
66 | 22 | 1 |
65 | 67 | 1 |
64 | 62 | 0 |
65 | 62 | 1 |
73 | 69 | 0 |
67 | 77 | 0 |
60 | 62 | 1 |
70 | 59 | 0 |
68 | 66 | 1 |
65 | 65 | 1 |
72 | 69 | 0 |
62 | 52 | 1 |
69 | 66 | 0 |
68 | 67 | 0 |
65 | 66 | 1 |
65 | 64 | 0 |
66 | 65 | 1 |
62 | 52 | 1 |
64 | 62 | 1 |
66 | 65 | 1 |
69 | 69 | 0 |
64 | 65 | 1 |
70 | 74 | 0 |
65 | 69 | 0 |
70 | 80 | 0 |
63 | NA | 1 |
67 | 70 | 1 |
64 | 64 | 1 |
64 | 62 | 1 |
6 | 5.7 | 0 |
67 | 67 | 1 |
72 | 71 | 0 |
73 | 75 | 0 |
68 | 68 | 0 |
67 | 63 | 1 |
66 | 67 | 1 |
67 | 36 | 0 |
68 | 72 | 0 |
73 | 70 | 0 |
70 | 70 | 0 |
70 | 72 | 0 |
60 | 58 | 0 |
70 | 68 | 0 |
62 | 63 | 0 |
68 | 68 | 1 |
67 | 67 | NA |
68 | 71 | 0 |
65 | 48 | 1 |
70 | 76 | 0 |
69 | 70 | 0 |
69 | 66 | 0 |
58 | 55 | NA |
64 | 64 | 0 |
Please help with the r codes. It is my first time doing r studio and I'm having a hard time. Thanks!
In: Statistics and Probability
Question:4
In: Statistics and Probability
There are 10 democrats and 8 Republicans on a senate committee. From this group a 5- member subcommittee is to be formed. Find the number of 5-person subcommittee that consists of:
a)Any members of the senate committee.
b)Democrats only.
c)3 Republicans and 2 Democrats.
d)At least 4 democrats.
e)John Edwards,who is a Democrat, and any 4 Republicans
f)What is the probability that the 5-members subcommittee will include 3 Republicans and 2 Democrats?
In: Statistics and Probability
If a regression analysis was to be completed on body mass index (BMI), what could be an independent variable in that analysis? Why? If we could, what other independent variables should be included in the analysis? What statistic(s) would show the value of that regression in understanding BMI?
Alternatively, find an article that uses regression analysis to study a medical concern. In that study, what was the dependent variable and what was the independent variable(s)? Further, how would you use this study to highlight the difference between correlations and causation?
Please provide APA references
Thank You
In: Statistics and Probability
Thinking of the many variables tracked by hospitals and doctors' offices, confidence intervals could be created for population parameters (such as means or proportions) that were calculated from many of them. Choose a topic of study that is tracked (or that you would like to see tracked) from your place of work. Discuss the variable and parameter (mean or proportion) you chose, and explain why you would use these to create an interval that captures the true value of the parameter of patients with 95% confidence.
Please provide APA reference
Thank You
In: Statistics and Probability
Oak hill has 74806 registered automobiles. A city ordinance requires each to display a bumper decal showing that the owner paid an annual wheel tax of $50. By law, new decals need to be purchased during the month of the owner's birthday. This year's budget assumes that at least 306000 in decal revenue will be collected in November. What is the probability that the wheel taxes reported in that month will be less than anticipated and produce a budget shortfall?
In: Statistics and Probability