Is a sample correlation coefficient ρ = 0.83 significant at the α = 0.01 level based on a sample size of n = 6 data pairs? What about n = 14 data pairs? (Select all that apply.)
Yes, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 6 and α = 0.01.
No, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 14 and α = 0.01.
No, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 6 and α = 0.01.
Yes, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 14 and α = 0.01.
Yes, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 14 and α = 0.01.
No, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 6 and α = 0.01.
Yes, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 6 and α = 0.01.
No, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 14 and α = 0.01.
(b) Is a sample correlation coefficient ρ = 0.41 significant at the α = 0.05 level based on a sample size of n = 20 data pairs? What about n = 26 data pairs? (Select all that apply.)
No, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 20 and α = 0.05.
Yes, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 20 and α = 0.05.
No, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 26 and α = 0.05.
Yes, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 26 and α = 0.05.
No, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 26 and α = 0.05.
Yes, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 26 and α = 0.05.
Yes, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 20 and α = 0.05.
No, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 20 and α = 0.05.
(c) Is it true that in order to be significant, a ρ value must be larger than 0.90? larger than 0.70? larger than 0.50? What does sample size have to do with the significance of ρ? Explain your answer.
No, a larger sample size means that a smaller absolute value of the correlation coefficient might be significant.
Yes, a larger correlation coefficient of 0.90 means that the data will be significant.
Yes, a larger correlation coefficient of 0.50 means that the data will be significant.
Yes, a larger correlation coefficient of 0.70 means that the data will be significant.
No, sample size has no bearing on whether or not the correlation coefficient might be significant
In: Statistics and Probability
Both X and S2 are unbiased for the Poisson λ. Which is better? Use the following steps to answer this question.
a) Generate 200000 random numbers from the Poisson(λ = 2)
distribution and arrange them in a matrix with 20 rows. Thus you
have 10000 samples of size 20.
b) Compute the 10000 sample means and sample variances and store
them in objects means and vars, respectively.
c) Compute the average of the 10000 sample means and the average
of the 10000 sample variances. Report the two averages. Do they
support the claim that they are unbiased for λ = 2?
d) Compute the sample variance of the 10000 sample means and the
sample variance of the 10000 sample variances. Report the two
variances. Which estimator of λ is preferable.
In: Statistics and Probability
The exponential distribution with rate λ has mean μ = 1/λ. Thus the method of moments estimator of λ is 1/X. Use the following steps to verify that X is unbiased, but 1/X is biased.
a) Generate 10000 samples of size n = 5 from the standard exponential distribution (i.e. λ = 1) using rexp(50000) and arranging the 50000 random numbers in a matrix with 5 rows.
b) Use the apply() function to compute the 10000 sample means and store them in the object means. The 10000 estimators of λ can be stored in the object lambdas by lambdas = 1/means
c) Compute the sample mean of the object means, and sample mean of the object lambdas. What can you say about the bias of X and of 1/X?
d) Repeat with a sample of size n = 10, using rexp(100000), and report your estimate of the bias of 1/X. Has the bias decreased?
In: Statistics and Probability
#4
**Below are two samples of test scores from two different calculus
classes. It is believed that class 1 performed better than class
two. From previous tests, it is known that the test scores for both
classes are normally distributed and the population standard
deviation of class 1 is 10 points and the population standard
deviation of class 2 is 8 points. Do the data support that class 1
performed better.**
```{r}
class1<-c(100, 86, 98, 72, 66, 95, 93, 82)
class2<-c(98, 82, 99, 99, 70, 71, 94, 79)
```
##5
**A teaching assistant in Florida collected a sample to see if the
average number of hours students put into studying depending on if
they were in graduate school or not. The data below represents
these two samples.**
**Perform a test to determine if the time graduate students spend studying is greater than that of undergraduate students. Be sure to identify your hypotheses and explain your conclusion in the context of the question. Assume the time spend studying for both graduate and undergraduate is normally distributed.**
grad<-c(15,7,15,10,5,5,2,3,12,16,15,37,8,14,10,18,3,25,15,5,5)
undergrad<-c(6,8,15,6,5,14,10,10,12,5)
Please Solve using R studio codes with explanation
In: Statistics and Probability
Rework problem 35 from the Chapter 2 review exercises in your text, involving auditioning for a play. For this problem, assume 11 males audition, one of them being Seamus, 4 females audition, one of them being Nisha, and 5 children audition. The casting director has 4 male roles available, 2 female roles available, and 2 child roles available.
(1) How many different ways can these roles be filled from these auditioners?
(2) How many different ways can these roles be filled if exactly
one of Seamus and Nisha gets a part?
(3) What is the probability (if the roles are filled at random) of
both Seamus and Nisha getting a part
In: Statistics and Probability
Suppose x has a normal distribution with mean
μ = 52 and standard deviation σ = 9.
Describe the distribution of x values for sample size
n = 4. (Round σx to two
decimal places.)
μx | = |
σx | = |
Describe the distribution of x values for sample size
n = 16. (Round σx to two
decimal places.)
μx | = |
σx | = |
Describe the distribution of x values for sample size
n = 100. (Round σx to two
decimal places.)
μx | = |
σx | = |
How do the x distributions compare for the various samples
sizes?
The standard deviations are the same, but the means are increasing with increasing sample size.The standard deviations are the same, but the means are decreasing with increasing sample size. The means are the same, but the standard deviations are increasing with increasing sample size.The means are the same, but the standard deviations are decreasing with increasing sample size.The means and standard deviations are the same regardless of sample size.
In: Statistics and Probability
Case Problem 1: Stock Market a. Using the dataset “Stock Market”, build a table with the descriptive statistics (N, Mean, Standard Deviation, Minimum, Median and Maximum) (10 points) • Which companies had a higher mean monthly return than the market (as measured by the S&P 500)? (5 points) • Which one was the most volatile (has the largest standard deviation)? Why is the S&P Index the less volatile? (5 points) b. Find the estimated regression equation relating each of the individual stocks to the S&P 500 and the value of R-Sq for each equation. (25 points) c. Find the betas (slope of estimated regression equation) for the individual stocks from the regression output. (10 points) • What does a stock with a beta greater than 1 indicate? And less than 1? What is the stock that benefits most from a rising market? Why? (25 points) d. What dp the R-Sq values indicate? (20 points)
Month | Microsoft | Exxon Mobil | Caterpillar | Johnson & Johnson | McDonald's | Sandisk | Qualcomm | Procter & Gamble | S&P 500 |
Jan-03 | 0,21799 | 0,27739 | 0,2696 | 0,29814 | 0,18557 | 0,05133 | 0,3349 | 0,300465 | 0,272585 |
Feb-03 | 0,30211 | 0,30293 | 0,36867 | 0,28219 | 0,25576 | 0,39363 | 0,21822 | 0,256644 | 0,282996 |
Mar-03 | 0,32152 | 0,32734 | 0,34681 | 0,40334 | 0,36245 | 0,30839 | 0,34251 | 0,387833 | 0,308358 |
Apr-03 | 0,35576 | 0,30715 | 0,37622 | 0,27391 | 0,48257 | 0,73876 | 0,18556 | 0,313588 | 0,381044 |
May-03 | 0,26283 | 0,34119 | 0,29144 | 0,26859 | 0,39532 | 0,80165 | 0,35395 | 0,321925 | 0,350899 |
Jun-03 | 0,34185 | 0,28654 | 0,36731 | 0,25124 | 0,47779 | 0,4164 | 0,37124 | 0,271248 | 0,311322 |
Jul-03 | 0,33003 | 0,29081 | 0,51847 | 0,30174 | 0,34306 | 0,69734 | 0,34285 | 0,290413 | 0,316224 |
Aug-03 | 0,30417 | 0,36661 | 0,36462 | 0,26196 | 0,27436 | 0,36633 | 0,40459 | 0,293399 | 0,317873 |
Sep-03 | 0,34827 | 0,27082 | 0,25837 | 0,29879 | 0,34996 | 0,35409 | 0,30823 | 0,363352 | 0,288056 |
Oct-03 | 0,24604 | 0,29945 | 0,36987 | 0,31636 | 0,36202 | 0,56491 | 0,43967 | 0,363833 | 0,354962 |
Nov-03 | 0,28355 | 0,29645 | 0,3378 | 0,2843 | 0,3412 | 0,30273 | 0,23957 | 0,279143 | 0,307129 |
Dec-03 | 0,36457 | 0,4326 | 0,39165 | 0,34787 | 0,26879 | 0,05724 | 0,51055 | 0,337822 | 0,350765 |
Jan-04 | 0,31023 | 0,29488 | 0,24556 | 0,33407 | 0,33665 | 0,18725 | 0,38678 | 0,31657 | 0,317276 |
Feb-04 | 0,25949 | 0,33996 | 0,26954 | 0,31367 | 0,39946 | 0,23628 | 0,37763 | 0,314147 | 0,312209 |
Mar-04 | 0,23969 | 0,28625 | 0,34383 | 0,24083 | 0,30954 | 0,41566 | 0,35072 | 0,32312 | 0,283641 |
Apr-04 | 0,34813 | 0,32308 | 0,28773 | 0,36526 | 0,2531 | 0,11629 | 0,24222 | 0,313063 | 0,283209 |
May-04 | 0,30383 | 0,3228 | 0,26938 | 0,33637 | 0,26952 | 0,36479 | 0,37541 | 0,319574 | 0,312083 |
Jun-04 | 0,38883 | 0,32682 | 0,35428 | 0,29982 | 0,28485 | 0,17992 | 0,38812 | 0,309831 | 0,317989 |
Jul-04 | 0,29755 | 0,34256 | 0,23026 | 0,29228 | 0,35769 | 0,42125 | 0,24834 | 0,262528 | 0,265709 |
Aug-04 | 0,26104 | 0,30151 | 0,28925 | 0,35636 | 0,28255 | 0,26012 | 0,40157 | 0,37325 | 0,302287 |
Sep-04 | 0,31282 | 0,34837 | 0,4066 | 0,26954 | 0,33738 | 0,54711 | 0,32602 | 0,266947 | 0,309364 |
Oct-04 | 0,31157 | 0,31842 | 0,30622 | 0,33639 | 0,33996 | 0,01669 | 0,36557 | 0,250296 | 0,314014 |
Nov-04 | 0,36864 | 0,34673 | 0,4367 | 0,33811 | 0,37341 | 0,38194 | 0,30048 | 0,344939 | 0,338595 |
Dec-04 | 0,29664 | 0,3002 | 0,3651 | 0,35139 | 0,34294 | 0,40585 | 0,32042 | 0,329918 | 0,332458 |
Jan-05 | 0,28353 | 0,30663 | 0,21796 | 0,32018 | 0,31029 | 0,28919 | 0,1783 | 0,270951 | 0,27471 |
Feb-05 | 0,26043 | 0,53217 | 0,36678 | 0,31832 | 0,3213 | 0,38826 | 0,26992 | 0,29737 | 0,318903 |
Mar-05 | 0,26065 | 0,2414 | 0,26202 | 0,32378 | 0,24135 | 0,33423 | 0,31609 | 0,298305 | 0,280882 |
Apr-05 | 0,34675 | 0,25688 | 0,26741 | 0,32189 | 0,24123 | 0,15252 | 0,2525 | 0,326981 | 0,279891 |
May-05 | 0,32292 | 0,29053 | 0,36882 | 0,28251 | 0,35561 | 0,39578 | 0,37079 | 0,318467 | 0,329952 |
Jun-05 | 0,26279 | 0,3226 | 0,31275 | 0,2687 | 0,1969 | 0,21375 | 0,1857 | 0,256482 | 0,299857 |
Jul-05 | 0,331 | 0,32227 | 0,4365 | 0,284 | 0,42324 | 0,7252 | 0,496 | 0,359905 | 0,335968 |
Aug-05 | 0,37224 | 0,32451 | 0,32931 | 0,29625 | 0,34107 | 0,44814 | 0,30811 | 0,297304 | 0,288778 |
Sep-05 | 0,23974 | 0,36077 | 0,35875 | 0,29826 | 0,33205 | 0,54234 | 0,42692 | 0,371738 | 0,306949 |
Oct-05 | 0,29883 | 0,18354 | 0,1994 | 0,28957 | 0,24357 | 0,52056 | 0,18849 | 0,246351 | 0,282259 |
Nov-05 | 0,38016 | 0,33883 | 0,39869 | 0,29138 | 0,3924 | 0,16719 | 0,44361 | 0,321432 | 0,335186 |
Dec-05 | 0,24473 | 0,26795 | 0,29983 | 0,27328 | 0,29616 | 0,53032 | 0,24942 | 0,312065 | 0,299048 |
In: Statistics and Probability
Listed below are the 25 players on the opening-day roster of the 2016 New York Yankees Major League Baseball team, their salaries, and fielding positions.
Player | Position | Salary (US$) | |||||
C.C. Sabathia | Starting Pitcher | $ | 25,000,000 | ||||
Mark Teixeira | First Base | $ | 23,125,000 | ||||
Masahiro Tanaka | Starting Pitcher | $ | 22,000,000 | ||||
Jacoby Ellsbury | Center Field | $ | 21,142,857 | ||||
Alex Rodriguez | Designated Hitter | $ | 21,000,000 | ||||
Brian McCann | Catcher | $ | 17,000,000 | ||||
Carlos Beltran | Right Field | $ | 15,000,000 | ||||
Brett Gardner | Left Field | $ | 13,500,000 | ||||
Chase Headley | Third Base | $ | 13,000,000 | ||||
Andrew Miller | Relief Pitcher | $ | 9,000,000 | ||||
Starlin Castro | Second Base | $ | 7,857,142 | ||||
Nathan Eovaldi | Starting Pitcher | $ | 5,600,000 | ||||
Michael Pineda | Starting Pitcher | $ | 4,300,000 | ||||
Ivan Nova | Relief Pitcher | $ | 4,100,000 | ||||
Dustin Ackley | Left Field | $ | 3,200,000 | ||||
Didi Gregorius | Shortstop | $ | 2,425,000 | ||||
Aaron Hicks | Center Field | $ | 574,000 | ||||
Austin Romine | Catcher | $ | 556,000 | ||||
Chasen Shreve | Relief Pitcher | $ | 533,400 | ||||
Luis Severino | Starting Pitcher | $ | 521,300 | ||||
Kirby Yates | Relief Pitcher | $ | 511,900 | ||||
Ronald Torreyes | Second Base | $ | 508,600 | ||||
Johnny Barbato | Relief Pitcher | $ | 507,500 | ||||
Dellin Betances | Relief Pitcher | $ | 507,500 | ||||
Luis Cessa | Relief Pitcher | $ | 507,500 | ||||
Sort the players into two groups, all pitchers (relief and starting) and position players (all others). Assume equal population standard deviations for the pitchers and the position players. Test the hypothesis that mean salaries of pitchers and position players are equal using the 0.01 significance level.
State the null and alternative hypothesis.
What is the decision rule? (Negative amounts should be indicated by a minus sign. Round your answers to 3 decimal places.)
Calculate the test statistic. (Negative amounts should be indicated by a minus sign. Round your answer to 3 decimal places.)
What is your decision regarding H0?
Do not reject H0
Reject H0
Is there a difference in the mean salaries of pitchers and position players?
Yes
No
In: Statistics and Probability
In studies for a medication, 77 percent of patients gained weight as a side effect. Suppose 615 patients are randomly selected. Use the normal approximation to the binomial to approximate the probability that
(a) exactly 44 patients will gain weight as a side effect.
(b) no more than 44 patients will gain weight as a side effect.
(c) at least 56 patients will gain weight as a side effect. What does this result suggest?
In: Statistics and Probability
1. What is the difference between the “Between Treatment” sources in the One-Way ANOVA compared to the Two-Way ANOVA?
2. Explain how and why the numerators changes when calculating different F-statistics in a Two-Way ANOVA..
3. Explain why the denominator does not change when calculating different F-statistics in a Two-Way ANOVA.
In: Statistics and Probability
A certain flight arrives on time 86 percent of the time. Suppose 178 flights are randomly selected. Use the normal approximation to the binomial to approximate the probability that
(a) exactly 161 flights are on time.
(b) at least 161 flights are on time.
(c) fewer than 147 flights are on time.
(d) between 147 and 160, inclusive are on time.
In: Statistics and Probability
Problem 1. A drug company tested three formulations of a pain relief medicine for migraine headache sufferers. For the experiment 27 volunteers were selected and 9 were randomly assigned to one of three drug formulations. The subjects were instructed to take the drug during their next migraine headache episode and to report their pain on a scale of 1 to 10 (10 being most pain).
Drug A 4 5 4 3 2 4 3 4 4
Drug B 6 8 4 5 4 6 5 8 6
Drug C 6 7 6 6 7 5 6 5 5
You can read in data into R by the following R code.
pain = c(4, 5, 4, 3, 2, 4, 3, 4, 4, 6, 8, 4, 5, 4, 6, 5, 8, 6, 6, 7, 6, 6, 7, 5, 6, 5, 5)
drug = c(rep("A",9), rep("B",9), rep("C",9))
migraine = data.frame(pain,drug)
a). Make a boxplot to have a visual check.
b). Compute SST, SSG. SSE directly by definition
c). Compute F value directly by definition
d). Find p-value directly by definition
e). Conduct ANOVA test by R function aov.
f). Make your conclusion.
Applied stats 2 r code questions
In: Statistics and Probability
Based on historical data, your manager believes that 29% of the company's orders come from first-time customers. A random sample of 52 orders will be used to estimate the proportion of first-time-customers. What is the probability that the sample proportion is greater than than 0.37?
Note: You should carefully round any z-values you calculate to 4 decimal places to match wamap's approach and calculations.
In: Statistics and Probability
A mechanical workshop has registered that three cars with
electrical problems arrive in the morning, eight with mechanical
problems and three with sheet metal problems, and in the afternoon
two with electrical problems, three with mechanical problems and
one with sheet metal problems.
a) Construct the contingency table with the previous data.
b) Calculate the percentage of those who come in the
afternoon.
c) Calculate the percentage of those who attend due to mechanical
problems.
d) Calculate the probability that a car with electrical problems
will come in the morning.
In: Statistics and Probability
Based on past experience, a bank believes that
88%
of the people who receive loans will not make payments on time. The bank has recently approved
300
loans. Answer the following questions.
a) What are the mean and standard deviation of the proportion of clients in this group who may not make timely payments?
mu left parenthesis ModifyingAbove p with caret right parenthesisμpequals=
SD left parenthesis ModifyingAbove p with caret right parenthesisSDpequals=
(Round to three decimal places as needed.)
b) What assumptions underlie your model? Are the conditions met?
A.
With reasonable assumptions about the sample, all the conditions are met.
B.
The randomization and success/failure conditions are not met.
C.
The randomization and 10% conditions are not met.
D.
The randomization condition is not met.
E.
The 10% condition is not met.
F.
The 10% and success/failure conditions are not met.
G.
The success/failure condition is not met.
H.
Without unreasonable assumptions, none of the conditions are met.
c) What is the probability that over
10%
of these clients will not make timely payments?
Upper P left parenthesis ModifyingAbove p with caret greater than 0.1 right parenthesisPp>0.1equals=nothing
In: Statistics and Probability