Questions
Is a sample correlation coefficient ρ = 0.83 significant at the α = 0.01 level based...

Is a sample correlation coefficient ρ = 0.83 significant at the α = 0.01 level based on a sample size of n = 6 data pairs? What about n = 14 data pairs? (Select all that apply.)

Yes, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 6 and α = 0.01.

No, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 14 and α = 0.01.

No, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 6 and α = 0.01.

Yes, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 14 and α = 0.01.

Yes, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 14 and α = 0.01.

No, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 6 and α = 0.01.

Yes, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 6 and α = 0.01.

No, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 14 and α = 0.01.

(b) Is a sample correlation coefficient ρ = 0.41 significant at the α = 0.05 level based on a sample size of n = 20 data pairs? What about n = 26 data pairs? (Select all that apply.)

No, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 20 and α = 0.05.

Yes, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 20 and α = 0.05.

No, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 26 and α = 0.05.

Yes, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 26 and α = 0.05.

No, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 26 and α = 0.05.

Yes, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 26 and α = 0.05.

Yes, because the absolute value of the given correlation coefficient is greater than or equal to that for a sample size of n = 20 and α = 0.05.

No, because the absolute value of the given correlation coefficient is smaller than that for a sample size of n = 20 and α = 0.05.

(c) Is it true that in order to be significant, a ρ value must be larger than 0.90? larger than 0.70? larger than 0.50? What does sample size have to do with the significance of ρ? Explain your answer.

No, a larger sample size means that a smaller absolute value of the correlation coefficient might be significant.

Yes, a larger correlation coefficient of 0.90 means that the data will be significant.

Yes, a larger correlation coefficient of 0.50 means that the data will be significant.

Yes, a larger correlation coefficient of 0.70 means that the data will be significant.

No, sample size has no bearing on whether or not the correlation coefficient might be significant

In: Statistics and Probability

Both X and S2 are unbiased for the Poisson λ. Which is better? Use the following...

Both X and S2 are unbiased for the Poisson λ. Which is better? Use the following steps to answer this question.

a) Generate 200000 random numbers from the Poisson(λ = 2) distribution and arrange them in a matrix with 20 rows. Thus you have 10000 samples of size 20.
b) Compute the 10000 sample means and sample variances and store them in objects means and vars, respectively.

c) Compute the average of the 10000 sample means and the average of the 10000 sample variances. Report the two averages. Do they support the claim that they are unbiased for λ = 2?
d) Compute the sample variance of the 10000 sample means and the sample variance of the 10000 sample variances. Report the two variances. Which estimator of λ is preferable.

In: Statistics and Probability

The exponential distribution with rate λ has mean μ = 1/λ. Thus the method of moments...

The exponential distribution with rate λ has mean μ = 1/λ. Thus the method of moments estimator of λ is 1/X. Use the following steps to verify that X is unbiased, but 1/X is biased.

a) Generate 10000 samples of size n = 5 from the standard exponential distribution (i.e. λ = 1) using rexp(50000) and arranging the 50000 random numbers in a matrix with 5 rows.

b) Use the apply() function to compute the 10000 sample means and store them in the object means. The 10000 estimators of λ can be stored in the object lambdas by lambdas = 1/means

c) Compute the sample mean of the object means, and sample mean of the object lambdas. What can you say about the bias of X and of 1/X?

d) Repeat with a sample of size n = 10, using rexp(100000), and report your estimate of the bias of 1/X. Has the bias decreased?

In: Statistics and Probability

#4 **Below are two samples of test scores from two different calculus classes. It is believed...

#4
**Below are two samples of test scores from two different calculus classes. It is believed that class 1 performed better than class two. From previous tests, it is known that the test scores for both classes are normally distributed and the population standard deviation of class 1 is 10 points and the population standard deviation of class 2 is 8 points. Do the data support that class 1 performed better.**
```{r}
class1<-c(100, 86, 98, 72, 66, 95, 93, 82)
class2<-c(98, 82, 99, 99, 70, 71, 94, 79)


```

##5
**A teaching assistant in Florida collected a sample to see if the average number of hours students put into studying depending on if they were in graduate school or not. The data below represents these two samples.**

**Perform a test to determine if the time graduate students spend studying is greater than that of undergraduate students. Be sure to identify your hypotheses and explain your conclusion in the context of the question. Assume the time spend studying for both graduate and undergraduate is normally distributed.**

grad<-c(15,7,15,10,5,5,2,3,12,16,15,37,8,14,10,18,3,25,15,5,5)
undergrad<-c(6,8,15,6,5,14,10,10,12,5)

Please Solve using R studio codes with explanation

In: Statistics and Probability

Rework problem 35 from the Chapter 2 review exercises in your text, involving auditioning for a...

Rework problem 35 from the Chapter 2 review exercises in your text, involving auditioning for a play. For this problem, assume 11 males audition, one of them being Seamus, 4 females audition, one of them being Nisha, and 5 children audition. The casting director has 4 male roles available, 2 female roles available, and 2 child roles available.

(1) How many different ways can these roles be filled from these auditioners?



(2) How many different ways can these roles be filled if exactly one of Seamus and Nisha gets a part?


(3) What is the probability (if the roles are filled at random) of both Seamus and Nisha getting a part

In: Statistics and Probability

Suppose x has a normal distribution with mean μ = 52 and standard deviation σ =...

Suppose x has a normal distribution with mean μ = 52 and standard deviation σ = 9.

Describe the distribution of x values for sample size n = 4. (Round σx to two decimal places.)

μx =
σx =


Describe the distribution of x values for sample size n = 16. (Round σx to two decimal places.)

μx =
σx =


Describe the distribution of x values for sample size n = 100. (Round σx to two decimal places.)

μx =
σx =


How do the x distributions compare for the various samples sizes?

The standard deviations are the same, but the means are increasing with increasing sample size.The standard deviations are the same, but the means are decreasing with increasing sample size.    The means are the same, but the standard deviations are increasing with increasing sample size.The means are the same, but the standard deviations are decreasing with increasing sample size.The means and standard deviations are the same regardless of sample size.

In: Statistics and Probability

Case Problem 1: Stock Market a. Using the dataset “Stock Market”, build a table with the...

Case Problem 1: Stock Market a. Using the dataset “Stock Market”, build a table with the descriptive statistics (N, Mean, Standard Deviation, Minimum, Median and Maximum) (10 points) • Which companies had a higher mean monthly return than the market (as measured by the S&P 500)? (5 points) • Which one was the most volatile (has the largest standard deviation)? Why is the S&P Index the less volatile? (5 points) b. Find the estimated regression equation relating each of the individual stocks to the S&P 500 and the value of R-Sq for each equation. (25 points) c. Find the betas (slope of estimated regression equation) for the individual stocks from the regression output. (10 points) • What does a stock with a beta greater than 1 indicate? And less than 1? What is the stock that benefits most from a rising market? Why? (25 points) d. What dp the R-Sq values indicate? (20 points)

Month Microsoft Exxon Mobil Caterpillar Johnson & Johnson McDonald's Sandisk Qualcomm Procter & Gamble S&P 500
Jan-03 0,21799 0,27739 0,2696 0,29814 0,18557 0,05133 0,3349 0,300465 0,272585
Feb-03 0,30211 0,30293 0,36867 0,28219 0,25576 0,39363 0,21822 0,256644 0,282996
Mar-03 0,32152 0,32734 0,34681 0,40334 0,36245 0,30839 0,34251 0,387833 0,308358
Apr-03 0,35576 0,30715 0,37622 0,27391 0,48257 0,73876 0,18556 0,313588 0,381044
May-03 0,26283 0,34119 0,29144 0,26859 0,39532 0,80165 0,35395 0,321925 0,350899
Jun-03 0,34185 0,28654 0,36731 0,25124 0,47779 0,4164 0,37124 0,271248 0,311322
Jul-03 0,33003 0,29081 0,51847 0,30174 0,34306 0,69734 0,34285 0,290413 0,316224
Aug-03 0,30417 0,36661 0,36462 0,26196 0,27436 0,36633 0,40459 0,293399 0,317873
Sep-03 0,34827 0,27082 0,25837 0,29879 0,34996 0,35409 0,30823 0,363352 0,288056
Oct-03 0,24604 0,29945 0,36987 0,31636 0,36202 0,56491 0,43967 0,363833 0,354962
Nov-03 0,28355 0,29645 0,3378 0,2843 0,3412 0,30273 0,23957 0,279143 0,307129
Dec-03 0,36457 0,4326 0,39165 0,34787 0,26879 0,05724 0,51055 0,337822 0,350765
Jan-04 0,31023 0,29488 0,24556 0,33407 0,33665 0,18725 0,38678 0,31657 0,317276
Feb-04 0,25949 0,33996 0,26954 0,31367 0,39946 0,23628 0,37763 0,314147 0,312209
Mar-04 0,23969 0,28625 0,34383 0,24083 0,30954 0,41566 0,35072 0,32312 0,283641
Apr-04 0,34813 0,32308 0,28773 0,36526 0,2531 0,11629 0,24222 0,313063 0,283209
May-04 0,30383 0,3228 0,26938 0,33637 0,26952 0,36479 0,37541 0,319574 0,312083
Jun-04 0,38883 0,32682 0,35428 0,29982 0,28485 0,17992 0,38812 0,309831 0,317989
Jul-04 0,29755 0,34256 0,23026 0,29228 0,35769 0,42125 0,24834 0,262528 0,265709
Aug-04 0,26104 0,30151 0,28925 0,35636 0,28255 0,26012 0,40157 0,37325 0,302287
Sep-04 0,31282 0,34837 0,4066 0,26954 0,33738 0,54711 0,32602 0,266947 0,309364
Oct-04 0,31157 0,31842 0,30622 0,33639 0,33996 0,01669 0,36557 0,250296 0,314014
Nov-04 0,36864 0,34673 0,4367 0,33811 0,37341 0,38194 0,30048 0,344939 0,338595
Dec-04 0,29664 0,3002 0,3651 0,35139 0,34294 0,40585 0,32042 0,329918 0,332458
Jan-05 0,28353 0,30663 0,21796 0,32018 0,31029 0,28919 0,1783 0,270951 0,27471
Feb-05 0,26043 0,53217 0,36678 0,31832 0,3213 0,38826 0,26992 0,29737 0,318903
Mar-05 0,26065 0,2414 0,26202 0,32378 0,24135 0,33423 0,31609 0,298305 0,280882
Apr-05 0,34675 0,25688 0,26741 0,32189 0,24123 0,15252 0,2525 0,326981 0,279891
May-05 0,32292 0,29053 0,36882 0,28251 0,35561 0,39578 0,37079 0,318467 0,329952
Jun-05 0,26279 0,3226 0,31275 0,2687 0,1969 0,21375 0,1857 0,256482 0,299857
Jul-05 0,331 0,32227 0,4365 0,284 0,42324 0,7252 0,496 0,359905 0,335968
Aug-05 0,37224 0,32451 0,32931 0,29625 0,34107 0,44814 0,30811 0,297304 0,288778
Sep-05 0,23974 0,36077 0,35875 0,29826 0,33205 0,54234 0,42692 0,371738 0,306949
Oct-05 0,29883 0,18354 0,1994 0,28957 0,24357 0,52056 0,18849 0,246351 0,282259
Nov-05 0,38016 0,33883 0,39869 0,29138 0,3924 0,16719 0,44361 0,321432 0,335186
Dec-05 0,24473 0,26795 0,29983 0,27328 0,29616 0,53032 0,24942 0,312065 0,299048

In: Statistics and Probability

Listed below are the 25 players on the opening-day roster of the 2016 New York Yankees...

Listed below are the 25 players on the opening-day roster of the 2016 New York Yankees Major League Baseball team, their salaries, and fielding positions.

Player Position Salary (US$)
C.C. Sabathia Starting Pitcher $ 25,000,000
Mark Teixeira First Base $ 23,125,000
Masahiro Tanaka Starting Pitcher $ 22,000,000
Jacoby Ellsbury Center Field $ 21,142,857
Alex Rodriguez Designated Hitter $ 21,000,000
Brian McCann Catcher $ 17,000,000
Carlos Beltran Right Field $ 15,000,000
Brett Gardner Left Field $ 13,500,000
Chase Headley Third Base $ 13,000,000
Andrew Miller Relief Pitcher $ 9,000,000
Starlin Castro Second Base $ 7,857,142
Nathan Eovaldi Starting Pitcher $ 5,600,000
Michael Pineda Starting Pitcher $ 4,300,000
Ivan Nova Relief Pitcher $ 4,100,000
Dustin Ackley Left Field $ 3,200,000
Didi Gregorius Shortstop $ 2,425,000
Aaron Hicks Center Field $ 574,000
Austin Romine Catcher $ 556,000
Chasen Shreve Relief Pitcher $ 533,400
Luis Severino Starting Pitcher $ 521,300
Kirby Yates Relief Pitcher $ 511,900
Ronald Torreyes Second Base $ 508,600
Johnny Barbato Relief Pitcher $ 507,500
Dellin Betances Relief Pitcher $ 507,500
Luis Cessa Relief Pitcher $ 507,500

Sort the players into two groups, all pitchers (relief and starting) and position players (all others). Assume equal population standard deviations for the pitchers and the position players. Test the hypothesis that mean salaries of pitchers and position players are equal using the 0.01 significance level.

  1. State the null and alternative hypothesis.

  1. What is the decision rule? (Negative amounts should be indicated by a minus sign. Round your answers to 3 decimal places.)

  1. Calculate the test statistic. (Negative amounts should be indicated by a minus sign. Round your answer to 3 decimal places.)

  1. What is your decision regarding H0?

  • Do not reject H0

  • Reject H0

  1. Is there a difference in the mean salaries of pitchers and position players?

  • Yes

  • No

In: Statistics and Probability

In studies for a​ medication, 77 percent of patients gained weight as a side effect. Suppose...

In studies for a​ medication, 77 percent of patients gained weight as a side effect. Suppose 615 patients are randomly selected. Use the normal approximation to the binomial to approximate the probability that

​(a) exactly 44 patients will gain weight as a side effect.

​(b) no more than 44 patients will gain weight as a side effect.

​(c) at least 56 patients will gain weight as a side effect. What does this result​ suggest?

In: Statistics and Probability

1. What is the difference between the “Between Treatment” sources in the One-Way ANOVA compared to...

1. What is the difference between the “Between Treatment” sources in the One-Way ANOVA compared to the Two-Way ANOVA?

2. Explain how and why the numerators changes when calculating different F-statistics in a Two-Way ANOVA..

3. Explain why the denominator does not change when calculating different F-statistics in a Two-Way ANOVA.

In: Statistics and Probability

A certain flight arrives on time 86 percent of the time. Suppose 178 flights are randomly...

A certain flight arrives on time 86 percent of the time. Suppose 178 flights are randomly selected. Use the normal approximation to the binomial to approximate the probability that

​(a) exactly 161 flights are on time.

​(b) at least 161 flights are on time.

​(c) fewer than 147 flights are on time.

​(d) between 147 and 160​, inclusive are on time.

In: Statistics and Probability

Problem 1. A drug company tested three formulations of a pain relief medicine for migraine headache...

Problem 1. A drug company tested three formulations of a pain relief medicine for migraine headache sufferers. For the experiment 27 volunteers were selected and 9 were randomly assigned to one of three drug formulations. The subjects were instructed to take the drug during their next migraine headache episode and to report their pain on a scale of 1 to 10 (10 being most pain).

             Drug A 4 5 4 3 2 4 3 4 4

             Drug B 6 8 4 5 4 6 5 8 6

             Drug C 6 7 6 6 7 5 6 5 5

You can read in data into R by the following R code.

pain = c(4, 5, 4, 3, 2, 4, 3, 4, 4, 6, 8, 4, 5, 4, 6, 5, 8, 6, 6, 7, 6, 6, 7, 5, 6, 5, 5)

drug = c(rep("A",9), rep("B",9), rep("C",9))

migraine = data.frame(pain,drug)

a). Make a boxplot to have a visual check.

b). Compute SST, SSG. SSE directly by definition

c). Compute F value directly by definition

d). Find p-value directly by definition

e). Conduct ANOVA test by R function aov.

f). Make your conclusion.

Applied stats 2 r code questions

In: Statistics and Probability

Based on historical data, your manager believes that 29% of the company's orders come from first-time...

Based on historical data, your manager believes that 29% of the company's orders come from first-time customers. A random sample of 52 orders will be used to estimate the proportion of first-time-customers. What is the probability that the sample proportion is greater than than 0.37?

Note: You should carefully round any z-values you calculate to 4 decimal places to match wamap's approach and calculations.

In: Statistics and Probability

A mechanical workshop has registered that three cars with electrical problems arrive in the morning, eight...

A mechanical workshop has registered that three cars with electrical problems arrive in the morning, eight with mechanical problems and three with sheet metal problems, and in the afternoon two with electrical problems, three with mechanical problems and one with sheet metal problems.
a) Construct the contingency table with the previous data.
b) Calculate the percentage of those who come in the afternoon.
c) Calculate the percentage of those who attend due to mechanical problems.
d) Calculate the probability that a car with electrical problems will come in the morning.

In: Statistics and Probability

Based on past​ experience, a bank believes that 88​% of the people who receive loans will...

Based on past​ experience, a bank believes that

88​%

of the people who receive loans will not make payments on time. The bank has recently approved

300

loans. Answer the following questions.

​a) What are the mean and standard deviation of the proportion of clients in this group who may not make timely​ payments?

mu left parenthesis ModifyingAbove p with caret right parenthesisμpequals=

SD left parenthesis ModifyingAbove p with caret right parenthesisSDpequals=

​(Round to three decimal places as​ needed.)

​b) What assumptions underlie your​ model? Are the conditions​ met?

A.

With reasonable assumptions about the​ sample, all the conditions are met.

B.

The randomization and​ success/failure conditions are not met.

C.

The randomization and​ 10% conditions are not met.

D.

The randomization condition is not met.

E.

The​ 10% condition is not met.

F.

The​ 10% and​ success/failure conditions are not met.

G.

The​ success/failure condition is not met.

H.

Without unreasonable​ assumptions, none of the conditions are met.

​c) What is the probability that over

10​%

of these clients will not make timely​ payments?

Upper P left parenthesis ModifyingAbove p with caret greater than 0.1 right parenthesisPp>0.1equals=nothing

In: Statistics and Probability