Question

In: Statistics and Probability

Please use R and R studio The accompanying observations are precipitation values during March over a...

Please use R and R studio

The accompanying observations are precipitation values during March over a 30-year period in Minneapolis-St. Paul.

.77 1.20 3.00 1.62 2.81 2.48

1.74 .47 3.09 1.31 1.87 .96

.81 1.43 1.51 .32 1.18 1.89

1.20 3.37 2.10 .59 1.35 .90

1.95 2.20 .52 .81 4.75 2.05

a. Construct and interpret a normal probability plot for this data set.

b. Calculate the square root of each value and then construct a normal probability plot based on this transformed data. Does it seem plausible that the square root of precipitation is normally distributed?

c. Repeat part (b) after transforming by cube roots

Solutions

Expert Solution

Solutiona

a. Construct and interpret a normal probability plot for this data set.

Rcode:

precipitation <- c(.77, 1.20, 3.00, 1.62 ,2.81, 2.48,

1.74 ,.47, 3.09, 1.31 ,1.87 ,.96,

.81, 1.43, 1.51, .32, 1.18, 1.89,

1.20, 3.37, 2.10 ,.59, 1.35, .90,

1.95 ,2.20 ,.52, .81, 4.75, 2.05)
qqnorm(precipitation)
qqline(precipitation)

Output:

from normal probability plot

precipitaion datset values does not follow normal distribution as there is a deviation from straight line

SolutionB:

Rcode:

sqrt_prectpitation <- sqrt(precipitation)
print(sqrt_prectpitation)
qqnorm(sqrt_prectpitation)
qqline(sqrt_prectpitation)

output:

From plot we observe square root of precipitation is normally distributed s most of the observations after transformation fall on straight line.

Solutionc:

Rcode:

library(kader)
cuberoot_precipitation <- kader:::cuberoot(x =precipitation )
qqnorm(cuberoot_precipitation,main="Normal Q-Q plot for cube root of precipitation")
qqline(cuberoot_precipitation)
shapiro.test(cuberoot_precipitation)

From normal QQplot we observe that cube root of precipitation is normally distributed as most of the observations after transformation fall on straight line.


Related Solutions

Use R studio As of March 23, 2020, the number of COVID19 cases in the state...
Use R studio As of March 23, 2020, the number of COVID19 cases in the state of Florida was 1007, with 13 deaths. The number of cases in Louisiana was 837, with 20 deaths. We are interested in comparing the state’s mortality rates.   Construct a 95% confidence intervals for the difference of the two unknown population proportions. Draw some conclusions. Test the hypothesis that the state of Louisiana has a higher mortality rate at a = 0.1. What are the...
Please use R and R studio A sample of 15 female collegiate golfers was selected and...
Please use R and R studio A sample of 15 female collegiate golfers was selected and the clubhead velocity (km/hr) while swinging a driver was determined for each one, resulting in the following data (“Hip Rotational Velocities During the Full Golf Swing,” J.of Sports Science and Medicine, 2009: 296–299): 69.0 69.7 72.7 80.3 81.0 85.0 86.0 86.3 86.7 87.7 89.3 90.7 91.0 92.5 93.0 The corresponding z percentiles are -1.83 -1.28 -0.97 -0.73 -0.52 -0.34 -0.17 0.0 0.17 0.34 0.52...
Please Use R studio to answer the question. This is the Statistics section of Comparing Groups....
Please Use R studio to answer the question. This is the Statistics section of Comparing Groups. One month before the election, a poll of 630 randomly selected votes showed 54% planning to vote for a certain candidate. A week later, it became known that he had had an extramarital affair, and a new poll showed only 51% of 1010 voters supporting him. Do these results indicate a decrease in voter support fo his candidacy? a) Test an appropriate hypothesis as...
THIS QUESTION REQUIRES THE USE OF R STUDIO. ANY ANSWERS GIVEN THAT ARE NOT IN R...
THIS QUESTION REQUIRES THE USE OF R STUDIO. ANY ANSWERS GIVEN THAT ARE NOT IN R STUDIO CODE WILL NOT SUFFICE. SOLVING WITHOUT THE USE OF R STUDIO IS NOT ACCEPTABLE. The previous question was: Annual salaries for a large company are approximately normally distributed with a mean of 49000 dollars and a standard deviation of 2000 dollars. One manager claims that all of his direct reports are paid "above the 75th percentile" for the company. What is the minimum...
Please use R studio, Thank you. 2. The probability of a student passing statistics is known...
Please use R studio, Thank you. 2. The probability of a student passing statistics is known to be 0.41; and the probability of a student passing chemistry is known to be 0.55. If the probability of passing both is known to be 0.35, calculate: (a) the probability of passing at least one of statistics and chemistry (b) the probability of a student passing chemistry, given that they passed statistics (c) Are passing chemistry and statistics independent? Justify (d) (harder) a...
Please Use R studio and show all the steps to answer this question NY Marathon 2013...
Please Use R studio and show all the steps to answer this question NY Marathon 2013 the table below shows the winning times (in minutes) for men and women in the new york city marathon between 1978 and 2013. (the race was not run in 2012 because of superstorm sandy.) assuming that performances in the big apple resemble performances elsewhere, we can think of these data as a sample of performance in marathon competitions. Create a 90% confidence interval for...
Please Use R studio to answer this question NY Marathon 2013 the table below shows the...
Please Use R studio to answer this question NY Marathon 2013 the table below shows the winning times (in minutes) for men and women in the new york city marathon between 1978 and 2013. (the race was not run in 2012 because of superstorm sandy.) assuming that performances in the big apple resemble performances elsewhere, we can think of these data as a sample of performance in marathon competitions. Create a 90% confidence interval for the mean difference in winning...
I want this to be solved using R studio or R software, please. Here is the...
I want this to be solved using R studio or R software, please. Here is the example: The data in stat4_prob5 present the performance of a chemical process as a function of sever controllable process variables. (a) Fit a multiple regression modelrelating CO2product (y) to total solvent (x1) and hydrogen consumption (x2) and report the fitted regression line. (b) Find a point estimatefor the variance term σ2. (c) Construct the ANOVA tableand test for the significance of the regression using...
1. Basic use of R/R Studio. Solve the following problem in R and print out the...
1. Basic use of R/R Studio. Solve the following problem in R and print out the commands and outputs. (a) Create a vector of the positive odd integers less than 100; Remove the values greater than 60 and less than 80; Find the variance of the remaining set of values (b) What’s the difference in output between the commands 2*1:5 and (2*1):5? Why is there a difference? (c) If you wanted to enter the odd numbers from 1 to 19...
please use R studio to answer the following questions 1. An eductional theorist collects behavioural data...
please use R studio to answer the following questions 1. An eductional theorist collects behavioural data from two groups of children in an early childhood center. She measures how much time the children are active (e.g. running or swinging on the monkey bars) in minutes. The first group of children are encouraged to run about and as such are expected to be active; the second group is encouraged to sit still and paint, and are expected to be less active....
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT