Question

In: Statistics and Probability

solve using r if coding is required The data in air-pollution.txt are 42 measurements on air-pollution...

solve using r if coding is required

The data in air-pollution.txt are 42 measurements on air-pollution variables recorded at noon in the Los Angeles area on different days.

Variables: X1 = Wind, X2 = Solar radiation, X3 = Carbon monoxide, X4 = Nitric oxide, X5 = Nitrogen dioxide, X6 = Ozone, and X7 = Hydrocarbon content.

(a) Evaluate the sample mean vector x.

(b) Evaluate the sample variance-covariance matrix, S, and its inverse, S−1.

(c) Evaluate the sample correlation matrix R. Interpret the pairwise correlations. Also con- struct a scatterplot matrix (a matrix of scatter plots) for these data

(d) Construct a normal quantile plot (Q-Q plot) for the solar radiation measurements (X2) and carry out a test for normality using the Shapiro-Wilk test. Does the data appear to be normally distributed? Explain.

data is

Wind Radiation CO NO NO2 O3 HC
  8  98  7  2  12  8  2
  7  107  4  3  9  5  3
  7  103  4  3  5  6  3
  10  88  5  2  8  15  4
  6  91  4  2  8  10  3
  8  90  5  2  12  12  4
  9  84  7  4  12  15  5
  5  72  6  4  21  14  4
  7  82  5  1  11  11  3
  8  64  5  2  13  9  4
  6  71  5  4  10  3  3
  6  91  4  2  12  7  3
  7  72  7  4  18  10  3
  10  70  4  2  11  7  3
  10  72  4  1  8  10  3
  9  77  4  1  9  10  3
  8  76  4  1  7  7  3
  8  71  5  3  16  4  4
  9  67  4  2  13  2  3
  9  69  3  3  9  5  3
  10  62  5  3  14  4  4
  9  88  4  2  7  6  3
  8  80  4  2  13  11  4
  5  30  3  3  5  2  3
  6  83  5  1  10  23  4
  8  84  3  2  7  6  3
  6  78  4  2  11  11  3
  8  79  2  1  7  10  3
  6  62  4  3  9  8  3
  10  37  3  1  7  2  3
  8  71  4  1  10  7  3
  7  52  4  1  12  8  4
  5  48  6  5  8  4  3
  6  75  4  1  10  24  3
  10  35  4  1  6  9  2
  8  85  4  1  9  10  2
  5  86  3  1  6  12  2
  5  86  7  2  13  18  2
  7  79  7  4  9  25  3
  7  79  5  2  8  6  2
  6  68  6  2  11  14  3
  8  40  4  3  6  5  2

Solutions

Expert Solution


Related Solutions

Solve using coding in R script: 1) Given a standard normal distribution, find the value of...
Solve using coding in R script: 1) Given a standard normal distribution, find the value of k such that P(Z < k) = 0.0197. 2) Given a normal distribution with mu = E(X) = 32 and sigma^2 = V(X) = 30, find the normal curve area to the left of x = 31. Report your code as well as your final answer (4 decimals). 3) A company pays its employees an average wage of $17.90 an hour with a standard...
For expert using R I try to solve this question((USING DATA FAITHFUL)) but each time I...
For expert using R I try to solve this question((USING DATA FAITHFUL)) but each time I solve it, I have error , I try it many times. So,everything you write will be helpful.. Modify the EM-algorithm functions to work for a general K component Gaussian mixtures. Please use this function to fit a K= 1;2;3;4 modelto the old faithful data available in R (You need to initialize the EM-algorithm First ).    Which modelseems to t the data better? (Hint:...
solve using r and Consider the data birds.txt. A wildlife ecologist measured X1 = Tail length...
solve using r and Consider the data birds.txt. A wildlife ecologist measured X1 = Tail length (in millimeters) and X2 = Wing length (in millimeters) for a sample of n = 45 female hook-billed kites. (a) Construct both Q-Q plots and histograms from the marginal distributions of tail length (X1) and wing length (X2). Do these data appear to be normally distributed? Discuss. (b) Is the bivariate normal distribution a viable population model? Discuss. (c) Consider the variable, wing length...
solve using r and Consider the data birds.txt. A wildlife ecologist measured X1 = Tail length...
solve using r and Consider the data birds.txt. A wildlife ecologist measured X1 = Tail length (in millimeters) and X2 = Wing length (in millimeters) for a sample of n = 45 female hook-billed kites. (a) Evaluate the sample mean vector x. (b) Evaluate the sample variance-covariance matrix, S. (c) Determine the eigenvalue and eigenvector pairs for S. (d) Evaluate the sample correlation matrix R. Interpret the sample correlation, r12. the data is Tail Wing 191 284 197 285 208...
The data below shows the observed pollution indexes of air samples that are randomly chosen in...
The data below shows the observed pollution indexes of air samples that are randomly chosen in two areas of a city. The pollution indexes of air are said to be normally distributed from both areas. Use a 95% confidence interval to determine if the mean pollution indexes of air are different for the two areas. Area A Area B 2.92 4.69              1.84    3.44 1.88 4.86             0.95    3.69 5.35    5.81              4.26      4.95 3.81      5.55              3.18    4.47
Use R for coding Fit a density estimate to the data set `pi2000` (**UsingR**). Compare with...
Use R for coding Fit a density estimate to the data set `pi2000` (**UsingR**). Compare with the appropriate histogram. Why might you want to add an argument like `breaks = 0:10-.5` to `hist()`? I know this much code: install.packages("UsingR") library(UsingR) data(pi2000) if you put this in R a data set will appear
How do you solve this using R? The file "flow-occ.csv" contains data collected by loop detectors...
How do you solve this using R? The file "flow-occ.csv" contains data collected by loop detectors at a particular location of eastbound Interstate 80 in Sacramento, California, from March 14-20, 2003. For each of three lanes, the flow (the number of cars) and the occupancy (the percentage of time a car was over the loop) were recorded in successive five-minute intervals. There were 1740 such five-minute intervals. Lane 1 is the farthest left lane, lane 2 is in the center,...
show coding and answer using R A deck of 16 cards, from the bottom up, consists...
show coding and answer using R A deck of 16 cards, from the bottom up, consists of 4 ♣, 4 ♠, 4 ♦, and 4 ♥. The cards are cut at a random place. Let Tk be the event that the all the cards of every suit are together after cut k. (a) Assuming that all possible cuts are equally likely, find P(T2). (b) Find P(Tk+1) in terms of P(Tk). (c) If the deck is repeatedly cut many, many times,...
Please solve all of the question using R and do clarify the answers. Using the (SATGPA)...
Please solve all of the question using R and do clarify the answers. Using the (SATGPA) data set in (Stat2Data) package. Test by using ?= .01. 1) Create the following three variables and then print out all the six variables. A) Create new variable "SAT", which is the sum of (MathSAT) and (VerbalSAT). B) Create second new variable ("SATLevel"), and assign the value of( "SATLevel") as 1 when SAT<=1100, 2 when 1100<SAT<=1200, 3 when 1200<SAT<=1300, and 4 when SAT>1300. C)Create...
Go into R and view all of the data sets preloaded in R by using the...
Go into R and view all of the data sets preloaded in R by using the data() command. As you see there are quite a few data sets loaded into R. Now retrieve the dataset women using data(women). This data is from a random sample of 15 women, recording the height and weight of each woman in the sample. I want you to create a 95% confidence interval for the population mean using this data and R. First find the...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT