Question

In: Statistics and Probability


Pls attempt both parts for UPVOTE

a) For the following dataset:-

A B C
82.95406 48.596 62.83925
80.2694 94.88806 69.11351
51.32409 87.00438 5.26083
84.40903 73.14477 67.37821
7.744191 70.83899 30.42249
70.09185 96.19882 35.38787
27.85478 70.86354 10.82541
36.31444 54.53047 94.30487
78.58975 88.44509 91.97403
78.83427 97.59331 67.44993
40.58147 62.05577 67.98824
5.522503 0.005762 28.78233
51.0516 75.53139 82.53751
22.99913 6.099075 16.05481
37.90452 78.80319 33.0078
90.42208 68.23812 86.88297
59.52895 23.34578 8.346984
96.59504 52.17967 75.20052
98.23697 87.31435 97.50355
56.66422 25.66281 27.79151
16.59429 84.47958 61.71686
53.90397 10.89486 93.26763
55.11838 13.11304 75.92159
71.32999 70.36975 10.86584
40.88035 84.11119 97.83293
88.07786 10.15206 76.98687
86.25806 68.54747 98.22674
14.63472 37.58765 68.50834
48.94452 77.09557 45.1666
83.50869 20.72787 33.30376
59.14445 55.82262 96.20811
1.253421 18.14296 71.29829
32.03952 22.48347 1.707322
82.10399 54.66754 71.42761
1.551587 88.15809 13.04672
55.40726 71.10242 10.2861
66.0299 17.13271 90.60817
70.02227 49.47755 9.984934
11.2358 99.71097 2.637771
54.2171 64.7902 28.8158

Please examine: Does 68.26% of the data fall within one standard deviation of the mean? Does 95% of the data fall within two standard deviations? Does 99.7% of the data fall within three standard deviations? Does the statistical analysis behave as expected?

Please also create a histogram for the data. Use 10 “bins” in your histogram.

b)

For the following dataset:-

A B
65.77503 64.79644
87.57873 81.42366
69.16423 47.8631
78.7769 74.97734
39.29159 36.33522
83.14534 67.22618
49.35916 36.51458
45.42246 61.71659
83.51742 86.33629
88.21379 81.29251
51.31862 56.87516
2.764132 11.43687
63.2915 69.70683
14.5491 15.05101
58.35385 49.90517
79.3301 81.84772
41.43736 30.40724
74.38735 74.65841
92.77566 94.35162
41.16351 36.70618
50.53694 54.26358
32.39941 52.68882
34.11571 48.051
70.84987 50.85519
62.49577 74.27482
49.11496 58.4056
77.40276 84.34409
26.11119 40.24357
63.02004 57.0689
52.11828 45.84677
57.48353 70.39173
9.698192 30.23156
27.26149 18.74344
68.38577 69.39971
44.85484 34.25214
63.25484 45.59859
41.5813 57.92359
59.74991 43.16158
55.47338 37.86151
59.50365 49.27436

Determine the mean and standard deviations for these new sets and create histograms for them as well. Examine the standard deviation ranges and comment on whether these new sets provide a “Normal” distribution.

Solutions

Expert Solution

R code:

#Reading the data (a single column of values, no header row)
x=read.csv(file.choose(),header = FALSE)
v=x$V1
s=sd(v)
m=mean(v)

#Percentage of values that lie within one sd
length(which(v<m+s & v>m-s))/length(v)*100

#Percentage of values that lie within two sd
s1=2*s
length(which(v<m+s1 & v>m-s1))/length(v)*100

#Percentage of values that lie within 3 sd
s2=3*s
length(which(v<m+s2 & v>m-s2))/length(v)*100


#Check whether data is normal (points far from the reference line suggest non-normality)
qqnorm(v)
qqline(v)

Running the code above shows that 58.33333% of the values lie within one sd of the mean.

However, 100% of the values lie within two sd of the mean.

Likewise, 100% of the values lie within three sd of the mean.

The reason is that the standard deviation is very large relative to the range of the data, so the interval of mean ± 2 sd already covers every value.

The rule that about 68.26%, 95% and 99.7% of the values lie within one, two and three sd of the mean, respectively, holds only if the data are (at least approximately) normally distributed.
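As a quick check on where those percentages come from, here is a minimal sketch (an illustration, not part of the original answer) that computes the share of a normal distribution lying within k standard deviations of its mean, using R's built-in pnorm. The variable name normal_coverage is just a hypothetical label.

```r
# Theoretical percentage of a normal distribution lying within
# k standard deviations of the mean, for k = 1, 2, 3.
k <- 1:3
normal_coverage <- 100 * (pnorm(k) - pnorm(-k))
print(round(normal_coverage, 2))
# values, rounded to two decimals: 68.27 95.45 99.73
```

So the "95%" in the question is a rounded figure; the exact two-sd value is closer to 95.45%.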

So we checked the Q-Q (quantile-quantile) plot of the data, which showed that the data do not follow a normal distribution.
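The same check can also be run separately on each of the columns A, B and C. The sketch below is one possible way to do that; the helper pct_within_sd and the data frame dat are hypothetical names, and dat holds only the first five rows of dataset (a) for illustration, so its output will not match the figures quoted above.

```r
# Hypothetical helper (not in the original answer): percentage of the values
# in v lying strictly within k sample standard deviations of the mean.
pct_within_sd <- function(v, k = 1) {
  m <- mean(v)
  s <- sd(v)
  100 * mean(v > m - k * s & v < m + k * s)
}

# Illustration only: the first five rows of dataset (a), not the full data.
dat <- data.frame(
  A = c(82.95406, 80.2694, 51.32409, 84.40903, 7.744191),
  B = c(48.596, 94.88806, 87.00438, 73.14477, 70.83899),
  C = c(62.83925, 69.11351, 5.26083, 67.37821, 30.42249)
)

# One row per k (1, 2, 3 sd), one column per variable.
coverage <- sapply(dat, function(col) sapply(1:3, function(k) pct_within_sd(col, k)))
rownames(coverage) <- paste0(1:3, " sd")
print(coverage)
```

To apply it to the full data, replace dat with the data frame read from the file, assuming it has one column per variable.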

Histogram R code:


#Check the range of the data so that the breaks cover every value
min(v)
max(v)
#Ten bins of width 10 from 0 to 100
hist(v,breaks=seq(0,100,by=10),main="Histogram of data",xlab="Bins")

Output:

The histogram also makes it evident that the data do not follow a normal distribution.
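Part (b) asks for the mean, standard deviation and histogram of each new column. The answer above does not cover it, so what follows is only a rough sketch of how the same approach could be extended. The names dat_b and summary_b are hypothetical, dat_b holds just the first six rows of dataset (b) for illustration, and the Shapiro-Wilk test is an extra check that the question did not ask for.

```r
# Illustration only: the first six rows of dataset (b), not the full data.
dat_b <- data.frame(
  A = c(65.77503, 87.57873, 69.16423, 78.7769, 39.29159, 83.14534),
  B = c(64.79644, 81.42366, 47.8631, 74.97734, 36.33522, 67.22618)
)

# Mean and sample standard deviation of each column.
summary_b <- sapply(dat_b, function(col) c(mean = mean(col), sd = sd(col)))
print(summary_b)

# One histogram per column, 10 bins of width 10 covering 0 to 100
# (this assumes all values lie in that range, as they do in the question).
for (name in names(dat_b)) {
  hist(dat_b[[name]], breaks = seq(0, 100, by = 10),
       main = paste("Histogram of", name), xlab = name)
}

# Shapiro-Wilk test as a numeric complement to the plots;
# a small p-value suggests a departure from normality.
print(sapply(dat_b, function(col) shapiro.test(col)$p.value))
```

With the full 40 rows loaded into dat_b, the standard-deviation ranges can then be examined in the same way as in part (a).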

