Question

Find a dataset online, and get a feel for it by performing some EDA. Produce a single plot which you think captures an interesting aspect of the data, and comment on it. (If you wish to use R, there are many datasets already built in, e.g. in the package ‘MASS’; if you are not using R, datasets are readily available: a simple Google search for ‘sample datasets’ yields numerous results, for example.)

Solution:

R code:

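library(MASS)
data()  # lists the datasets available in the attached packages
# Assumes a data frame HumanBodyTemp with a numeric column temp is already loaded.
print(HumanBodyTemp)
sum(HumanBodyTemp$temp)
mean(HumanBodyTemp$temp)
sd(HumanBodyTemp$temp)
length(HumanBodyTemp$temp)
median(HumanBodyTemp$temp)

# Points lying more than 1.5 * IQR beyond the hinges.
outlier_values <- boxplot.stats(HumanBodyTemp$temp)$out
boxplot(HumanBodyTemp$temp, main="Body temperature", boxwex=0.1)
mtext(paste("Outliers: ", paste(outlier_values, collapse=", ")), cex=0.6)
boxplot(HumanBodyTemp$temp, main="boxplot")

hist(HumanBodyTemp$temp, main="histogram")
fivenum(HumanBodyTemp$temp)
shapiro.test(HumanBodyTemp$temp)  # normality test referred to in the interpretation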

Interpretation:

The dataset has one numerical variable, called temp.

mean=98.524

median=98.6

standard deviation=0.6777905

No outliers are seen in the boxplot. The five-number summary (from fivenum) is:

Minimum=97.4

Q1=98.0

Q2=median=98.6

Q3=99.0

Maximum=100.0
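
The "no outliers" reading rests on the usual boxplot rule: a point is flagged when it lies more than 1.5 times the hinge spread beyond the lower or upper hinge. A minimal sketch of that check follows. It uses a simulated stand-in vector named temp (an assumption for illustration, not the actual HumanBodyTemp data), so its numbers will not match the values reported above.

```r
# Simulated stand-in for HumanBodyTemp$temp (assumption: NOT the real data).
set.seed(1)
temp <- round(rnorm(50, mean = 98.5, sd = 0.7), 1)

# Tukey's five-number summary: min, lower hinge, median, upper hinge, max.
five <- fivenum(temp)
names(five) <- c("min", "lower_hinge", "median", "upper_hinge", "max")

# Fences used by the boxplot rule: 1.5 times the hinge spread beyond each hinge.
spread <- five[["upper_hinge"]] - five[["lower_hinge"]]
lower_fence <- five[["lower_hinge"]] - 1.5 * spread
upper_fence <- five[["upper_hinge"]] + 1.5 * spread

# Values outside the fences are the ones a boxplot would draw as outliers.
manual_out <- temp[temp < lower_fence | temp > upper_fence]

print(five)
print(manual_out)
```

If manual_out is empty, the boxplot shows no outlier points, which is the situation described for temp.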

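From the histogram and shapiro.test (the Shapiro-Wilk normality test):

p=0.7001

p>0.05

Since p>0.05, the hypothesis of normality is not rejected; the variable temp is consistent with a normal distribution.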
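
The normality check can be sketched as below. The vector temp is again simulated (an assumption, not the HumanBodyTemp data), so the p-value it prints is not the 0.7001 quoted above. The 0.05 threshold is the conventional choice used in this answer. A normal Q-Q plot is added as a visual companion to the test.

```r
# Simulated stand-in for HumanBodyTemp$temp (assumption: NOT the real data).
set.seed(1)
temp <- rnorm(50, mean = 98.5, sd = 0.7)

# Shapiro-Wilk test. Null hypothesis: the sample comes from a normal distribution.
sw <- shapiro.test(temp)
print(sw$p.value)

# A large p-value means no evidence against normality; it does not prove normality.
if (sw$p.value > 0.05) {
  message("Fail to reject normality at the 5% level")
} else {
  message("Reject normality at the 5% level")
}

# Visual check: points close to the reference line suggest approximate normality.
qqnorm(temp, main = "Normal Q-Q plot of temp")
qqline(temp)
```

The Q-Q plot is also a reasonable candidate for the single plot the question asks for, since it shows directly how closely the sample tracks a normal distribution.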


