In: Statistics and Probability

Find a dataset online, and get a feel for it by performing some EDA. Produce a single plot which you think captures an interesting aspect of the data, and comment on it. (If you wish to use R, there are many data sets already built in, e.g. to the package ‘MASS’; if you are not using R, datasets are readily available -a simple Google search of ’sample datasets’ yields numerous results, for example.)

Solution:

Rcode:

library(MASS)

data()

print(HumanBodyTemp )

sum(HumanBodyTemp$temp)

mean(HumanBodyTemp$temp)

sd(HumanBodyTemp$temp)

length(HumanBodyTemp$temp)

median(HumanBodyTemp$temp)

outlier_values <- boxplot.stats(HumanBodyTemp$temp)$out #
outlier values.

boxplot(HumanBodyTemp$temp, main="Age ", boxwex=0.1)

mtext(paste("Outliers: ", paste(outlier_values, collapse=", ")),
cex=0.6)

boxplot(HumanBodyTemp$temp,main="boxplot")

hist(HumanBodyTemp$temp,main="histogram")

fivenum(HumanBodyTemp$temp)

Intrepretation:

It ahs one numerical variable called temp

mean=98.524

median= 98.6

standard deviation=0.6777905

No outliers seen from boxplot,Fivenumber summary from boxplot is

Minimum value= 97.4

Q1=98.0

Q2=median= 98.6

Q3=99.0

Maximum=100.0

From Histogram and shapiro.test

p=0.7001

p>0.05

variable temp follows normal distribution.

R scrreenshot

Install the Analysis Tool Pak on your computer. Find any dataset
online in the public domain. Copy around 15-20 into Excel, and then
create a randomized data set. Paste it here.

Install the Analysis Tool Pak on your computer. Find any
dataset online in the public domain. Copy around 15-20 into Excel,
and then create a randomized data set. Paste it here.
A Gallup pool indicated that 29% of Americans spent more money
in recent months than they used to. But the majority (58%) still
said they enjoy saving money more than spending it. The results are
based on telephone interviews conducted in April with a random
sample of 1,016 adults,...

You will be performing an analysis on a dataset that contains
data on fertility and life expectancy for 198 different countries.
All data is from the year 2013. The fertility numbers are the
average number of children per woman in each of the countries. The
life expectancy numbers are the average life expectancy in each of
the countries.
You will be turning in a paper that should include section
headings, graphics and tables when appropriate and complete
sentences which explain...

You will be using your Framingham dataset to answer
the following questions. You will be performing hypothesis testing.
For each question, please write out the null hypothesis, alternate
hypothesis, which test statistic you will be using (based on
variable type). Then report the results from performing the
analysis using SPSS. Make sure to report the test statistic,
significance level, and whether you will accept or reject the null
hypothesis and why. Finally, if you find significant differences,
report the proper....

PHC 281
“Historically, people have always tried to answer their health questions at home and made personal choices about whether and when to consult a clinician. Many have now added the internet to their personal health toolbox, helping themselves and their loved ones better understand what might be ailing them.” Fox, S., & Duggan, M. (2013). Health online 2013. Health, 2013, 1-55.
Discuss why some people go online to get diagnoses instead of going to a doctor/health professional.

PHC281
“Historically, people have always tried to answer their health questions at home and made personal choices about whether and when to consult a clinician. Many have now added the internet to their personal health toolbox, helping themselves and their loved ones better understand what might be ailing them.” Fox, S., & Duggan, M. (2013). Health online 2013. Health, 2013, 1-55.
Discuss why some people go online to get diagnoses instead of going to a doctor/health professional.

To find the dataset needed for this problem, you’ll first need
to open the “swiss” dataset that is contained in R by
running the following line:
> data('swiss')
Now you can rename the “swiss” dataset and use it to
answer the question below. Name the data frame with your UT
EID:
> my_variable <- swiss
This dataset contains socio-economic indicators for the
French-speaking provinces of Switzerland in the year 1888. Among
the variables, “Agriculture” is the percentage of the...

(a) To get a feel for how ∆mixGm varies with χAB, plot it for
the following values of χAB = 0, 1, 2, 3 (pick any temperature).
(b) Verify that ∆mixGm obtained from the lattice model for a
mixture of 2 components has a maximum or a minimum at xA = xB = 1/2
for any value of χAB. (c) Discuss how the second derivative of the
free energy of mixing with respect to the mole fraction of one of...

to get a feel for the importance of media literacy on
the part of business communicators, make a list of all the
information technology (devices, media platforms, and applications)
that you've used over the last two years.what kind of knowledge is
required to be able to use each of the used Technologies well?

to get a feel for the importance of media literacy on
the part of business communicators, make a list of all the
information technology (devices, media platforms, and applications)
that you've used over the last two years.what kind of knowledge is
required to be able to use each of the used Technologies well?

ADVERTISEMENT

ADVERTISEMENT

Latest Questions

- Write a program to do the following. • Input an integer n. • Create a BST...
- Summarize the basic process for valuing assets
- Why business owners should know about finance?
- The Alpine House, Inc., is a large retailer of snow skis. The company assembled the information...
- Dr. North, a surgeon practicing in Georgia, engaged an Arizona professional corporation consisting of twenty lawyers...
- Project management Expediting a Project Task Predecessor Normal Time Weeks Normal Cost Crash Time Crash Cost...
- Roger issued 40 year bonds six years ago at a coupon rate of 4.50 percent, and...

ADVERTISEMENT