Question


Case Study 1 - Data Visualization and Descriptive Statistics

The data file Home_Values.xlsx contains median home values (Home Value), median household income (HH Inc), median per capita income (Per Cap Inc), and the percentage of homes that are owner occupied (Pct Owner Occ) for each state and the District of Columbia. Before a more detailed analysis, a company wants a good understanding of the four variables: their central tendency, variability, and distribution shape, and the pattern of relationships between them. A company representative contracts with you to help with this process. To help the company get a better understanding of the data, you are asked to perform the following analysis steps:

State           Home Value  HH Inc  Per Cap Inc  Pct Owner Occ
New York            303900   55603        30948           55.2
North Carolina      149100   45570        24745           68.1
North Dakota        111300   46781        25803           66.6
Ohio                136400   47358        25113           69.2
Oklahoma            104300   42979        23094           68.2
Oregon              252600   49260        26171           63.8
Pennsylvania        159300   50398        27049           71.0
Rhode Island        279300   54902        28707           62.5
South Carolina      134100   43939        23443           69.9
South Dakota        122200   46369        24110           68.9
Tennessee           134100   43314        23722           69.6
Texas               123500   49646        24870           64.8
Utah                218100   56330        23139           71.2
Vermont             208400   51841        27478           71.4
Virginia            255100   61406        32145           68.9
Washington          285400   57244        29733           64.8
West Virginia        94500   38380        21232           74.6
Wisconsin           169000   51598        26624           69.5
Wyoming             174000   53802        27860           70.2

Solutions

Expert Solution

The descriptive statistics for the 19 states listed above are:

Statistic                        Home Value         HH Inc   Per Cap Inc  Pct Owner Occ
count                                    19             19            19             19
mean                             179,715.79      49,827.37     26,104.53         67.811
sample standard deviation         67,262.61       5,843.50      2,875.21          4.244
sample variance            4,524,259,181.29  34,146,444.80  8,266,860.93         18.012
minimum                              94,500         38,380        21,232           55.2
maximum                             303,900         61,406        32,145           74.6
range                               209,400         23,026        10,913           19.4
1st quartile                     128,800.00      45,969.50     23,916.00         65.700
median                           159,300.00      49,646.00     25,803.00         68.900
3rd quartile                     235,350.00      54,352.00     27,669.00         70.050
interquartile range              106,550.00       8,382.50      3,753.00          4.350
mode                             134,100.00           none          none  64.8 and 68.9
low extremes                              0              0             0              0
low outliers                              0              0             0              1
high outliers                             0              0             0              0
high extremes                             0              0             0              0

Note on the mode: Pct Owner Occ has two modes. Both 64.8 (Texas, Washington) and 68.9 (South Dakota, Virginia) occur twice, so reporting 68.9 alone gives only one of them. HH Inc and Per Cap Inc have no repeated values and therefore no mode.
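
The table can be reproduced with a short script. The following is a minimal sketch using only Python's standard library, not the tool the original solution used. The helper name `describe` and the dictionary layout are illustrative assumptions. Quartiles use the "inclusive" interpolation method, which matches the quartile values reported above for this data.

```python
import statistics

# Columns transcribed from the data table in the question (New York to Wyoming).
columns = {
    "Home Value": [303900, 149100, 111300, 136400, 104300, 252600, 159300,
                   279300, 134100, 122200, 134100, 123500, 218100, 208400,
                   255100, 285400, 94500, 169000, 174000],
    "HH Inc": [55603, 45570, 46781, 47358, 42979, 49260, 50398, 54902, 43939,
               46369, 43314, 49646, 56330, 51841, 61406, 57244, 38380, 51598,
               53802],
    "Per Cap Inc": [30948, 24745, 25803, 25113, 23094, 26171, 27049, 28707,
                    23443, 24110, 23722, 24870, 23139, 27478, 32145, 29733,
                    21232, 26624, 27860],
    "Pct Owner Occ": [55.2, 68.1, 66.6, 69.2, 68.2, 63.8, 71.0, 62.5, 69.9,
                      68.9, 69.6, 64.8, 71.2, 71.4, 68.9, 64.8, 74.6, 69.5,
                      70.2],
}


def describe(values):
    """Hypothetical helper: summary statistics for one column."""
    # "inclusive" treats the data as containing its own min and max,
    # interpolating linearly between neighbouring order statistics.
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    modes = statistics.multimode(values)
    if len(modes) == len(values):
        modes = []  # every value is unique, so there is no mode
    return {
        "count": len(values),
        "mean": statistics.mean(values),
        "sample_sd": statistics.stdev(values),       # divides by n - 1
        "sample_var": statistics.variance(values),   # divides by n - 1
        "min": min(values),
        "max": max(values),
        "range": max(values) - min(values),
        "q1": q1,
        "median": median,
        "q3": q3,
        "iqr": q3 - q1,
        "modes": modes,
    }


summary = {name: describe(values) for name, values in columns.items()}

for name, stats in summary.items():
    print(name)
    for key, value in stats.items():
        print(f"  {key}: {value}")
```

`statistics.multimode` returns every most-frequent value, which is why this sketch reports both modes for Pct Owner Occ rather than a single one.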

The boxplots are: (figure not reproduced)
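
The elements a boxplot would display can be computed directly from the data. The sketch below does this for Pct Owner Occ, the only variable with an outlier. It assumes the common Tukey convention, which the solution does not state explicitly: points beyond 1.5 × IQR from the quartiles are outliers and points beyond 3 × IQR are extremes. The outlier counts reported above are consistent with that rule. The function name `boxplot_elements` is illustrative.

```python
import statistics

# Pct Owner Occ by state, transcribed from the data table in the question.
pct_owner_occ = {
    "New York": 55.2, "North Carolina": 68.1, "North Dakota": 66.6,
    "Ohio": 69.2, "Oklahoma": 68.2, "Oregon": 63.8, "Pennsylvania": 71.0,
    "Rhode Island": 62.5, "South Carolina": 69.9, "South Dakota": 68.9,
    "Tennessee": 69.6, "Texas": 64.8, "Utah": 71.2, "Vermont": 71.4,
    "Virginia": 68.9, "Washington": 64.8, "West Virginia": 74.6,
    "Wisconsin": 69.5, "Wyoming": 70.2,
}


def boxplot_elements(data, inner=1.5, outer=3.0):
    """Hypothetical helper: box, whiskers, and flagged points for one variable.

    inner and outer are the assumed IQR multipliers for outliers and extremes.
    """
    values = list(data.values())
    q1, median, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low_inner, high_inner = q1 - inner * iqr, q3 + inner * iqr
    low_outer, high_outer = q1 - outer * iqr, q3 + outer * iqr
    # Whiskers end at the most extreme observations inside the inner fences.
    inside = [v for v in values if low_inner <= v <= high_inner]
    return {
        "box": (q1, median, q3),
        "whiskers": (min(inside), max(inside)),
        "low_extremes": [s for s, v in data.items() if v < low_outer],
        "low_outliers": [s for s, v in data.items()
                         if low_outer <= v < low_inner],
        "high_outliers": [s for s, v in data.items()
                          if high_inner < v <= high_outer],
        "high_extremes": [s for s, v in data.items() if v > high_outer],
    }


elements = boxplot_elements(pct_owner_occ)
for key, value in elements.items():
    print(f"{key}: {value}")
```

Under these assumptions the lower inner fence is 65.7 − 1.5 × 4.35 = 59.175, so New York (55.2) is the single low outlier; it stays above the outer fence of 65.7 − 3 × 4.35 = 52.65, so it is not an extreme.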

