Question

In: Statistics and Probability

Case Study 1 - Data Visualization and Descriptive Statistics The data file Home_Values.xlsx contains median home...

Case Study 1 - Data Visualization and Descriptive Statistics

The data file Home_Values.xlsx contains median home values (Home Value), median household income (HH Inc), median per capita (Per Cap Inc) and percent of homes that are owner occupied (Pct Owner Occ) for each state and the District of Columbia. Prior to a more detailed analysis of the data, a company wants to get a good understanding of the 4 variables (e.g. central tendency, variability, shape of the distribution, pattern of relationship between the variables). A company representative contracts with you to help with this process. To help the company get a better understanding of the data, you are asked to perform the following analysis steps:

I

State Home Value HH Inc Per Cap Inc Pct Owner Occ
Iowa 119200 48872 25335 73.2
Kansas 122600 49424 25907 69.4
Kentucky 116800 41576 22515 69.9
Louisiana 130000 43445 23094 68.2
Maine 176200 46933 25385 73.1
Maryland 329400 70647 34849 69
Massachusetts 352300 64509 33966 64
Michigan 144200 48432 25135 74.2
Minnesota 206200 57243 29582 74.2
Mississippi 96500 37881 19977 70.8
Missouri 137700 46262 24724 70
Montana 173300 43872 23836 69
Nebraska 123900 49342 25229 68.6
Nevada 254200 55726 27589 60.1
New Hampshire 253200 63277 31422 72.6
New Jersey 357000 69811 34858 66.9
New Mexico 158400 43820 22966 69.6

Solutions

Expert Solution

From the above table, the mean and median of the Home Value are  191241 and 158400 respectively. Here, the mean is greater than the median and skewness value 0.99 with a positive sign. So, this variable has positive skewness. Further, the standard deviation of this variable is 86581.

The mean and median of the HH Inc are  51828 and 48872 respectively. Here, the mean is greater than the median and skewness value 0.74 with a positive sign. So, this variable has positive skewness. Further, the standard deviation of this variable is 9995.

The mean and median of the Per Cap Inc are  26845 and 25335 respectively. Here, the mean is greater than the median and skewness value 0.72 with a positive sign. So, this variable has positive skewness. Further, the standard deviation of this variable is 4523.

The mean and median of the Pct Owner Occ are  69.576 and 69.600 respectively. Here, the mean is greater than the median and skewness value -1.08 with a positive sign. So, this variable has negative skewness. Further, the standard deviation of this variable is 3.630.

Home value has a positive association with HH Inc and Per Cap Inc. Whereas, Home value has a negative association with variable Pct Owner Occ.

HH Inc has a positive association with the variables Per Cap Inc, whereas, it does not have the association with variable Pct Owner Occ.

At last, the variable Per Cap Inc does not have the association with Pct Owner Occ.


Related Solutions

Case Study 1 - Data Visualization and Descriptive Statistics The data file Home_Values.xlsx contains median home...
Case Study 1 - Data Visualization and Descriptive Statistics The data file Home_Values.xlsx contains median home values (Home Value), median household income (HH Inc), median per capita (Per Cap Inc) and percent of homes that are owner occupied (Pct Owner Occ) for each state and the District of Columbia. Prior to a more detailed analysis of the data, a company wants to get a good understanding of the 4 variables (e.g. central tendency, variability, shape of the distribution, pattern of...
* Descriptive statistics: o Find the mean, median, mode, range, and standard deviation for the data...
* Descriptive statistics: o Find the mean, median, mode, range, and standard deviation for the data set. o Create a scatter plot for the data set. * Regression Analysis: o Perform a linear regression analysis onto the data set. o Report the correlation coefficient, the equation of the regression function, and make a few predictions base on hypnotical input values. o Write down a summary of your conclusions (How well does the regression fit the values? How correlated are the...
The file medinc.mtw contains data on the median incomes (medinc) of census dissemination areas in Toronto....
The file medinc.mtw contains data on the median incomes (medinc) of census dissemination areas in Toronto. (a) Treating this set of data as the population, use Minitab to calculate the population mean and the population standard deviation for the medinc variable. Set aside all population information until parts (d) and (e). (b) Use Minitab (Calc Menu – Random Data – Sample from Columns) to draw twenty samples of size n = 40 from the Toronto medinc population. This procedure must...
Use Descriptive Statistics in Excel to find the mean, median, range and standard deviation of the...
Use Descriptive Statistics in Excel to find the mean, median, range and standard deviation of the data below. Table 1: Health Expenditures as a Percentage of GDP of countries around the world 3.35 5.96 10.64 5.24 3.79 5.65 7.66 7.38 5.87 11.15 5.96 4.78 7.75 2.72 9.50 7.69 10.05 11.96 8.18 6.74 5.89 6.20 5.98 8.83 6.78 6.66 9.45 5.41 5.16 8.55
7.38 Teaching descriptive statistics. A study compared five different methods for teaching descriptive statistics. The five...
7.38 Teaching descriptive statistics. A study compared five different methods for teaching descriptive statistics. The five methods were traditional lecture and discussion, programmed textbook instruction, programmed text with lectures, computer instruction, and computer instruction with lectures. 45 students were randomly assigned, 9 to each method. After completing the course, students took a 1-hour exam. (a) What are the hypotheses for evaluating if the average test scores are different for the different teaching methods? (b) What are the degrees of freedom...
Question 1: As explained in Lesson 5, data exploration through visualization is important because statistics alone...
Question 1: As explained in Lesson 5, data exploration through visualization is important because statistics alone might not tell the entire story. This is best shown by the French statistician Francis Anscombe in 1973 when he presented four sets of data. This data is shown here Data I Data II Data III Data IV x y x y x y x y 10.0 8.04 10.0 9.14 10.0 7.46 8.0 6.58 8.0 6.95 8.0 8.14 8.0 6.77 8.0 5.76 13.0 7.58...
Question 1: As explained in Lesson 5, data exploration through visualization is important because statistics alone...
Question 1: As explained in Lesson 5, data exploration through visualization is important because statistics alone might not tell the entire story. This is best shown by the French statistician Francis Anscombe in 1973 when he presented four sets of data. This data is shown here Data I Data II Data III Data IV x y x y x y x y 10.0 8.04 10.0 9.14 10.0 7.46 8.0 6.58 8.0 6.95 8.0 8.14 8.0 6.77 8.0 5.76 13.0 7.58...
There are a number of descriptive statistics that could be used to describe data from a...
There are a number of descriptive statistics that could be used to describe data from a study. Name and define at least 5
If the file circuit.txt contains the following data
Exercise 2: If the file circuit.txt contains the following data 3.0             2.1 1.5             1.1 2.6             4.1 The first column is voltage and the second column is the electric current. Write program that reads the voltages and currents then calculates the electric power (P) based on the equation: Voltage     Current        Power 3.0             2.1              (result) 1.5             1.1              (result) 2.6             4.1              (result)                         P = v * i Write your output to the file results.txt with voltage in the first, current in the second...
Data Analysis & Visualization Topic R vector and save the r code in a text file...
Data Analysis & Visualization Topic R vector and save the r code in a text file Problem 1. Create two vectors named v and w with the following contents:      v : 21,10,32,2,-3,4,5,6,7,4,-22      w : -18,72,11,-9,10,2,34,-5,18,9,2 A) Print the length of the vectors B) Print all elements of the vectors C) Print elements at indices 3 through 7. D) Print the sum of the elements in each vector. E) Find the mean of each vector. (Use R's mean() function)...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT