In: Statistics and Probability
Descriptive Statistics Data Analysis Plan
Scenario: Please write a few lines describing your scenario and the four variables (in addition to income) you have selected.
Use Table 1 to report the variables selected for this assignment. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.
Table 1. Variables Selected for the Analysis
Variable Name in the Data Set |
Description (See the data dictionary for describing the variables.) |
Type of Variable (Qualitative or Quantitative) |
Variable 1: “Income” |
Annual household income in USD. |
Quantitative |
Variable 2: |
||
Variable 3: |
||
Variable 4: |
||
Variable 5: |
Reason(s) for Selecting the Variables and Expected Outcome(s):
Variable 1: “Income” -
Variable 2: “ “ -
Variable 3: “ “ -
Variable 4: “ “ -
Variable 5: “ “ -
Data Set Description:
Proposed Data Analysis:
Measures of Central Tendency and Dispersion
Complete Table 2. Numerical Summaries of the Selected Variables and briefly explain why you choose those measurements. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.
Table 2. Numerical Summaries of the Selected Variables
Variable Name |
Measures of Central Tendency and Dispersion |
Rationale for Why Appropriate |
Variable 1: “Income” |
Number of Observations Median Sample Standard Deviation |
I am using median for two reasons: If there are any outliers or the data is not normally distributed, the median is the best measure of central tendency. The variable is quantitative. I am using sample standard deviation for three reasons: The data is a sample from a larger data set. It is the most commonly used measure of dispersion. The variable is quantitative. |
Variable 2: |
||
Variable 3: |
||
Variable 4: |
||
Variable 5: |
Graphs and/or Tables
Complete Table 3. Type of Graphs and/or Table for Selected Variables and briefly explain why you choose those graphs and/or tables. Note: The information for the required variable, “Income,” has already been completed and can be used as a guide for completing information on the remaining variables.
Table 3. Type of Graphs and/or Tables for Selected Variables
Variable Name |
Graph and/or Table |
Rationale for why Appropriate? |
Variable 1: “Income” |
Graph: I will use the histogram to show the normal distribution of data. |
Histogram is one of the best plot to show the normal distribution of quantitative level data . |
Variable 2: |
||
Variable 3: |
||
Variable 4: |
||
Variable 5: |