In: Statistics and Probability
An example of a question may be;
A cell phone company may ask whether most people send 50 or more text messages per week. To analyze this question, you could ask 30 people how many text messages they sent during the past week. If another student asks 40 other people, do you think that the two of you will have the same conclusion?
The answer for above mentioned question is explained in below steps:
1. The sample set of real-world numeric data with 50 entries(Ex. Household Income) taken are mentioned below to find Five-Number Summary:
72 | 153 | 28 | 26 | 23 | 76 | 40 | 57 | 24 | 89 |
72 | 24 | 40 | 137 | 70 | 159 | 37 | 28 | 109 | 117 |
23 | 21 | 17 | 34 | 115 | 47 | 33 | 135 | 272 | 41 |
20 | 22 | 60 | 58 | 92 | 21 | 13 | 24 | 213 | 19 |
59 | 544 | 32 | 35 | 35 | 22 | 39 | 134 | 103 | 240 |
2. Next Step is to arrange the data set in ascending order, the arranged data set is mentioned below:
13 | 17 | 19 | 20 | 21 | 21 | 22 | 22 | 23 | 23 |
24 | 24 | 24 | 26 | 28 | 28 | 32 | 33 | 34 | 35 |
35 | 37 | 39 | 40 | 40 | 41 | 47 | 57 | 58 | 59 |
60 | 70 | 72 | 72 | 76 | 89 | 92 | 103 | 109 | 115 |
117 | 134 | 135 | 137 | 153 | 159 | 213 | 240 | 272 | 544 |
3. The Five-Number summary is a set of descriptive statistics that provides information about a dataset. It consists of the five most important sample percentiles:
A) The sample minimum (smallest observation) is 13.
B) The lower quartile or first quartile is the value of the middle of the first set, where 25% of the values are smaller than Q1 and 75% are larger.
This first quartile takes the notation Q1 = 24.00.
C) The median (the middle value / The median divides the data into two equal sets) is 40.50.
D) The upper quartile or third quartile is the value of the middle of the second set, where 75% of the values are smaller than Q3 and 25% are larger.
This third quartile takes the notation Q3 = 104.50.
E) The sample maximum (largest observation) is 544.
The another part of the above question is to frame a problem that involves an analysis of the data which is framed below:
For Example: Consider an average Household Income of a Population data of 6400 people is 69.47.
From the sample of above mentioned 50 entries / data points calculate sample mean and compare this sample mean with given population mean to find out any significant difference between these 2 Means??
To compare sample mean with population mean which Statistical Test do you prefer???