Question

In: Statistics and Probability

First, obtain a set of real-world numeric data. You should have 25 to 50 entries in...

  • First, obtain a set of real-world numeric data. You should have 25 to 50 entries in your data set. You can collect your own data or use data from an online source.
  • In your initial post, list your data and calculate the five-number summary. Then, pose a problem that involves an analysis of the data. Do not provide a solution. Instead, be sure that you include enough relevant information so that your classmates can propose their solutions in their responses.
  • An example of a question may be;

  • A cell phone company may ask whether most people send 50 or more text messages per week. To analyze this question, you could ask 30 people how many text messages they sent during the past week. If another student asks 40 other people, do you think that the two of you will have the same conclusion?

Solutions

Expert Solution

The answer for above mentioned question is explained in below steps:

1. The sample set of real-world numeric data with 50 entries(Ex. Household Income) taken are mentioned below to find Five-Number Summary:

72 153 28 26 23 76 40 57 24 89
72 24 40 137 70 159 37 28 109 117
23 21 17 34 115 47 33 135 272 41
20 22 60 58 92 21 13 24 213 19
59 544 32 35 35 22 39 134 103 240

2. Next Step is to arrange the data set in ascending order, the arranged data set is mentioned below:

13 17 19 20 21 21 22 22 23 23
24 24 24 26 28 28 32 33 34 35
35 37 39 40 40 41 47 57 58 59
60 70 72 72 76 89 92 103 109 115
117 134 135 137 153 159 213 240 272 544

3. The Five-Number summary is a set of descriptive statistics that provides information about a dataset. It consists of the five most important sample percentiles:

A) The sample minimum (smallest observation) is 13.

B) The lower quartile or first quartile is the value of the middle of the first set, where 25% of the values are smaller than Q1 and 75% are larger.

This first quartile takes the notation Q1 = 24.00.

C) The median (the middle value / The median divides the data into two equal sets) is 40.50.

D) The upper quartile or third quartile is the value of the middle of the second set, where 75% of the values are smaller than Q3 and 25% are larger.

This third quartile takes the notation Q3 = 104.50.

E) The sample maximum (largest observation) is 544.

The another part of the above question is to frame a problem that involves an analysis of the data which is framed below:

For Example: Consider an average Household Income of a Population data of 6400 people is 69.47.

From the sample of above mentioned 50 entries / data points calculate sample mean and compare this sample mean with given population mean to find out any significant difference between these 2 Means??

To compare sample mean with population mean which Statistical Test do you prefer???


Related Solutions

Using a real-world data set that interests you, conduct one of the tests you learned this...
Using a real-world data set that interests you, conduct one of the tests you learned this week (or fit a linear regression model). Make sure to document all steps in the hypothesis testing process, including stating your hypotheses, your code, your output and your findings along with interpretation. You may use data from the MASS library if you wish, or load external data
Using a real-world data set that interests you, conduct one of the tests you learned this...
Using a real-world data set that interests you, conduct one of the tests you learned this week (or fit a linear regression model). Make sure to document all steps in the hypothesis testing process, including stating your hypotheses, your code, your output and your findings along with interpretation. You may use data from the MASS library if you wish, or load external data Please type all answer.
Suppose you have the following set of hexadecimal values: $20, $25, $40, $50, $12. Write a...
Suppose you have the following set of hexadecimal values: $20, $25, $40, $50, $12. Write a segment of program to find the minimum and maximum values of the set. Answer must be written in assembly.
You should have 10 entries. There should be a minimum of 4 behavioral entries and 4...
You should have 10 entries. There should be a minimum of 4 behavioral entries and 4 emotional entries, and the remaining 2 entries can be either type. As you learn more concepts and theories, begin to fill in the third column, justifying your explanation for the behavior or emotion with a psychological theory. You must use a theory to justify your explanation for 8 of the 10 entries, 4 behavioral entries and 4 emotional entries. Date Activity or Emotion Motivation...
Find a real-world business example with data set (or example from a case study) and comment...
Find a real-world business example with data set (or example from a case study) and comment on the way hypothesis testing can be applied to answer a relevant business question. Identify the question(s) in your post and provide the data set at an attachment as a reference. please try to do it with R.
suppose you have two sets of data to work with.The first set is a list of...
suppose you have two sets of data to work with.The first set is a list of all the injuries that were seen in a clinic in a month's time.The second set contains data on the number of minutes that each patient spent in the waiting room of a doctor's office. Propose your idea of how to represent the key information.To organize your data would you choose to use a frequency table,a culmative frequency table, or avrelative frequency table?Why?
Make a Frequency Distribution Chart for the following set of Data 50, 10, 25, 20, 20,...
Make a Frequency Distribution Chart for the following set of Data 50, 10, 25, 20, 20, 20, 50,100, 30, 15
You run a regression analysis on a bivariate set of data (n=99). You obtain the regression...
You run a regression analysis on a bivariate set of data (n=99). You obtain the regression equation y=0.843x+6.762 with a correlation coefficient of r=0.954 which is significant at α=0.01 You want to predict what value (on average) for the explanatory variable will give you a value of 150 on the response variable. What is the predicted explanatory value? x = (Report answer accurate to one decimal place.) Here is a bivariate data set. Find the regression equation for the response...
Suppose you have a set of data as shown below: {3, 25, 33, 21, 55, 43,...
Suppose you have a set of data as shown below: {3, 25, 33, 21, 55, 43, 78, 31, 33, 75, 43, 11, 36, 4, 10, 99, A, B, C} Write a Java class called "DataAnalysis" that has the following methods: Data_NaN: this method filters nonnumerical data, for example, A, B, and C will be filtered out by this method; Data_min: this method computes and returns the minimum of the numerical data;
Suppose you have a set of data as shown below: {3, 25, 33, 21, 55, 43,...
Suppose you have a set of data as shown below: {3, 25, 33, 21, 55, 43, 78, 31, 33, 75, 43, 11, 36, 4, 10, 99, A, B, C} Write a Java class called "DataAnalysis" that has the following methods: Data_media: this method computes and returns the median of the numerical data; Data_mode: this method computes and returns the mode of the numerical data; Data_SortedArray: this method rearranges and returns the data in the increasing order (i.e., smallest to largest).
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT