Question

Part 1: Random Data, Statistics, and the Empirical Rule **Data Set Below**

Methods: Use Excel (or similar software) to create the tables and graph. Then copy the items and paste them into a Word document. The tables should be formatted vertically, have borders, and be given the labels and titles stated in the assignment. The proper symbols should be used. Do not submit this assignment as an Excel file. The completed assignment should be a Word (or .pdf) document.

  1. The data values and relevant information are posted on the course website. Use the data set (P, Q, R, S, or T) assigned to you by your instructor to complete this application.

For the purpose of this application, treat the data set as if it represented a certain random variable and were a valid random sample gathered by a researcher from a normally distributed population. The sample data were actually generated with an online Gaussian random number generator, which produces normally distributed data values. The random number generator simulates the results of a researcher finding those values through observation or experimentation.

  2. Use technology (Excel, graphing calculator, etc.) to sort the sample data values from low to high. Use Excel or similar software to put the data into a table with about 5 or 6 columns. Label this “Table 1: Sorted Set of Sample Data.”

  3. Using 5 to 10 class intervals, organize the sample data as a frequency distribution in a table. The limits of the class intervals should be rounded to the nearest tenth so that they match the data. Label this “Table 2: Frequency Distribution.”

  4. Use Excel (or similar software) to construct a frequency histogram to illustrate the data. Give the axes the proper titles. Label this “Graph 1: Histogram.”

  5. Use Table 2, the frequency distribution, to find the midpoint of each class interval. Create a new frequency distribution with the midpoints in the left column and the frequencies in the right column. Label this “Table 3: Frequency Distribution with Midpoints.”

  6. Use technology to find the mean, median, standard deviation, and variance of the sample data organized in Table 3 (from step 5 above). Put these values into a table with the proper symbol in the left column and the value of the statistic in the right column. Also, from the original data set, put the values of the range and sample size in the table. The median and range do not generally have symbols, so the terms “Median” and “Range” can be used in the left column. Identify the modal class (the one with the highest frequency). Put the term “Modal Class” in the left column and the class interval in the right column. The statistics should be rounded properly (one more decimal place than the data). Label this “Table 4: Summary Statistics.”

  7. Use the sample mean and standard deviation to find the values related to the Empirical Rule.

         The Empirical Rule: For a set of data whose distribution is approximately normal,

  • about 68% of the data are within one standard deviation of the mean.
  • about 95% of the data are within two standard deviations of the mean.
  • about 99.7% of the data are within three standard deviations of the mean.

Use the value of n and the percents listed above to find how many data values should be within each category. Then use the sample mean and standard deviation to find the lower and upper cut-off values in each category. Then use the sorted list of data to determine how many values are actually in each category. Put the values into a table as shown in the example and label it “Table 5: The Empirical Rule.”
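As a rough illustration of the first part of this step, the expected counts are simply each percentage multiplied by n. The Python sketch below assumes n = 46 (the sample size listed for Data Set R); the name `expected_counts` is made up for this example and is not part of the assignment.

```python
# Expected number of data values in each Empirical Rule category.
# Assumption: n = 46, the sample size listed for Data Set R.

EMPIRICAL_PERCENTS = {1: 0.68, 2: 0.95, 3: 0.997}  # k -> share within k sd


def expected_counts(n):
    """Return {k: expected count within k standard deviations} for sample size n."""
    return {k: share * n for k, share in EMPIRICAL_PERCENTS.items()}


counts = expected_counts(46)
for k, count in counts.items():
    print(f"within {k} sd: about {count:.1f} of 46 values")
```

For n = 46 this works out to about 31.3, 43.7, and 45.9 values within one, two, and three standard deviations.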

Data Set R

Mu=31.2

sd=4.5

n=46

31.0  34.9  31.4  27.4  37.6  38.2
32.8  32.0  26.4  32.3  33.9  21.4
34.7  33.3  39.5  32.5  29.1  28.0
34.9  33.5  26.9  38.4  17.9  28.9
30.8  31.3  33.8  31.6  33.7  38.3
38.3  28.2  39.0  29.1  41.1  23.6
31.6  29.3  29.2  25.8  38.6  27.2
31.1  34.0  29.3  28.3
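
For readers working outside Excel, the sorting step and the range could be sketched in Python as follows. The six-column layout is an assumption (the assignment allows about 5 or 6 columns), and the variable names are illustrative only.

```python
# Data Set R, in the order listed above (n = 46).
data = [
    31.0, 34.9, 31.4, 27.4, 37.6, 38.2, 32.8, 32.0, 26.4, 32.3,
    33.9, 21.4, 34.7, 33.3, 39.5, 32.5, 29.1, 28.0, 34.9, 33.5,
    26.9, 38.4, 17.9, 28.9, 30.8, 31.3, 33.8, 31.6, 33.7, 38.3,
    38.3, 28.2, 39.0, 29.1, 41.1, 23.6, 31.6, 29.3, 29.2, 25.8,
    38.6, 27.2, 31.1, 34.0, 29.3, 28.3,
]

sorted_data = sorted(data)  # low to high
n = len(sorted_data)        # sample size
# Range = max - min, rounded to the precision of the data.
data_range = round(sorted_data[-1] - sorted_data[0], 1)

COLUMNS = 6  # assumed column count for Table 1
print("Table 1: Sorted Set of Sample Data")
for start in range(0, n, COLUMNS):
    row = sorted_data[start:start + COLUMNS]
    print("  ".join(f"{value:5.1f}" for value in row))

print(f"n = {n}, Range = {data_range}")
```

The sorted list runs from 17.9 to 41.1, so the range is 23.2.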

Solutions

Expert Solution

We will use Excel for the analysis.

-> Enter the data into Excel.

1) In Excel, sort the data from lowest to highest.

2) Calculate the frequency distribution.

3)

4)
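
The frequency distribution from step 2, together with the midpoints and grouped statistics the assignment asks for, might be sketched in Python as below. The class intervals are an assumption made for this example (six classes of width 4.0 starting at 17.5); the solution does not state which intervals it used. The grouped statistics treat every value in a class as equal to the class midpoint, which is one common way calculators handle frequency tables.

```python
import statistics

# Data Set R (n = 46).
data = [
    31.0, 34.9, 31.4, 27.4, 37.6, 38.2, 32.8, 32.0, 26.4, 32.3,
    33.9, 21.4, 34.7, 33.3, 39.5, 32.5, 29.1, 28.0, 34.9, 33.5,
    26.9, 38.4, 17.9, 28.9, 30.8, 31.3, 33.8, 31.6, 33.7, 38.3,
    38.3, 28.2, 39.0, 29.1, 41.1, 23.6, 31.6, 29.3, 29.2, 25.8,
    38.6, 27.2, 31.1, 34.0, 29.3, 28.3,
]

# Assumed classes: 17.5-21.4, 21.5-25.4, ..., 37.5-41.4.
FIRST_LOWER_TENTHS = 175  # 17.5 expressed in tenths
WIDTH_TENTHS = 40         # class width 4.0 expressed in tenths
NUM_CLASSES = 6

# Count in integer tenths so class boundaries are compared exactly.
frequencies = [0] * NUM_CLASSES
for value in data:
    tenths = round(value * 10)
    frequencies[(tenths - FIRST_LOWER_TENTHS) // WIDTH_TENTHS] += 1

classes = []  # (lower limit, upper limit, midpoint) for each class
for i in range(NUM_CLASSES):
    lower = (FIRST_LOWER_TENTHS + i * WIDTH_TENTHS) / 10
    upper = (FIRST_LOWER_TENTHS + (i + 1) * WIDTH_TENTHS - 1) / 10
    classes.append((lower, upper, round((lower + upper) / 2, 2)))

print("Class interval  Midpoint  Frequency")
for (lower, upper, midpoint), f in zip(classes, frequencies):
    print(f"{lower:.1f}-{upper:.1f}       {midpoint:.2f}     {f}")

# Repeat each midpoint by its frequency, then use the sample formulas.
expanded = []
for (lower, upper, midpoint), f in zip(classes, frequencies):
    expanded.extend([midpoint] * f)

grouped_mean = statistics.mean(expanded)
grouped_median = statistics.median(expanded)
grouped_sd = statistics.stdev(expanded)      # sample standard deviation
grouped_var = statistics.variance(expanded)  # sample variance

modal_index = frequencies.index(max(frequencies))
modal_class = classes[modal_index][:2]

print(f"mean = {grouped_mean:.2f}, median = {grouped_median:.2f}")
print(f"sd = {grouped_sd:.2f}, variance = {grouped_var:.2f}")
print(f"Modal Class = {modal_class[0]:.1f}-{modal_class[1]:.1f}")
```

A different choice of class intervals would change the frequencies, the midpoints, and therefore the grouped statistics.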

We have to calculate the empirical limits for the data, using the stated Mu = 31.2 and sd = 4.5:

$\mu \pm k\sigma, \quad k = 1, 2, 3$

$= 31.2 \pm k(4.5)$

a) First limit: (26.7, 35.7)

b) Second limit: (22.2, 40.2)

c) Third limit: (17.7, 44.7)

These are the empirical limits for the given data set.
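
To finish Table 5, the number of data values that actually fall inside each pair of limits still has to be counted. A minimal Python sketch of that count follows. It uses the stated Mu and sd, as the limits above do; if the assignment requires the sample statistics from Table 4, substitute those values. The variable names are illustrative.

```python
# Data Set R (n = 46).
data = [
    31.0, 34.9, 31.4, 27.4, 37.6, 38.2, 32.8, 32.0, 26.4, 32.3,
    33.9, 21.4, 34.7, 33.3, 39.5, 32.5, 29.1, 28.0, 34.9, 33.5,
    26.9, 38.4, 17.9, 28.9, 30.8, 31.3, 33.8, 31.6, 33.7, 38.3,
    38.3, 28.2, 39.0, 29.1, 41.1, 23.6, 31.6, 29.3, 29.2, 25.8,
    38.6, 27.2, 31.1, 34.0, 29.3, 28.3,
]

MEAN = 31.2  # "Mu" stated for Data Set R
SD = 4.5     # "sd" stated for Data Set R

rows = []  # (k, lower cut-off, upper cut-off, actual count)
for k in (1, 2, 3):
    lower = round(MEAN - k * SD, 1)
    upper = round(MEAN + k * SD, 1)
    # Count the data values that lie between the two cut-offs, inclusive.
    actual = sum(lower <= value <= upper for value in data)
    rows.append((k, lower, upper, actual))

for k, lower, upper, actual in rows:
    print(f"k={k}: ({lower}, {upper}) contains {actual} of {len(data)} values")
```

With these limits the sketch counts 32, 43, and 46 values, which can be compared with 68%, 95%, and 99.7% of n = 46.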

