Question

In: Statistics and Probability

–Construct a reasonable frequency distribution of High School GPA (HSGPA) –Construct a histogram –Present the frequency...

–Construct a reasonable frequency distribution of High School GPA (HSGPA)

–Construct a histogram

–Present the frequency distribution and histogram

There are a total of 196 HS student GPAs. 1.6, 2, 2.1, 2.1, 2.2, 2.2, 2.2, 2.4, 2.4, 2.5, 2.5, 2.5, 2.5, 2.5, 2.6, 2.7, 2.75, 2.75, 2.75, 2.75, 2.75, 2.8, 2.8, 2.8, 2.9, 2.9, 2.9, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3.1, 3.1, 3.1, 3.2, 3.2, 3.2, 3.2, 3.2, 3.2, 3.2, 3.2, 3.2, 3.2, 3.2, 3.2, 3.2, 3.2, 3.2, 3.23, 3.25, 3.25, 3.25, 3.25, 3.3, 3.3, 3.3, 3.3, 3.3, 3.31, 3.34, 3.4, 3.4, 3.4, 3.4, 3.4, 3.4, 3.4, 3.4, 3.4, 3.45, 3.479, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.5, 3.6, 3.6, 3.6, 3.6, 3.6, 3.6, 3.6, 3.6, 3.63, 3.63, 3.64, 3.65, 3.65, 3.7, 3.7, 3.7, 3.7, 3.7, 3.7, 3.7, 3.7, 3.7, 3.7, 3.7, 3.7, 3.7, 3.7, 3.729, 3.75, 3.75, 3.75, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.8, 3.81, 3.81, 3.83, 3.9, 3.9, 3.9, 3.9, 3.9, 3.92, 3.94, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4

Solutions

Expert Solution

Problem Statement: In a class there are 196 students and we have their GPAs available. For this dataset of GPAs of 196 students a frequency distribution and a histogram needs to be constructed.

NOTE: Although in the question it is mentioned that there are GPAs of 196 students, when looked at data provided closely it looks like there are only 188 entries, implying GPAs of only188 students is available. It should be noted that irrespective of number of data samples available (188 v/s 196), the approach taken TO SOLVE THIS PROBLEM STATEMENT IS SAME.

IN THE WORKING I WILL BE USING 188 AS NUMBER OF DATA POINTS. IF ALL 196 POINTS, AS MENTIONED IN QUESTION ARE AVAILABLE , WE CAN REPLACE 188 WITH 196 AND USE THE SAME STEPS.

Given: In the problem statement we are provided with GPAs of 188 students.

Step 1: First step to be done while trying to construct frequency distribution and histogram is to arrange data in ascending order, as it gives a vague idea about data distribution. r.In the dataset provided, GPAs are already in ascending order.

Step 2: To obtain frequency distribution for the data, we have to decide on the number of bins that would be suitable for this dataset. Bins are class intervals arranged ascending order where each data point would fall in to one of the interval. One of the major property/characteristic of bin is all bins should be of same size. In most cases, deciding on number of bins required is subjective and depends on the understanding, application of the problem statement. However, there are certain rules that directs us to thee optimal number of bins for a dataset and we will use of those for this problem statement.

In this problem let us use Struge's rule to compute the number of bins required. Struge's rule is one the simple and highly used technique used to create bins.

Struge's rule is given by:

Number of bins (n)=1+(3.322*Logarithm10 (number of data points (m) )

Using the formula we get,

n= 1+(3.322*Logarithm10 (188) )= 8.5547.

Since number of bins should be a whole number we can round the number of bins to 8.

Step 3: Once the number of bins are decided, compute the number of data points falling in each of these equally spaced ascendingly ordered class interval. First bin starts with 1.6 (lowest GPA). Highest GPA scored is 4. Size of each bin would be =(4-1.6)/8= 0.3GPA

This forms frequency distribution table. An extra column could be computed to calculate relative frequency. Where number of data points within a class interval is divided by total number of samples

Below is the table:

GPA Frequency Relative Frequency
1.6 to 1.9 1 0.53%
1.9 to 2.2 6 3.19%
2.2 to 2.5 7 3.72%
2.5 to 2.8 10 5.32%
2.8 to 3.1 25 13.30%
3.1 to 3.4 36 19.15%
3.4 to 3.7 51 27.13%
3.7 to 4 52 27.66%
Total 188 100.00%

Step 4: Once we have obtained frequency of data points for each class interval (bin) we can proceed to plot histogram. Histogram is visual representation of frequency of data points in each class interval when the class intervals are arranged in an ascending order.

Below is the histogram for above data points:

Note : [ ] in graph indicates, number is inclusive in the interval while ( ) indicates the number is not included in the interval during frequency calculation.


Related Solutions

The number of bears killed in 2010 for 52 counties in Pennsylvania is shown in the frequency distribution. Construct a histogram,
The number of bears killed in 2010 for 52 counties in Pennsylvania is shown in the frequency distribution. Construct a histogram, frequency polygon, and ogive for the data. Comment on the skewness of the distribution. How many counties had 75 or fewer bears killed?
The frequency distribution shows the total number of points scored in the Super Bowl games from 1967 to 2012. Construct a histogram, frequency polygon, and ogive for the distribution.
The frequency distribution shows the total number of points scored in the Super Bowl games from 1967 to 2012. Construct a histogram, frequency polygon, and ogive for the distribution. If you were to predict the total number of points for the next Super Bowl game, what range of values would you choose? In how many games were the total points greater than 56?
Use EXCEL Construct Distribution: For each variable, construct Empirical distribution: Frequency Distribution and Relative Frequency Distribution...
Use EXCEL Construct Distribution: For each variable, construct Empirical distribution: Frequency Distribution and Relative Frequency Distribution (and if warranted, construct Cumulative Relative Distribution.) Apply a meaningful chart type to present each of the Relative Frequency Distribution (and Cum. Rel. Freq.) calculated in part “a”. (Freq. Dist. Plot is not necessary). Savings Months Employed Age Housing Credit Risk 1 $739 12 23 Own Low 2 $1,230 0 32 Own High 3 $389 119 38 Own High 4 $347 14 36 Own...
Using the histogram shown here, do the following. a. Construct a frequency distribution; include class limits, class frequencies ...
Using the histogram shown here, do the following.a. Construct a frequency distribution; include class limits, class frequencies, midpoints, and cumulative frequencies.b. Construct a frequency polygon.c. Construct an ogive.
Is there a linear correlation between students' high school GPA and their current GPA( College GPA)?...
Is there a linear correlation between students' high school GPA and their current GPA( College GPA)? Here is the link to our survey 1. Let's try to test this claim by using the survey. Assume that alpha is 0.05. 2. What is the equation of the regression line? 3. What is the best-predicted GPA if the high school GPA is 3.5? High School College 3 3.89 3.3 2.7 3.5 3.5 3 3.7 3.5 3.5 3.89 3.7 4 4 3 3...
a) Use the Histogram tool to develop a frequency distribution for the number of days and...
a) Use the Histogram tool to develop a frequency distribution for the number of days and the browser used as a customer of this online store. Use your judgement for determining the number of bins to use. b) Construct complete histogram graphs and numerical descriptions (that is, histograms and measures of location and dispersion) for the length of time the shopper spends on the website and the amount spent per transaction. c) The company would like to investigate the effect...
The data below provides College GPA, High School GPA, SAT total score, and a number of...
The data below provides College GPA, High School GPA, SAT total score, and a number of letters of reference. a.Generate a model for college GPA as a function of the other three variables? b.Is this model useful? Justify your conclusion. c.Are any of the variables not useful predictors? Why? CGPA HSGPA SAT REF 2.04 2.01 1070 5 2.56 3.4 1254 6 3.75 3.68 1466 6 1.1 1.54 706 4 3 3.32 1160 5 0.05 0.33 756 3 1.38 0.36 1058...
Statistics concepts for engineering management: The data in the table provides: College GPA, High School GPA,...
Statistics concepts for engineering management: The data in the table provides: College GPA, High School GPA, SAT total score, and number of letters of reference.    a. Generate a model for college GPA as a function of the other three variables.    b. Is this model useful? Justify your conclusion.    c. Are any of the variables not useful predictors? Why? CGPA   HSGPA   SAT   REF 2.04 2.01   1070   5 2.56 3.4   1254   6 3.75 3.68   1466   6 1.1 1.54   706  ...
Make a Frequency Distribution Chart, Histogram and Box and Whiskers Plot for the following set of...
Make a Frequency Distribution Chart, Histogram and Box and Whiskers Plot for the following set of Data 50, 10, 25, 20, 20, 20, 50,100, 30, 15
Shown is an ogive depicting the cumulative frequency of the average mathematics SAT scores by state. Use it to construct a histogram and a frequency polygon.
Shown is an ogive depicting the cumulative frequency of the average mathematics SAT scores by state. Use it to construct a histogram and a frequency polygon.
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT