Question

In: Statistics and Probability

Frequency Plots: a) How do you draw a frequency plot given a data set? b) What...

Frequency Plots:
a) How do you draw a frequency plot given a data set?
b) What information about a data set do you get from a frequency plot of that set?
c) What is the difference between theoretical and experimental frequency plots?
i. When is it a safe bet to say that experimental frequency plots will begin to look like theoretical frequency plots?
d) How does the frequency plot relate to the cumulative distribution plot?
i. How do you draw a frequency plot using only a cumulative distribution plot?

Solutions

Expert Solution

a). A frequency plot -

Let us first of all try to understand as ot what exactly is a frequency plot.?

A frequency plot is a graphical representation method for summarizing the distributional information of a given variable.

To make a frequency plot we follow the following steps.

1. Arrange the data array into ascending order.

2. Now we shall distribute these data points into various data intervals covering the entire distribution. To do so we look at the lowest and the largest value in the data set . Now we shall define the lower limit of the first class interval (pre defined range of data) to be such that it is equal to or just less than the lowest data point of the array. After the lower class limit is defined, we will choose a class range of sufficient range such that the entire distribution is spread across 6-7 intervals.

Upper class limit = lower class limit+period

Thus we now know the class limits (upper and lower) and class intervals, we can make a table of all the possible class intervals.

3). Now count the total number of values from the data sets that lie in the defined class intervals. This is called as frequency of the relevant class interval. Thus we will now have a frequency distribution table.

4). Now plot this distribution in form of a bar graph (without gaps between bars -known as histogram) and connect the mid points of all the bars to make a contineous curve/line.

This is called as frequency plot.

The frequency plot and the histogram have the same information except the frequency plot has lines connecting the frequency values whereas the histogram has bars at the frequency values.

In summary we say that The data set is divided into equal sized intervals (or bins). The number of occurrences of the data points is calculated for each bin. The frequency plot then consists of:

Vertical axis = frequencies or relative frequencies;
Horizontal axis = data set (i.e., the mid-point of each interval).

There can be  4 types of frequency plots:

  1. frequency plot (absolute counts);
  2. relative frequency plot (converts counts to proportions);
  3. cumulative frequency plot;
  4. cumulative relative frequency plot.

b). A frequency plot is a very significant plot and it gives various information about the data set given ,such as

1. The nature and profile of the plot lets us know if the given distribution of the data set follows a normal distribution or not ?

2. If it follows a normal distribution ,then what can be its skewness (shift of mean) ,standard deviation ,standard error etc.

3. It tells us about the outliers from the data set.

4. It gives us an information about the percentile distribution of the data set

5. It provides the information of the number of occurrences (frequency) of distinct values distributed within a given period of time or interval.

6. It is also an estimate of the probability distribution of a continuous variable .

7. We can analyse the number of data sets belonging to a certain proportion of the data set.

8. It tells us about of the density distribution of the data from the data set.

c). Difference between theoretical and experimental frequncy plot

A experimental frequency plot is a frequency plot wherein the data is obtained after an actual experimental design whereas a theoretical frequency plot is a frequency plot in which the data obtained is the theoretical output based no assumed behaviour (mathematical model). The theoretical value is what we expect to happen, but it isn't always what actually happens.

For example , a theoretical frequency data point for chance of occurrance of 6 in an unbiased die is 1/6 while the experimental output for the same can be any actual observed value from 1/6 to 6/6 .

In general, the experimental frequency of an event tends to get closer to the theoretical probability of the event as we perform more trials. The theretical frequency converges towards the expected value/mean value (theretical value) as we increase on the number of data points.

Usually , if we take more than 30 data points or more then there will be very close relationship between the two.

d). Before understanding hte diffence between a frequency plot and a cumulative freuqnecy plot, we need ot firstly understand as ot what exactly is a cumulative freuqnecy plot.

A cumulative frequency plot is a graphical method of displaying the cumulative information graphically. It shows the number, percentage, or proportion of observations that are less than or equal to particular values. A cumulative frequency plot is a graphical represnetaiton of hte cumulative frequency.

But what is cumulative frequency?

Cumulative frequency is the running total of all frequencies upto the given point. It is the sum of all the previous frequencies up to the current point. It is easily understandable through a Cumulative Frequency Table.

Now to explain the difference between a frequency plot and a cumulative frequency plot let us take an example as follows.

Let us have the following data set

25,22,36,38,36,38,46,45,48,46,55,55,52,58,55,68,67,61,72,91

Now refer to part a of our solution .wE have explianed the steps to create the frequency table.

The frequency table and its histogram looks like this

Joining the mid points of hte bar of the histogram, we will get the frequency plot.

From the frequency table we can now plot the cummulative frequency plot.

Class Frequency cumu. Frequency
20-35 2 2 =2+0
36-51 8 10 =2+8
52-67 7 17 =10+7
68-83 2 19 =17+2
84-99 1 20 =19+1

and the cumulative plot will be


Related Solutions

How do you graph outliers on a box plot when given a data set of numbers?...
How do you graph outliers on a box plot when given a data set of numbers? I found the median, lower and upper quartile numbers and have already plotted that but how do you plot outliers? Lets say the data is :1 2 3 4 5 6 7 8 9
how do you find frequency and percent frequency of a data set on excel? steps please!...
how do you find frequency and percent frequency of a data set on excel? steps please! :)
.  Draw a plot of the following set of data and determine the linear regression equation.  What is...
.  Draw a plot of the following set of data and determine the linear regression equation.  What is the      value of the slope and intercept?   What is r and R2?  Are there any outlier values?   (15 points)                                 Age (X):     20  25  36  29  41  35  56  43  66  50  59  67  51  75  75  81  54  66  52  48            Total Body Water (Y):     61  57  52  59  53  58  48  51  37  44  42  41  48  38  41  39  47  42  51  50  
how to calculate quartiles and draw a box plot from a sample data set containing two...
how to calculate quartiles and draw a box plot from a sample data set containing two independent variables and one dependent variable?
A student makes three plots of their data and finds that a plot of [A] vs...
A student makes three plots of their data and finds that a plot of [A] vs t is non-linear, a plot of ln[A] vs t is non-linear, and a plot of 1/[A] vs t is linear. What is the rate law of the reaction?
The yield data (kg/plot) were collected from 20 plots. Each fertilizer (A or B) was randomly...
The yield data (kg/plot) were collected from 20 plots. Each fertilizer (A or B) was randomly assigned to 10 plots. Fertilizer A 17.5 17.9 18.2 16.9 17.2 17.5 17.3 18.9 16.8 17.1 Fertilizer B 18.4 17.9 18.2 17.6 17.6 18.8 18.1 17.4 17.8 17.6 a. State the null and alternative hypotheses. b. Conduct 2 tailed, paired t-test c. Make a conclusion based on a 10% significance level. Briefly explain how do you make this conclusion.
For the accompanying data​ set, (a) draw a scatter diagram of the​ data, (b) by​ hand,...
For the accompanying data​ set, (a) draw a scatter diagram of the​ data, (b) by​ hand, compute the correlation​ coefficient, and​ (c) determine whether there is a linear relation between x and y. n 3 0.997 4 0.950 5 0.878 6 0.811 7 0.754 8 0.707 9 0.666 10 0.632 11 0.602 12 0.576 13 0.553 14 0.532 15 0.514 16 0.497 17 0.482 18 0.468 19 0.456 20 0.444 21 0.433 22 0.423 23 0.413 24 0.404 25 0.396...
For the accompanying data​ set, (a) draw a scatter diagram of the​ data, (b) by​ hand,...
For the accompanying data​ set, (a) draw a scatter diagram of the​ data, (b) by​ hand, compute the correlation​ coefficient, and​ (c) determine whether there is a linear relation between x and y. x 2 4 6 6 7 y 4 8 11 13 18 n 3 0.997 4 0.950 5 0.878 6 0.811 7 0.754 8 0.707 9 0.666 10 0.632 11 0.602 12 0.576 13 0.553 14 0.532 15 0.514 16 0.497 17 0.482 18 0.468 19 0.456...
For the accompanying data​ set, (a) draw a scatter diagram of the​ data, (b) compute the...
For the accompanying data​ set, (a) draw a scatter diagram of the​ data, (b) compute the correlation​ coefficient, and​ (c) determine whether there is a linear relation between x and y. x 2 6 1 7 9 y 8 2 6 9 5 Critical Values for Correlation Coefficient n 3 0.997 4 0.950 5 0.878 6 0.811 7 0.754 8 0.707 9 0.666 10 0.632 11 0.602 12 0.576 13 0.553 14 0.532 15 0.514 16 0.497 17 0.482 18...
For the accompanying data​ set, (a) draw a scatter diagram of the​ data, (b) compute the...
For the accompanying data​ set, (a) draw a scatter diagram of the​ data, (b) compute the correlation​ coefficient, and​ (c) determine whether there is a linear relation between x and y.    Data set x 7 6 6 7 9 y 3 2 6 9 5 Critical Values for Correlation Coefficient n 3 0.997 4 0.950 5 0.878 6 0.811 7 0.754 8 0.707 9 0.666 10 0.632 11 0.602 12 0.576 13 0.553 14 0.532 15 0.514 16 0.497...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT