Question

In: Statistics and Probability

I have to clean up a data spreadsheet to make a histogram, bar chart etc. What...

I have to clean up a data spreadsheet to make a histogram, bar chart etc. What my question is, is if a number on my spreadsheet said 10-15 or 20+ shouldn't I just change those to 15 and 20. Most numbers in my column are whole numbers.   

Solutions

Expert Solution

No that won't be necessary to change that from 10-15 to just 15. In fact, that will be wrong to do it.

A histogram tells you how many numbers of items lie in that range. You can create a range like 1

0.5-5.5

5.5-10.5

10.5-15.5

15.5-20.5

20.5-25.5

Remember the upper limit is not included in the interval.

Here is an example of creating a histogram.

Here is the data on starting salaries of a group of 54 people. When constructing a histogram it is helpful to sort the observations.

8870 10800 12000 12500 13000 14000 15000 16000 16500 16600 16700 16900 16900 17000 17000 17600 17880 18000 18000 18000 18000 18000 18000 18000 18000 18000 18000 18500 18680 19100 20000 20000 20000 20000 20000 20300 20900 22000 23000 23000 23000 23000 23400 24000 25000 25000 26000 26000 27000 30000 30000 32500 37000 48785

Minimum = 8870 Maximum = 48785 Range = 39915.

First, decide how many intervals you would like. A thumb rule is to use the square root of the number of observations then round it up. Here, that is the square root of 54 = 7.34; round up and use 8.

The interval width should then be approximately equal to the range divided by the number of intervals. Range/number of Intervals = 39915/8 = 4989.375; I'll round up to the conveniently even figure of 5000.

Start the first interval at a convenient value below the minimum. Here the minimum is 8870, so we begin at 7500.

The intervals then begin at 7500 and have a width of 5000. So, the first interval runs from 7500 to 12500, the second from 12500 to 17500 and so on. By convention, we agree that an interval includes the lower boundary point, but does not include the upper boundary point. So, for instance, a value of 7500 falls in the (7500, 12500) interval, but a value of 12500 does not. A value of 12500 falls instead in the (12500, 17500) interval.

Construct a simple table including each interval, the count of observations in that interval and the relative frequency or percentage of observations in the interval.


Related Solutions

What is the difference between a histogram and a bar chart?
What is the difference between a histogram and a bar chart?
Analyze the data set (e.g. mean, standard deviation, scatterplot, histogram, bar chart, etc.)and discuss important findings....
Analyze the data set (e.g. mean, standard deviation, scatterplot, histogram, bar chart, etc.)and discuss important findings. Suggest courses of action related to the given situation. Variable Names: 1. VOL: Cubic feet of cab space 2. HP: Engine horsepower 3. MPG: Average miles per gallon 4. SP: Top speed (mph) 5. WT: Vehicle weight (100 lb) MAKE / MODEL VOL HP MPG SP WT GM/GeoMetroXF1 89 49 65.4 96 17.5 GM/GeoMetro 92 55 56 97 20 GM/GeoMetroLSI 92 55 55.9 97...
Analyze the data set (e.g. mean, standard deviation, scatterplot, histogram, bar chart, etc.)and discuss important findings.
  Analyze the data set (e.g. mean, standard deviation, scatterplot, histogram, bar chart, etc.)and discuss important findings. Suggest courses of action related to the given situation. Variable Names:           1. VOL: Cubic feet of cab space           2. HP: Engine horsepower           3. MPG: Average miles per gallon           4. SP: Top speed (mph)           5. WT: Vehicle weight (100...
Analyze the data set (e.g. mean, standard deviation, scatterplot, histogram, bar chart, etc.)and discuss important findings....
Analyze the data set (e.g. mean, standard deviation, scatterplot, histogram, bar chart, etc.)and discuss important findings. Suggest courses of action related to the given situation. Variable Names: 1. VOL: Cubic feet of cab space 2. HP: Engine horsepower 3. MPG: Average miles per gallon 4. SP: Top speed (mph) 5. WT: Vehicle weight (100 lb) MAKE / MODEL VOL HP MPG SP WT GM/GeoMetroXF1 89 49 65.4 96 17.5 GM/GeoMetro 92 55 56 97 20 GM/GeoMetroLSI 92 55 55.9 97...
Find an example of a histogram or a bar chart outside of STAT 100.
Find an example of a histogram or a bar chart outside of STAT 100.
Prepare an Audit Program for "Data Clean up". Google relevant terms such as "Data cleansing" etc.
Prepare an Audit Program for "Data Clean up". Google relevant terms such as "Data cleansing" etc.
True or False 1. PIE CHART, HISTOGRAM, and BAR CHART can be produced when one select...
True or False 1. PIE CHART, HISTOGRAM, and BAR CHART can be produced when one select T-TEST statistical function. 2. Descriptive statistics are run when researchers want to find out relationship between phenomena, such as if a higher gas price leads to more use of public transportation system. 3. In SPSS, DATA VIEW allows researchers to see actual numerical data that researchers have entered. 4. When running SPSS to generate PEARSON CORRELATION, one will use ANALYZE-à CORRELATE. 5. In PEARSON...
Describe an instance where a graph/chart/histogram/etc. or a median/mean given that was misleading and did not...
Describe an instance where a graph/chart/histogram/etc. or a median/mean given that was misleading and did not reveal the whole situation.
Make a Frequency Distribution Chart, Histogram and Box and Whiskers Plot for the following set of...
Make a Frequency Distribution Chart, Histogram and Box and Whiskers Plot for the following set of Data 50, 10, 25, 20, 20, 20, 50,100, 30, 15
Consider as SAMPLE data: 52,84,86,91,96,96,98,100,103,105,109. 1) What graph is better - bar graph or histogram? 2)...
Consider as SAMPLE data: 52,84,86,91,96,96,98,100,103,105,109. 1) What graph is better - bar graph or histogram? 2) What's sum of squares? 3) What's sample standard deviation? 4) What's sample variance? Now, consider as POPULATION data: 52,84,86,91,96,96,98,100,103,105,109 (same). 1) What's sum of squares? 2) What's population standard deviation? 3) What's population variance?
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT