Question

2. Suppose that the data for analysis includes the attribute age. The age values for the data tuples are (in increasing order) 13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25, 30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70.

(a) What is the mean of the data? What is the median?

(b) What is the mode of the data? Comment on the data’s modality (i.e., bimodal, trimodal, etc.).

(c) What is the midrange of the data?

(d) Can you find (roughly) the first quartile (Q1) and the third quartile (Q3) of the data?

(e) Give the five-number summary of the data.

(f) Show a boxplot of the data.

(g) How is a quantile-quantile plot different from a quantile plot?

Solutions

Part a

For the given data, we have

Total sum = ∑x = 809

Sample size = n = 27

Sample mean = ∑x / n = 809/27 ≈ 29.963

Median = middle observation when the data is in increasing order. Since n = 27 is odd, this is the (n + 1)/2 = 14th observation.

Median = 25
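The arithmetic in Part a can be checked with a short Python sketch. It uses only the standard library; the variable names are illustrative and not part of the original solution.

```python
from statistics import median

# Age values from the question, already sorted in increasing order.
ages = [13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25,
        30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70]

n = len(ages)                 # sample size
total = sum(ages)             # sum of all observations
sample_mean = total / n       # mean = sum / n
sample_median = median(ages)  # for odd n, the ((n + 1) / 2)th sorted value

print(n, total, round(sample_mean, 5), sample_median)
```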

Part b

Mode = most frequently occurring observation (the value with the highest frequency)

Two values, 25 and 35, each occur 4 times, more often than any other value.

So the modes are 25 and 35, and the data is bimodal.
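As a quick check of the frequency count, the following sketch tallies each age with `collections.Counter` and keeps every value that reaches the highest count. The variable names are illustrative only.

```python
from collections import Counter

# Age values from the question.
ages = [13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25,
        30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70]

counts = Counter(ages)          # value -> number of occurrences
highest = max(counts.values())  # the largest frequency in the data
# Every value that attains the largest frequency is a mode.
modes = sorted(value for value, count in counts.items() if count == highest)

print(highest, modes)
```

Two modes in the resulting list corresponds to a bimodal data set.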

Part c

From the given data, we have

Minimum = 13

Maximum = 70

Mid-range = (Minimum + Maximum) / 2

Mid-range = (13 + 70)/2

Mid-range = 41.5

Part d

We are given n = 27.

Position of the first quartile = (1/4)*27 = 6.75 ≈ 7th observation when the data is in increasing order

First quartile = Q1 = 20

Position of the third quartile = (3/4)*27 = 20.25 ≈ 20th observation when the data is in increasing order

Third quartile = Q3 = 35
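The rough quartile rule used here (multiply n by the fraction, then round to the nearest position) can be sketched as below. The helper `rough_quartile` is a hypothetical name introduced for illustration. Other quartile conventions, such as interpolating between neighbouring observations, may give slightly different values.

```python
# Age values from the question, already sorted in increasing order.
ages = [13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25,
        30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70]


def rough_quartile(sorted_values, fraction):
    """Hypothetical helper: round fraction * n to the nearest 1-based position
    and return that position together with the observation found there."""
    position = round(fraction * len(sorted_values))
    return position, sorted_values[position - 1]  # lists are 0-based


q1_position, q1 = rough_quartile(ages, 0.25)  # 6.75 rounds to the 7th observation
q3_position, q3 = rough_quartile(ages, 0.75)  # 20.25 rounds to the 20th observation

print(q1_position, q1, q3_position, q3)
```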

Part e

The five-number summary is given below:

Minimum = 13

First quartile = 20

Median = 25

Third quartile = 35

Maximum = 70

Part f

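Box plot for the given data (image not reproduced here): the box runs from the first quartile, 20, to the third quartile, 35, with a line inside the box at the median, 25, and whiskers drawn out toward the minimum, 13, and the maximum, 70.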
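The numbers needed to draw the box plot can be worked out as in the sketch below. It assumes the common 1.5 × IQR whisker convention, which the original solution does not state; under that assumption the whiskers stop at the most extreme observations inside the fences and anything beyond is drawn as an individual outlier. The quartile positions (7th, 14th, 20th) are the ones used in the solution.

```python
# Age values from the question, already sorted in increasing order.
ages = [13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25,
        30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70]

# 1-based positions taken from the solution: 7th, 14th, and 20th observations.
q1, med, q3 = ages[7 - 1], ages[14 - 1], ages[20 - 1]
five_number = (min(ages), q1, med, q3, max(ages))

# Assumed convention: fences sit 1.5 * IQR beyond the quartiles.
iqr = q3 - q1
lower_fence = q1 - 1.5 * iqr
upper_fence = q3 + 1.5 * iqr

# Whiskers end at the most extreme observations that lie inside the fences.
inside = [x for x in ages if lower_fence <= x <= upper_fence]
whisker_low, whisker_high = min(inside), max(inside)
outliers = [x for x in ages if x < lower_fence or x > upper_fence]

print(five_number, iqr, (lower_fence, upper_fence))
print(whisker_low, whisker_high, outliers)
```

If the plot is instead drawn with whiskers running all the way to the minimum and maximum, only `five_number` is needed.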

Part g

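A quantile plot uses a single data set: each value, sorted in increasing order, is plotted against the approximate fraction of the data at or below it. A quantile-quantile (q-q) plot uses two data sets: the quantiles of one are plotted against the corresponding quantiles of the other, which shows whether the two come from the same population (distribution).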
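The difference can be made concrete by building the point lists that each plot would draw. This is a minimal sketch: it assumes the common quantile fraction f_i = (i − 0.5)/n for the i-th sorted value, and the second data set `other_ages` is made up for illustration (the original ages shifted by 5), not taken from the question.

```python
# Age values from the question, already sorted in increasing order.
ages = [13, 15, 16, 16, 19, 20, 20, 21, 22, 22, 25, 25, 25, 25,
        30, 33, 33, 35, 35, 35, 35, 36, 40, 45, 46, 52, 70]
n = len(ages)

# Quantile plot (one data set): pair each sorted value x_i with its
# fraction f_i = (i - 0.5) / n, where i is the 1-based rank.
quantile_points = [((i - 0.5) / n, x) for i, x in enumerate(sorted(ages), start=1)]

# Q-Q plot (two data sets): pair the i-th quantile of one data set with the
# i-th quantile of the other. other_ages is HYPOTHETICAL illustration data.
other_ages = [x + 5 for x in ages]
qq_points = list(zip(sorted(ages), sorted(other_ages)))

print(quantile_points[13])  # the 14th value sits at fraction 0.5 (the median)
print(qq_points[:3])
```

Because the two samples here have the same size, their sorted values can be paired directly; with unequal sizes, the quantiles of the larger sample would have to be interpolated at the fractions of the smaller one. In this made-up case every q-q point lies 5 units above the line y = x, which is how a q-q plot reveals a shift between two distributions.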

