Question

Consider Dataset A for the questions that follow.

a. Calculate the measures of central tendency for Variable X and Variable Y.
   i. Mean
   ii. Median
   iii. Mode
   iv. Midrange
   v. What can you say about the skewness of the X and Y variables?

b. Calculate the measures of variation for Variable X and Variable Y.
   i. Range
   ii. Variance
   iii. Standard deviation
   iv. Coefficient of variation
   v. Which is more variable, X or Y? Why?

c. Calculate the measures of position for Variable X.
   i. Z-score of the mean value of X
   ii. Percentile rank of the maximum value of X
   iii. Check for any outliers in Variable X

Variable X: 6, 7, 8, 8, 8, 9, 9, 9, 11, 11

Variable Y: -2.77, -0.23, -0.29, -0.05, 0.33, 0.43, 0.51, 0.63, 0.85, 1.12

Solutions

Expert Solution

Variable X:

--------------------------------------------------------------------------

         X     (X - X̄)²
         6       6.76
         7       2.56
         8       0.36
         8       0.36
         8       0.36
         9       0.16
         9       0.16
         9       0.16
        11       5.76
        11       5.76
Sum     86      22.40
n       10      10

mean = ΣX/n = 86/10 = 8.6

sample variance = Σ(X - X̄)²/(n-1) = 22.40/9 = 2.489

sample std dev = √[Σ(X - X̄)²/(n-1)] = √(22.40/9) = 1.5776
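
As a quick cross-check of the arithmetic above, here is a minimal Python sketch that uses only the standard library's statistics module. The variable names are illustrative and are not part of the original solution.

```python
import statistics

# Variable X from Dataset A
x = [6, 7, 8, 8, 8, 9, 9, 9, 11, 11]

mean_x = statistics.mean(x)        # sum of values divided by n
var_x = statistics.variance(x)     # sample variance (divides by n - 1)
sd_x = statistics.stdev(x)         # square root of the sample variance

print(mean_x, round(var_x, 3), round(sd_x, 4))
```

Note that statistics.variance and statistics.stdev use the n - 1 divisor, which matches the sample formulas in the solution; statistics.pvariance and statistics.pstdev would divide by n instead.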

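range = max - min = 11 - 6 = 5
midrange = (max + min)/2 = (11 + 6)/2 = 8.5

median = average of the 5th and 6th ordered values = (8 + 9)/2 = 8.5

mode = most frequent value(s) = 8 and 9 (each occurs three times, so X is bimodal)

coefficient of variation, CV = s/X̄ = 1.5776/8.6 = 0.183444

skewness using Pearson's coefficient of skewness, PC:
PC = 3(mean - median)/std dev = 3(8.6 - 8.5)/1.5776 = 0.190160
PC is slightly positive, so X is mildly right-skewed.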
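
The central-tendency values and Pearson's coefficient for X can be reproduced with the short sketch below (standard library only; names are illustrative). statistics.multimode reports every value tied for the highest frequency, which for this data is both 8 and 9.

```python
import statistics

x = [6, 7, 8, 8, 8, 9, 9, 9, 11, 11]

median_x = statistics.median(x)       # mean of the two middle values when n is even
modes_x = statistics.multimode(x)     # all values tied for the highest frequency
midrange_x = (max(x) + min(x)) / 2    # halfway between the extremes

# Pearson's coefficient of skewness: 3 * (mean - median) / sample std dev
pc_x = 3 * (statistics.mean(x) - median_x) / statistics.stdev(x)

print(median_x, modes_x, midrange_x, round(pc_x, 4))
```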

Q1 = 7.75, Q3 = 9.5

IQR = Q3 - Q1 = 9.5 - 7.75 = 1.75

1.5 × IQR = 2.625

lower bound = Q1 - 1.5 × IQR = 7.75 - 2.625 = 5.125

upper bound = Q3 + 1.5 × IQR = 9.5 + 2.625 = 12.125

outliers = values below the lower bound or above the upper bound
outliers below the lower bound = 0
outliers above the upper bound = 0
total outliers = 0

Z-score of the mean value = (X̄ - X̄)/s = 0

Percentile rank of the maximum value = 1 (as a proportion, i.e. 100%)
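
The solution does not say which percentile-rank convention it used. The sketch below assumes the convention "values below / (values below + values above)", which reproduces the reported value of 1, and also shows the common midpoint convention for comparison. Both function names are hypothetical helpers written for this illustration.

```python
import statistics

x = [6, 7, 8, 8, 8, 9, 9, 9, 11, 11]
mean_x = statistics.mean(x)
sd_x = statistics.stdev(x)

def z_score(value):
    """Distance of value from the mean, in sample standard deviations."""
    return (value - mean_x) / sd_x

def percentile_rank(value, data):
    """Assumed convention: below / (below + above).
    Not defined when every value in data equals value."""
    below = sum(1 for v in data if v < value)
    above = sum(1 for v in data if v > value)
    return below / (below + above)

def percentile_rank_midpoint(value, data):
    """Alternative convention: (below + half of the ties) / n."""
    below = sum(1 for v in data if v < value)
    equal = sum(1 for v in data if v == value)
    return (below + 0.5 * equal) / len(data)

z_of_mean = z_score(mean_x)
pr_max = percentile_rank(max(x), x)
pr_max_mid = percentile_rank_midpoint(max(x), x)
print(z_of_mean, pr_max, pr_max_mid)
```

Under the midpoint convention the maximum value 11 has a percentile rank of 0.9 rather than 1, so it is worth stating which definition is being used.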

Variable Y

-------------------------------------

         Y      (Y - Ȳ)²
Sum     0.53    10.712
n       10      10

mean = ΣY/n = 0.53/10 = 0.0530

sample variance = Σ(Y - Ȳ)²/(n-1) = 10.7120/9 = 1.190

sample std dev = √[Σ(Y - Ȳ)²/(n-1)] = √(10.7120/9) = 1.0910

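range = max - min = 1.12 - (-2.77) = 3.89
midrange = (max + min)/2 = (1.12 + (-2.77))/2 = -0.825

median = average of the 5th and 6th ordered values = (0.33 + 0.43)/2 = 0.38

mode = none (no value repeats)

skewness = -2.1904 (strongly negative, so Y is left-skewed; the low value -2.77 pulls the mean below the median)

coefficient of variation, CV = s/Ȳ = 1.0910/0.0530 = 20.584407

Y has the larger CV (20.58 versus 0.18 for X), so Y is more variable relative to its mean, even though X has the larger standard deviation in absolute terms.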
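
The solution does not state which skewness formula produced -2.1904. The sketch below assumes the adjusted sample skewness, n/((n-1)(n-2)) · Σ((y - ȳ)/s)³, which reproduces that figure to four decimals; it also recomputes the coefficient of variation. Variable names are illustrative.

```python
import statistics

# Variable Y from Dataset A
y = [-2.77, -0.23, -0.29, -0.05, 0.33, 0.43, 0.51, 0.63, 0.85, 1.12]
n = len(y)

mean_y = statistics.mean(y)
sd_y = statistics.stdev(y)       # sample standard deviation (n - 1 divisor)

# Coefficient of variation: standard deviation relative to the mean
cv_y = sd_y / mean_y

# Assumed formula: adjusted sample skewness
skew_y = n / ((n - 1) * (n - 2)) * sum(((v - mean_y) / sd_y) ** 3 for v in y)

print(round(mean_y, 4), round(sd_y, 4), round(cv_y, 2), round(skew_y, 4))
```

Because the mean of Y is close to zero, the CV is very sensitive to small changes in the data, which is one reason it comes out so much larger than the CV of X.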

lower bound = Q1 - 1.5 × IQR = -1.64

upper bound = Q3 + 1.5 × IQR = 2.08

outliers = values below the lower bound or above the upper bound
outliers below the lower bound = 1 (the value -2.77)
outliers above the upper bound = 0
total outliers = 1
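
The 1.5 × IQR bounds for both variables can be checked with the sketch below. It assumes quartiles are interpolated at position p(n + 1), which is what statistics.quantiles does with method="exclusive" and which reproduces the bounds reported in the solution. The helper name tukey_fences is hypothetical.

```python
import statistics

def tukey_fences(data):
    """Return (lower bound, upper bound, outliers) using the 1.5 * IQR rule."""
    # quantiles() sorts the data itself; n=4 yields Q1, Q2 (median), Q3
    q1, _, q3 = statistics.quantiles(data, n=4, method="exclusive")
    iqr = q3 - q1
    lower = q1 - 1.5 * iqr
    upper = q3 + 1.5 * iqr
    outliers = [v for v in data if v < lower or v > upper]
    return lower, upper, outliers

x = [6, 7, 8, 8, 8, 9, 9, 9, 11, 11]
y = [-2.77, -0.23, -0.29, -0.05, 0.33, 0.43, 0.51, 0.63, 0.85, 1.12]

lo_x, hi_x, out_x = tukey_fences(x)
lo_y, hi_y, out_y = tukey_fences(y)

print(lo_x, hi_x, out_x)
print(round(lo_y, 2), round(hi_y, 2), out_y)
```

Other quartile conventions (for example method="inclusive") give slightly different bounds, so the outlier count can depend on the method chosen.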
