Question

In: Statistics and Probability

The data set “UCBAdmissions” in R contains admission decisions by gender at six departments of UC...

The data set “UCBAdmissions” in R contains admission decisions by gender at six departments of UC Berkeley. For this data set, carry out appropriate test for independence between the admission decision and gender for each of the departments.

What are your conclusions? Please submit your R script with the answer.

Solutions

Expert Solution

Install the package "vcd". You will find a function called assocstats().

install.package("vcd")

library(vcd)

> assocstats(UCBAdmissions)
$`Dept:A`
                    X^2 df   P(> X^2)
Likelihood Ratio 19.054  1 1.2707e-05
Pearson          17.248  1 3.2804e-05

Phi-Coefficient   : 0.136 
Contingency Coeff.: 0.135 
Cramer's V        : 0.136 

$`Dept:B`
                     X^2 df P(> X^2)
Likelihood Ratio 0.25864  1  0.61105
Pearson          0.25372  1  0.61447

Phi-Coefficient   : 0.021 
Contingency Coeff.: 0.021 
Cramer's V        : 0.021 

$`Dept:C`
                     X^2 df P(> X^2)
Likelihood Ratio 0.75098  1  0.38616
Pearson          0.75354  1  0.38536

Phi-Coefficient   : 0.029 
Contingency Coeff.: 0.029 
Cramer's V        : 0.029 

$`Dept:D`
                     X^2 df P(> X^2)
Likelihood Ratio 0.29787  1  0.58522
Pearson          0.29798  1  0.58515

Phi-Coefficient   : 0.019 
Contingency Coeff.: 0.019 
Cramer's V        : 0.019 

$`Dept:E`
                     X^2 df P(> X^2)
Likelihood Ratio 0.99039  1  0.31965
Pearson          1.00107  1  0.31705

Phi-Coefficient   : 0.041 
Contingency Coeff.: 0.041 
Cramer's V        : 0.041 

$`Dept:F`
                     X^2 df P(> X^2)
Likelihood Ratio 0.38362  1  0.53567
Pearson          0.38409  1  0.53542

Phi-Coefficient   : 0.023 
Contingency Coeff.: 0.023 
Cramer's V        : 0.0

Look at the p-values.Only department A has some dependence between admission decision and gender. For the rest, the conclusion is inconclusive as we cannot reject the null hypothesis of independence.


Related Solutions

Load “Lock5Data” into your R console. Load “OlympicMarathon” data set in “Lock5Data”. This data set contains...
Load “Lock5Data” into your R console. Load “OlympicMarathon” data set in “Lock5Data”. This data set contains population of all times to finish the 2008 Olympic Men’s Marathon. a) What is the population size? b) Now using “Minutes” column generate a random sample of size 5. c) Calculate the sample mean and record it (create a excel sheet or write a direct R program to record this) d) Continue steps (b) and (c) 10,000 time (that mean you have recorded 10,000...
Write R code: Here are the first six observations from the prostate data set found in...
Write R code: Here are the first six observations from the prostate data set found in the faraway library. Use help(prostate) to describe the dataset and the variables in the data sets. obs lcavol lweight age lbph svi lcp gleason pgg45 lpsa 1 -0.579819 2.7695 50 -1.38629 0 -1.38629 6 0 -0.43078 2 -0.994252 3.3196 58 -1.38629 0 -1.38629 6 0 -0.16252 3 -0.510826 2.6912 74 -1.38629 0 -1.38629 7 20 -0.16252 4 -1.203973 3.2828 58 -1.38629 0 -1.38629 6...
The dataset HairEyesColor in R contains classifications of students by gender, color, and eye color. a)...
The dataset HairEyesColor in R contains classifications of students by gender, color, and eye color. a) How many students are included in the data set? b) Display the information using mosaicplot (HairEyeColor, col=c(1,2)) c) Is hair color independent of eye color for men? d) Is hair color independent of eye color for women? Use R and show R code
Construct a scattergram for each data set. Then calculate r and r 2 for each data...
Construct a scattergram for each data set. Then calculate r and r 2 for each data set. Interpret their values. Complete parts a through d a. x −1 0 1 2 3 y −3 0 1 4 5 Calculate r. r=. 9853 ​(Round to four decimal places as​ needed.) Calculate r2. r2=0.9709. ​(Round to four decimal places as​ needed.) Interpret r. Choose the correct answer below. A.There is not enough information to answer this question. B.There is a very strong...
Construct a scattergram for each data set. Then calculate r and r2 for each data set....
Construct a scattergram for each data set. Then calculate r and r2 for each data set. Interpret their values. Complete parts a through d. a. x −1 0 1 2 3 y −3 0 1 4 5 Calculate r. r=. 9853.​(Round to four decimal places as​ needed.) Calculate r2. r2=0.9709​(Round to four decimal places as​ needed.) Interpret r. Choose the correct answer below. A.There is not enough information to answer this question. B.There is a very strong negative linear relationship...
For the data set below, calculate r, r 2, and a 95% confidence interval in r...
For the data set below, calculate r, r 2, and a 95% confidence interval in r units. Then write a one- to two-sentence conclusion statement that includes whether the null hypothesis was rejected or not. Assume a two-tailed hypothesis and α = .05. Case 1 Case 2 Case 3 Case 4 Case 5 Case 6 X 1.05 1.15 1.30 2.00 1.75 1.00 Y 2 2 3 4 5 2
Data shows graduate program admission decisions (Yes: 1 and No: 2), GRE score and undergraduate GPA...
Data shows graduate program admission decisions (Yes: 1 and No: 2), GRE score and undergraduate GPA for twenty-five students. Tasks: Examine if the given data is suitable for the application of linear discriminant analysis. Create a linear discriminant function predicting admission decisions. Comment on the classification accuracy. Predict the admission decision given GRE score = 690 and GPA = 3.2. Perform logistic regression analysis for the data. Compare the classification accuracies of both methods. Admit GRE GPA 2 790 3.8...
Use Minitab Data shows graduate program admission decisions (Yes: 1 and No: 2), GRE score and...
Use Minitab Data shows graduate program admission decisions (Yes: 1 and No: 2), GRE score and undergraduate GPA for twenty-five students. Tasks: Examine if the given data is suitable for the application of linear discriminant analysis. Create a linear discriminant function predicting admission decisions. Comment on the classification accuracy. Predict the admission decision given GRE score = 690 and GPA = 3.2. Perform logistic regression analysis for the data. Compare the classification accuracies of both methods. Admit GRE GPA 2...
Write code in R for this questions,, will vote!! Load the Taxi.txt data set into R....
Write code in R for this questions,, will vote!! Load the Taxi.txt data set into R. (a) Calculate the mean, median, standard deviation, 30th percentile, and 65th percentile for Mileage and TripTime. (b) Make a frequency table for PaymentProvider that includes a Sum column. Report the resulting table. (c) Make a contingency table comparing PaymentType and Airport. Report the resulting table. (d) Use the cor() function to find the correlation between each pair of the Meter, Tip, Mileage, and TripTime...
( In R / R studio ) im not sure how to share my data set,...
( In R / R studio ) im not sure how to share my data set, but below is the title of my data set and the 12 columns of my data set. Please answer as best you can wheather its pseudo code, partial answers, or just a suggestion on how i can in to answer the question. thanks #---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- The dataset incovid_sd_20201001.RDatacontains several variables related to infections of covid-19 for eachzip code in San Diego County as of October...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT