In: Statistics and Probability
Researchers want to study Drug-induced liver injury (DILI). DILIscore in the data set indicates how serious the injury is. From the data, please answer the following questions(you can use R):
1. Report descriptive statistics (Summary statistics, histogram and boxplots for numerical variables and frequency table for categorical variables) and outliers (if there is any).
2. Use correlation analysis to find if DILIscore is significantly correlated with patient age? Is DILIscore significantly correlated with drug dose?
3. Use T test to find if there is any significant difference in DILIscore for male and female? Is there any significant difference in DILIscore for positive RO2 and negative RO2? Use anova to find if there is any difference in DILIscore among racial groups.
4. Build a multiple regression model for DILIscore and summarize your results.
PatientGender | Race | PatientAge | DILIscore_dose | RO2 | Daily Dose (mg/day) |
Female | White | 76 | 0.43 | Negative | 10 |
Female | Black | 76 | 0.43 | Negative | 10 |
Female | White | 34 | 0.66 | Negative | 1.5 |
Female | White | 67 | 1.38 | Negative | 10 |
Male | White | 83 | 1.38 | Negative | 10 |
Male | White | 64 | 1.39 | Negative | 10 |
Female | Black | 49 | 1.71 | Negative | 10 |
Female | Black | 33 | 1.98 | Negative | 10 |
Male | Black | 55 | 2.05 | Negative | 40 |
Female | White | 74 | 2.44 | Negative | 20 |
Female | White | 30 | 2.54 | Negative | 20 |
Male | Asian | 44 | 2.6 | Negative | 75 |
Male | Asian | 32 | 2.73 | Negative | 100 |
Male | Hispanic | 59 | 2.74 | Negative | 50 |
Female | Hispanic | 67 | 3.08 | Negative | 500 |
Male | White | 32 | 3.08 | Negative | 300 |
Male | White | 43 | 3.08 | Negative | 300 |
Male | White | 32 | 3.23 | Negative | 200 |
Male | White | 45 | 3.29 | Negative | 400 |
Male | White | 36 | 3.29 | Negative | 400 |
Female | White | 10 | 3.29 | Negative | 400 |
Female | Asian | 32 | 3.33 | Negative | 1 |
Female | Asian | 12 | 3.45 | Negative | 80 |
Male | Asian | 58 | 3.5 | Negative | 35 |
Male | Asian | 66 | 3.5 | Negative | 35 |
Male | Asian | 53 | 3.7 | Negative | 1000 |
Male | Black | 26 | 3.73 | Negative | 400 |
Male | White | 48 | 3.81 | Negative | 600 |
Male | White | 39 | 3.81 | Negative | 600 |
Male | White | 61 | 3.82 | Negative | 6 |
Male | White | 55 | 4.02 | Negative | 10 |
Female | Black | 73 | 4.08 | Positive | 300 |
Female | Asian | 62 | 4.08 | Positive | 300 |
Female | White | 74 | 4.12 | Positive | 400 |
Female | Black | 74 | 4.14 | Negative | 500 |
Female | White | 80 | 4.16 | Negative | 1000 |
Female | Black | 84 | 4.16 | Negative | 1000 |
Male | White | 63 | 4.16 | Negative | 500 |
Female | White | 43 | 4.17 | Negative | 10 |
Male | White | 48 | 4.17 | Positive | 300 |
Male | Asian | 45 | 4.19 | Negative | 400 |
Male | White | 69 | 4.26 | Positive | 300 |
Female | White | 69 | 4.26 | Positive | 300 |
Male | Black | 61 | 4.3 | Positive | 200 |
Female | White | 74 | 4.3 | Positive | 200 |
Female | White | 34 | 4.56 | Positive | 300 |
Female | White | 26 | 4.56 | Positive | 300 |
Male | White | 41 | 4.59 | Negative | 2000 |
Female | Hispanic | 47 | 4.67 | Negative |
10 |