In: Computer Science
Task 1
Please import the “admit.csv” into Rstudio. In this dataset, we know the GRE score, the GPA, and the rankof 400 applicants for a graduate program. We also know if each of the candidates is admitted. In the admit column, 1 stands for “admitted”, and 0 stands for “rejected”. Please answer the following questions and include the codes.
1. import the dataset and call it "mydata". Then check the structure of the data
2. convert the data type of the admit and the rank column from int to factors
3. randomly select 80% of the dataset as training set and the rest as the testing set
4. train a decision tree model, using admit as the category, and gre, gpa, and rank as predictors. Then plot the tree
5. Please answer the question: if a candidate has a GPA of 3.7, and rank of 4, does this candidate have a higher chance to be admitted or to be rejected? Please note that when you only have two categories, the darker proportion stands for the proportion for 1 in the end node of the tree plot
6. Please calculate the accuracy of your decision tree model