In: Computer Science
Using the Titanic passenger dataset (titanic.csv). Is it better to split on gender or Pclass (1 or not 1)? Compute information gain for each option and say which is best. Submit the gain and show your work and your choice.
- I cant upload the dataset on here. Can I send a link of it from google? What can I do?
From above calculations its evident that Information gain in Gender is more than of Pclass hence split it on Gender.
COMMENT DOWN FOR ANY QUERY, IF IT HELPED YOU DO LIKE :)