Question

In: Computer Science

Using the Titanic passenger dataset (titanic.csv). Is it better to split on gender or Pclass (1...

Using the Titanic passenger dataset (titanic.csv). Is it better to split on gender or Pclass (1 or not 1)? Compute information gain for each option and say which is best. Submit the gain and show your work and your choice.

- I cant upload the dataset on here. Can I send a link of it from google? What can I do?

Solutions

Expert Solution

From above calculations its evident that Information gain in Gender is more than of Pclass hence split it on Gender.

COMMENT DOWN FOR ANY QUERY, IF IT HELPED YOU DO LIKE :)


Related Solutions

Use Titanic dataset and perform EDA on various columns. Without using any modeling algorithms, and only...
Use Titanic dataset and perform EDA on various columns. Without using any modeling algorithms, and only using basic methods such as frequency distribution, describe the most important predictors of survival of Titanic passengers, e.g. were males or females more likely to survive, were young and rich females more likely to survive than old poor males etc? Submit the  response in a fully "knit" R Markdown file.
Consider the Titanic data stored in a structure variable named T. Passenger ID Survived 1=Yes, 2=No...
Consider the Titanic data stored in a structure variable named T. Passenger ID Survived 1=Yes, 2=No Gender 1=Male, 2=Female Age Fare ($) T = 121 2 1 22 7.25 243 1 2 38 71.28 432 1 2 26 7.93 564 1 2 35 53.1 856 2 1 35 8.05 As a table of 6 students, complete the following – you may want to have a couple of students checking the syntax and output on MATLAB. 1. Draw the table (Titanic...
Titanic: A passenger jumped into the freezing water of temperature -2ºC (2ºC below zero). The person...
Titanic: A passenger jumped into the freezing water of temperature -2ºC (2ºC below zero). The person was wearing 3 layers of clothes with the combined thickness of 1 cm, and the clothes’ emissivity was 0.87. The person’s mass was 65 kg, surface area was 1.75 m^2 , and the body’s specific heat was 3500 J/kgºC. Assume that the skin temperature of the person was always constant at 13ºC. The thermal conductivity of the water is 0.6 W/mK, and the Stefan-Boltzmann...
Question 2: “The RMS Titanic was an Olympic-class passenger liner, launched for the first time in...
Question 2: “The RMS Titanic was an Olympic-class passenger liner, launched for the first time in 31 May 1911. The ship’s steel was “probably the best plain carbon ship plate of the time”. Nevertheless, shortly before midnight on 14 April 1912, four days into the ship’s maiden voyage, Titanic hit an iceberg. The impact caused the hull to fracture and water poured into the ship, sinking it two hours and forty minutes later. Of the 2223 people on board, only...
Consider Dataset C for answering the questions that follows below. Team Gender Responses 1 A Male...
Consider Dataset C for answering the questions that follows below. Team Gender Responses 1 A Male 3.25 2 A Male 3.54 3 A Male 1.08 4 A Male 2.14 5 A Male 3.60 6 B Male 4.36 7 B Male 4.66 8 B Male 1.52 9 B Male 3.99 10 B Male 3.60 11 C Female 3.86 12 C Female 4.89 13 C Female 1.46 14 C Female 4.74 15 C Female 4.16 Teams A, B and C have been...
ANSWER USING R CODE Using the dataset 'LakeHuron' which is a built in R dataset describing...
ANSWER USING R CODE Using the dataset 'LakeHuron' which is a built in R dataset describing the level in feet of Lake Huron from 1872- 1972. To assign the values into an ordinary vector,x, we can do the following 'x <- as.vector(LakeHuron)'. From there, we can access the data easily. Assume the values in X are a random sample from a normal population with distribution X. Also assume the X has an unknown mean and unknown standard deviation. With this...
The dataset HairEyesColor in R contains classifications of students by gender, color, and eye color. a)...
The dataset HairEyesColor in R contains classifications of students by gender, color, and eye color. a) How many students are included in the data set? b) Display the information using mosaicplot (HairEyeColor, col=c(1,2)) c) Is hair color independent of eye color for men? d) Is hair color independent of eye color for women? Use R and show R code
Use the multi-layer perceptron algorithm to learn a model that classifies IRIS flower dataset. Split the...
Use the multi-layer perceptron algorithm to learn a model that classifies IRIS flower dataset. Split the dataset into a train set to train the algorithm and test set to test the algorithm. Calculate the accuracy. Use Scikit-Learn
Fitting a linear model using R a. Read the Toluca.txt dataset into R (this dataset can...
Fitting a linear model using R a. Read the Toluca.txt dataset into R (this dataset can be found on Canvas). Now fit a simple linear regression model with X = lotSize and Y = workHrs. Summarize the output from the model: the least square estimators, their standard errors, and corresponding p-values. b. Draw the scatterplot of Y versus X and add the least squares line to the scatterplot. c. Obtain the fitted values ˆyi and residuals ei . Print the...
The dataset flatulence.xlsx includes the variables gender, the self-reported number of times per day the respondent...
The dataset flatulence.xlsx includes the variables gender, the self-reported number of times per day the respondent passes gas (perday), and the number of months the respondent claims to wait before passing gas in front of a romantic partner (howlong). Find the 95% confidence interval for the average number of times a person passes gas in a day. Find the 99% confidence interval for: the average number of months a female waits before passing gas in front of a romantic partner,...
ADVERTISEMENT
ADVERTISEMENT
ADVERTISEMENT