2 Titanic Survival Prediction
In machine learning applications, one of the first exercises is to build a model to classify Titanic survivors. The exercise has a little practical value beyond being a learning exercise. However, there are a lot of interesting findings from this data set.
Kaggle hosted a competition using Titanic data a while back, and it is accessible here: https://www.kaggle.com/c/titanic/data
You can also download CSV files from my Github repository: https://github.com/ashgreat/datasets. We will use these links in the code.
The objectives of this exercise are as follows:
Use binary logistic regression model to classify survivors and deaths
Use XGBoost to classify survivors and deaths