2 Titanic Survival Prediction

In machine learning applications, one of the first exercises is to build a model to classify Titanic survivors. The exercise has a little practical value beyond being a learning exercise. However, there are a lot of interesting findings from this data set.

Kaggle hosted a competition using Titanic data a while back, and it is accessible here: https://www.kaggle.com/c/titanic/data

You can also download CSV files from my Github repository: https://github.com/ashgreat/datasets. We will use these links in the code.

The objectives of this exercise are as follows:

  1. Use binary logistic regression model to classify survivors and deaths

  2. Use XGBoost to classify survivors and deaths