Description: Study data about the tragic sinking of the Titanic and develop a machine learning model which predicts the survival of passengers.
The team performed the following activities:
Data Cleaning
Exploratory Data Analysis
Selection of Random Forest Model
Optimization of Model
Running Model on Unseen Test Data to Form Survival Prediction
Used Google Colab for Running Python Code and as a Collaborative Platform
Results: Able to predict survival accuracy around 80% with random forest model. Preprocessing data (e.g. extracting Cabin letter) improved accuracy. The biggest factors were class, family size, gender, and title