Diploma in Data Science and Machine Learning

Supervised Learning Assignment (in Spanish)

Dataset from Kaggle. This is a classification task on an imbalanced dataset, where the objective is to predict whether a customer will buy travel insurance. We encoded, split, and scaled the dataset; trained several models (SVM, XGBoost, Logistic Regression, KNN, Random Forest, Gradient Boosting), tuning their hyperparameters with grid search; evaluated them; and built an ensemble from the best ones. This yielded an F1 score of 0.72 on the test set.
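
To illustrate why F1 is a reasonable metric for an imbalanced task and how an ensemble can outperform its members, here is a minimal sketch in plain Python. It is not the assignment's code: the labels and the three model outputs are made up, the helper names `f1_score` and `majority_vote` are hypothetical, and hard (majority) voting is an assumption, since the ensemble type used in the assignment is not stated here.

```python
from collections import Counter


def f1_score(y_true, y_pred, positive=1):
    """F1 for the positive class: harmonic mean of precision and recall."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    if tp == 0:
        return 0.0  # no true positives: precision and recall are zero or undefined
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def majority_vote(*model_preds):
    """Hard-voting ensemble: pick the most common label for each sample."""
    return [Counter(votes).most_common(1)[0][0] for votes in zip(*model_preds)]


# Made-up data: 3 buyers (label 1) among 10 customers.
y_true = [0, 0, 0, 0, 0, 0, 0, 1, 1, 1]

# Predicting "no purchase" for everyone looks fine on accuracy but not on F1.
always_negative = [0] * len(y_true)
baseline_accuracy = sum(t == p for t, p in zip(y_true, always_negative)) / len(y_true)
baseline_f1 = f1_score(y_true, always_negative)

# Three hypothetical models, each making two different mistakes.
model_a = [0, 0, 0, 0, 0, 0, 1, 1, 1, 0]
model_b = [0, 0, 0, 0, 0, 1, 0, 1, 0, 1]
model_c = [0, 0, 0, 0, 1, 0, 0, 0, 1, 1]

ensemble = majority_vote(model_a, model_b, model_c)

print("baseline accuracy:", baseline_accuracy, "baseline F1:", baseline_f1)
for name, preds in [("a", model_a), ("b", model_b), ("c", model_c), ("ensemble", ensemble)]:
    print(name, round(f1_score(y_true, preds), 3))
```

In this toy setup the models err on different samples, so the vote cancels their mistakes. Real models tend to make correlated errors, so the gain from an ensemble is usually smaller.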