This project is a part of Machine Learning Class for Data Science Master Program from Simplilearn
This project is done with guidance from:
https://github.com/subhadipml/Mercedes-Benz-Greener-Manufacturing
https://medium.com/swlh/greener-manufacturing-with-machine-learning-6ec77d0e7a91
-
Import all the required libraries (some libraries may be imported later)
-
Load and prepare the data for analysis. This step include:
~ Check for null and unique values for test and train sets
~ Find and remove outliers from the datasets
~ Check variance from categorical variables
~ Remove variables with very little to zero variance
~ Apply label encoder
-
Perform dimensionality reduction
-
XGBoost Analysis:
~ Predict your test values using XGBoost
~ k-fold Cross Validation using XGBoost
~ Test data prediction using XGBoost