Random Forest Regressor

This Jupyter notebook serves as part of the data science pipeline by providing a quick and easy framework to perform feature enginnering, model training and feature importance analysis for data exploration. In this particular notebook, Sci-Kit Learn's RandomForestRegressor was trained on information regarding housing in Perth to numerically predict house prices based on floor space, suburb, number of bedrooms, etc. Feature importance analysis was performed using built-in methods that calculate importance by node impurity. However, SHAP was also used to provide a more robust and in-depth analysis via Shapley values.

Features

Model saving and loading.
Hyperparameter tuning via Bayesian optimization.
Feature importance analysis using tree node impurity and Shapley values.

Future Improvements

Custom user input to the model (involves writting a custom data encoder instead of using pandas.get_dummies()).
Reducing the disk size of saved models.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
LICENSE		LICENSE
README.md		README.md
data_prep.ipynb		data_prep.ipynb
dataset.csv		dataset.csv
main.ipynb		main.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Random Forest Regressor

Features

Future Improvements

About

Releases

Packages

Languages

License

xPrithvi/Random-Forest-Regressor

Folders and files

Latest commit

History

Repository files navigation

Random Forest Regressor

Features

Future Improvements

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages