Car Price Prediction Using Machine Learning

Welcome to the Car Price Prediction project! This repository contains the code and resources used to predict car prices based on various features using machine learning techniques.

Project Overview

This project aims to predict the selling price of cars using a dataset from Kaggle. The dataset contains various features of cars such as age, mileage, fuel type, seller type, transmission type, and more. Multiple machine learning algorithms were used to build and evaluate models to achieve accurate predictions.

Dataset

The dataset used in this project is obtained from Kaggle. You can find it here.

Project Structure

The repository contains the following files and directories:

CarPricePrediction.ipynb: Jupyter notebook containing the code for data preprocessing, model building, and evaluation.
car_price_prediction_model.pkl: Trained Random Forest model saved using joblib.
README.md: Project overview and documentation.
requirements.txt: List of Python packages required to run the project.

Getting Started

Prerequisites

Make sure you have the following installed:

Python 3.6 or higher
Jupyter Notebook
Required Python packages (listed in requirements.txt)

Installation

Clone the repository:

git clone https://github.com/harishsemwal/CarPricePrediction.git
cd CarPricePrediction

Install the required packages:
```
pip install -r requirements.txt
```

Run the Jupyter notebook:

jupyter notebook CarPricePrediction.ipynb

Data Preprocessing

The data preprocessing steps include:

Importing necessary libraries and the dataset.
Exploring the dataset for understanding and identifying missing values.
Dropping irrelevant columns.
Creating new features such as the car's age.
Encoding categorical variables using one-hot encoding.
Visualizing correlations between features and the target variable.

Model Building

Several machine learning algorithms were used to build the prediction models:

Linear Regression
Multiple Linear Regression
Random Forest Regressor
Decision Tree Regressor

Model Evaluation

Each model was evaluated using the R-squared score to determine its performance. The results were as follows:

Random Forest Regressor: 95% accuracy
Decision Tree Regressor: 94% accuracy
Multiple Linear Regression: 91% accuracy

Visualizations

Multiple Linear Regression

Random Forest Regressor

Decision Tree Regressor

Hyperparameter Tuning

RandomizedSearchCV was used to find the optimal parameters for the Random Forest Regressor to improve its performance.

Results

The final model, Random Forest Regressor, was trained with the optimal parameters and achieved a high R-squared score on the test data.

Future Work

Incorporating additional features like car brand and model.
Exploring advanced machine learning algorithms like Gradient Boosting and XGBoost.
Enhancing data quality by collecting more recent car listings.
Deploying the model in a web application for real-time predictions.
Applying advanced feature engineering techniques.

Conclusion

This project successfully demonstrated the use of machine learning algorithms to predict car prices. The Random Forest Regressor provided the best performance among the models tested. Future enhancements can further improve the accuracy and usability of the model.

Contributing

Feel free to fork this repository and contribute by submitting a pull request. For major changes, please open an issue to discuss what you would like to change.

License

This project is licensed under the MIT License.

Acknowledgments

Thanks to Kaggle for providing the dataset.
Special thanks to all the contributors of the libraries used in this project.

Developed by Harish Prasad Semwal
Email: [email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
static/css		static/css
templates		templates
.gitattributes		.gitattributes
Car Price Prediction Model Using ML.ipynb		Car Price Prediction Model Using ML.ipynb
CarDekho_Car_Price_Prediction_Dataset.csv		CarDekho_Car_Price_Prediction_Dataset.csv
README.md		README.md
app.py		app.py
car_price_prediction_model.pkl		car_price_prediction_model.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Car Price Prediction Using Machine Learning

Project Overview

Dataset

Project Structure

Getting Started

Prerequisites

Installation

Data Preprocessing

Model Building

Model Evaluation

Visualizations

Multiple Linear Regression

Random Forest Regressor

Decision Tree Regressor

Hyperparameter Tuning

Results

Future Work

Conclusion

Contributing

License

Acknowledgments

About

Releases

Packages

Languages

harishsemwal/CarPricePrediction

Folders and files

Latest commit

History

Repository files navigation

Car Price Prediction Using Machine Learning

Project Overview

Dataset

Project Structure

Getting Started

Prerequisites

Installation

Data Preprocessing

Model Building

Model Evaluation

Visualizations

Multiple Linear Regression

Random Forest Regressor

Decision Tree Regressor

Hyperparameter Tuning

Results

Future Work

Conclusion

Contributing

License

Acknowledgments

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages