The goal of this project is to build and deploy a machine learning model that can classify SMS messages as Spam or Ham (Not Spam).
The model is trained using a labeled dataset and deployed for real-world testing.
- Python
- Scikit-Learn
- Pandas, Numpy
- Natural Language Processing (NLP)
- NLTK
- Streamlit (for deployment)
- Heroku / Render (optional for web deployment)
- Dataset Source: Kaggle - SMS Spam Collection Dataset
- Description: About 5,500 SMS messages, each labeled as Spam or Ham (Not Spam).
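
As a quick illustration (separate from the project notebooks), the dataset can be loaded and inspected with Pandas. The `v1`/`v2` column names and `latin-1` encoding below are assumptions based on the typical Kaggle export of `spam.csv` and may need adjusting:

```python
import pandas as pd

# Load the raw Kaggle export; column names (v1 = label, v2 = message)
# and encoding are assumptions and may differ in your copy of spam.csv.
df = pd.read_csv("data/spam.csv", encoding="latin-1")[["v1", "v2"]]
df.columns = ["label", "message"]

print(df.shape)                    # rough size of the corpus
print(df["label"].value_counts())  # ham vs. spam distribution
```
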
- Data Cleaning
- Exploratory Data Analysis (EDA)
- Text Preprocessing (tokenization, stemming, etc.)
- Model Building (Naive Bayes, Logistic Regression, etc.)
- Vectorization (TF-IDF) and Hyperparameter Tuning (GridSearchCV); see the sketch after this list
- Model Evaluation (Accuracy, Precision, Recall, F1 Score)
- Streamlit App Development (built in PyCharm)
- Heroku Deployment
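
To make the workflow concrete, here is a minimal sketch of the preprocessing, vectorization, and model-building steps. The stemmer, TF-IDF settings, and grid values are illustrative placeholders, not the project's tuned configuration:

```python
import pandas as pd
import nltk
from nltk.corpus import stopwords
from nltk.stem.porter import PorterStemmer
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.model_selection import train_test_split, GridSearchCV
from sklearn.naive_bayes import MultinomialNB

nltk.download("punkt")
nltk.download("stopwords")

stemmer = PorterStemmer()
stop_words = set(stopwords.words("english"))

def transform_text(text):
    """Lowercase, tokenize, drop stopwords and punctuation, then stem."""
    tokens = nltk.word_tokenize(text.lower())
    tokens = [t for t in tokens if t.isalnum() and t not in stop_words]
    return " ".join(stemmer.stem(t) for t in tokens)

# Column names are assumptions based on the typical Kaggle export of spam.csv.
df = pd.read_csv("data/spam.csv", encoding="latin-1")[["v1", "v2"]]
df.columns = ["label", "message"]

X = df["message"].apply(transform_text)
y = (df["label"] == "spam").astype(int)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=2
)

# TF-IDF features over the cleaned text
vectorizer = TfidfVectorizer(max_features=3000)
X_train_tfidf = vectorizer.fit_transform(X_train)
X_test_tfidf = vectorizer.transform(X_test)

# Naive Bayes baseline with a small GridSearchCV over the smoothing parameter;
# the grid values are placeholders, not the project's actual search space.
grid = GridSearchCV(MultinomialNB(), {"alpha": [0.1, 0.5, 1.0]}, cv=5, scoring="f1")
grid.fit(X_train_tfidf, y_train)
model = grid.best_estimator_
```
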
| Metric    | Score |
|-----------|-------|
| Accuracy  | 97.9% |
| Precision | 97.5% |
| Recall    | 96%   |
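
The scores above come from the project's own evaluation. For reference, this is a minimal sketch of how such metrics can be computed with scikit-learn, continuing from the pipeline sketch above (it reuses `model`, `X_test_tfidf`, and `y_test` from that snippet):

```python
from sklearn.metrics import accuracy_score, precision_score, recall_score, f1_score

y_pred = model.predict(X_test_tfidf)

print("Accuracy :", accuracy_score(y_test, y_pred))
print("Precision:", precision_score(y_test, y_pred))
print("Recall   :", recall_score(y_test, y_pred))
print("F1 Score :", f1_score(y_test, y_pred))
```
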
The SMS Spam Detection model is deployable on Heroku and accessible online!
git clone https://github.com/BleeGleeWee/Spam-SMS-Detection.git
cd Spam-SMS-Detection
pip install -r requirements.txt
jupyter notebook spam_sms_detection.ipynb
streamlit run app.py
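
For orientation, a minimal sketch of what `app.py` might look like; it assumes the serialized `model.pkl` and `vectorizer.pkl` files listed in the project structure below, and the actual app may differ:

```python
import pickle
import streamlit as st

# Load the fitted TF-IDF vectorizer and trained classifier.
# Paths are assumptions; adjust to where the .pkl files actually live.
vectorizer = pickle.load(open("models/vectorizer.pkl", "rb"))
model = pickle.load(open("models/model.pkl", "rb"))

st.title("SMS Spam Classifier")
message = st.text_area("Enter an SMS message")

if st.button("Predict"):
    # In practice, the same text preprocessing used during training
    # (tokenization, stopword removal, stemming) should be applied first.
    features = vectorizer.transform([message])
    prediction = model.predict(features)[0]
    st.header("Spam" if prediction == 1 else "Ham (Not Spam)")
```
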
- Install Heroku CLI
- Run the following:
heroku login
heroku create spam-classifier-app
git push heroku main
- Deployed link: Here
Email/SMS-spam-classifier
│
├── data/
│   └── spam.csv                     # Original dataset (or link to download in README)
│
├── notebooks/
│   ├── 01_data_cleaning.ipynb       # Handling nulls, duplicates, formatting
│   ├── 02_eda.ipynb                 # Visualizations and exploratory analysis
│   ├── 03_text_preprocessing.ipynb  # Tokenization, stemming, stopword removal
│   ├── 04_model_building.ipynb      # Naive Bayes, Logistic Regression, etc.
│   └── 05_model_improvement.ipynb   # TF-IDF, hyperparameter tuning, evaluation
│
├── models/
│   ├── model.pkl                    # Serialized trained model (pickle)
│   └── vectorizer.pkl               # Fitted TF-IDF vectorizer (pickle)
│
├── app/
│   ├── app.py                       # App entry point
│   ├── predict.py                   # Handles input, loads model, returns prediction
│   ├── model_loader.py              # Utility to load the model
│   └── train_model.py               # Trains and serializes the model
│
├── static/
│   └── setup.sh                     # Setup script for deployment
│
├── tests/
│   └── test_predict.py              # Unit tests for prediction logic
│
├── .gitignore                       # Ignore notebook checkpoints, model files, etc.
├── Procfile                         # For Heroku: e.g., `web: gunicorn app.main:app`
├── requirements.txt                 # All dependencies (Streamlit, scikit-learn, etc.)
├── nltk.txt                         # NLTK data (stopwords, punkt)
├── README.md                        # Full documentation
└── LICENSE                          # MIT or any preferred open-source license
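
For completeness, a minimal sketch of what `tests/test_predict.py` could contain. The `predict(text)` helper and its return values are assumptions made for illustration, not the project's actual API:

```python
# Hypothetical tests: they assume app/predict.py exposes a predict(text) -> str
# helper returning "spam" or "ham". Adjust to the project's real interface.
from app.predict import predict

def test_obvious_spam_is_flagged():
    result = predict("WINNER!! Claim your FREE prize now, reply to this number")
    assert result == "spam"

def test_plain_message_is_ham():
    result = predict("Are we still meeting for lunch tomorrow?")
    assert result == "ham"
```
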