Detect malicious URLs using machine learning models

Environment tips:

This project runs under python3.11. When you install lightgbm on macOS, there will be a problem as you need gcc to complie the package. Here is the instruction to install lightgbm on your macOS.

Description:

features_extraction.py is used to extract 31 features, including general features, length features, count features, ratio features and domain features as shown in the features table.
model training.py is used to train different ML models and draw the heapmap.

The datasets we collected from:

Kaggle
UNB
URLhaus
Mendeley

The applied models are:

Logistic, KNN, SVM, Decision Trees, Random Forest, Bagging, and AdaBoosting

Contributors:

xinyanzhang27

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Detect malicious URLs using machine learning models

Environment tips:

Description:

The datasets we collected from:

The applied models are:

Contributors:

Files

README.md

Latest commit

History

README.md

File metadata and controls

Detect malicious URLs using machine learning models

Environment tips:

Description:

The datasets we collected from:

The applied models are:

Contributors: