Blood-Donation-Prediction

This predicts that will a previous donor donate blood in a given month or not.

Description

Blood transfusion saves lives - from replacing lost blood during major surgery or a serious injury to treating various illnesses and blood disorders. Ensuring that there's enough blood in supply whenever needed is a serious challenge for the health professionals. According to WebMD, "about 5 million Americans need a blood transfusion every year".

Our dataset is from a mobile blood donation vehicle in Taiwan. The Blood Transfusion Service Center drives to different universities and collects blood as part of a blood drive. We want to predict whether or not a donor will give blood the next time the vehicle comes to campus.

The data is stored in transfusion.data and it is structured according to RFMTC marketing model (a variation of RFM). We'll explore what that means later in this notebook. First, let's inspect the data.

About The Dataset

Let's briefly return to our discussion of RFM model. RFM stands for Recency, Frequency and Monetary Value and it is commonly used in marketing for identifying your best customers. In our case, our customers are blood donors.

RFMTC is a variation of the RFM model. Below is a description of what each column means in our dataset:

R (Recency - months since the last donation) F (Frequency - total number of donation) M (Monetary - total blood donated in c.c.) T (Time - months since the first donation) a binary variable representing whether he/she donated blood in March 2007 (1 stands for donating blood; 0 stands for not donating blood) It looks like every column in our DataFrame has the numeric type, which is exactly what we want when building a machine learning model. Let's verify our hypothesis.

Selection Model Using T-POT Library

TPOT is a Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.

TPOT will automatically explore hundreds of possible pipelines to find the best one for our dataset. Note, the outcome of this search will be a scikit-learn pipeline, meaning it will include any pre-processing steps as well as the model.

We are using TPOT to help us zero in on one model that we can then explore and optimize further.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
BloodPrediction_Visualisation.ipynb		BloodPrediction_Visualisation.ipynb
README.md		README.md
Report_PredictBloodDonation.docx		Report_PredictBloodDonation.docx
Report_PredictBloodDonation.pdf		Report_PredictBloodDonation.pdf
notebook.ipynb		notebook.ipynb
tpot-ml-pipeline.png		tpot-ml-pipeline.png
transfusion.data		transfusion.data

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Blood-Donation-Prediction

Description

About The Dataset

Selection Model Using T-POT Library

About

Languages

subsdevops/Blood-Donation-Prediction

Folders and files

Latest commit

History

Repository files navigation

Blood-Donation-Prediction

Description

About The Dataset

Selection Model Using T-POT Library

About

Topics

Resources

Stars

Watchers

Forks

Languages