This project aims to analyze and predict customer preferences for communication channels using data from various platforms. The primary goal is to understand and predict how customers interact with different communication channels for diffrent message types (promotional & trigger), including Email, SMS, and Push Notifications.
The project focuses on three types of communication platforms:
- Epsilon
- Bluecore
- SendGrid
- Attentive
- Airship (Data still not available in snowflake)
The project follows a structured workflow to ensure comprehensive analysis and modeling:
-
Understanding Communication Platforms:
- Gathered insights into the functionalities and data provided by each communication platform.
-
Data Extraction:
- Extracted data from the identified communication platforms.
-
Data Processing:
- Raw Data Processing: Cleaned and preprocess the raw data for analysis.
- Exploratory Data Analysis (EDA): Performed EDA to understand data distributions, trends, and patterns.
-
Data Transformation:
- Transformed raw data to prepare it for analysis.
-
Analysis:
- Conducted various analyses, including:
- Univariate Analysis: Analyze individual variables.
- Multivariate Analysis: Examine relationships between multiple variables.
- Categorical Analysis: Focus on categorical variables.
- Conducted various analyses, including:
-
Handling Missing Values:
- Implemented strategies to address missing data in the dataset.
-
Outlier Detection:
- Identified and handle outliers in the data.
-
Feature Engineering:
- Created new features based on existing data to improve model performance.
-
Correlation Check and Feature Selection:
- Analyzed feature correlations and select relevant features for modeling.
-
Modeling:
- Began with Unsupervised Learning techniques to identify patterns and clusters in the data.
- Progress to Supervised Learning, experimenting with various models including:
- Decision Trees (DT)
- Random Forest (RF)
- LightGBM (LGBM)
- XGBoost (XGB)
- Other ensemble-based models.
-
Model Score and Evaluation:
- Analyzed and Evaluated Model result
- Python
- Pandas
- NumPy
- Scikit-learn
- Matplotlib
- Seaborn
Contributions are welcome! Please feel free to submit a pull request or open an issue if you have suggestions or improvements.
- Epsilon
- Bluecore
- SendGrid
- Attentive
- Airship