Skip to content

This project aims to analyze the determinants for a woman being the primary source of revenue in a household using data from the "Conditions de Travail 2013" survey. The analysis will be focused on individuals living in couples in the same household. The project uses python along with multiple data science and statistics libraries. and

License

Notifications You must be signed in to change notification settings

Medkallel/Logistic-Regression-Analysis-Determinants-of-Women-as-Primary-Source-of-Revenue

Repository files navigation

👩‍🔬 Logistic Regression Analysis: Determinants of Women as Primary Source of Revenue

banner

Table of Contents


Technologies Used

Python Pandas NumPy Statsmodels Matplotlib Plotly Jupyter Notebook


Description

This project aims to analyze the determinants for a woman being the primary source of revenue in a household using data from the "Conditions de Travail 2013" survey. The analysis will be focused on individuals living in couples in the same household.

Objectives

The objective of the project is to analyze, based on the 2013 Working Conditions survey, the determinants for a woman being the primary source of revenue in the household.

  1. Data Exploration & Management: Explore the dataset and the available data and handle missing data.
  2. Feature Selection: The dataset presents 541 Columns/Features. A selection needs to be made to be able to conduct further analysis.
  3. Build and evaluate a logistic regression model to identify significant predictors of a women being the primary source of revenue in the household.

Dataset

  • Source: Conditions de Travail 2013 (Volet "Actif")
  • Initial Dataset Size: 33,673 individuals
  • Filtered Dataset: Individuals living in couples at the same household

Analysis Results

The results are in a presentation in the following link: Presentation Link

The presentation is also available as the pdf:

Results_Analysis.pdf

Installation

Important

The project was done on Python 3.11.6

To run this project locally, follow these steps:

  1. Clone the repository:
# Clone the repository
git clone https://github.com/Medkallel/Logistic-Regression-Analysis
# Navigate into the directory
cd Logistic-Regression-Analysis
  1. Install the required dependencies:
# Install the requirements
pip install -r requirements.txt

Usage

The analysis data is already available in the notebook but you can run it all

Tip

The notebook could take up to 15 minutes to run. Be patient :)


Project Structure

Here's a visual representation of the structure:
📦Project
 ┣ 📁Data/
   ┗ 🗃️individus_ct2013.sas7bdat
 ┣ 📁Doc/
 ┃ ┗ 📄Dictionnary_of_Variables.pdf
 ┣ 🐍Data_Analysis_Notebook.ipynb
 ┣ 📄requirements.txt
 ┣ 📄Results_Analysis.pdf
 ┗ 📄README.md

📫 Contact me

LinkedIn


Collaborators

This project is the result of the collaborative efforts of a diverse and talented team. Each member has contributed significantly to different aspects of the project, from initial research and data collection to final analysis and presentation.

  • Jean Christophe Rigoni
  • Simon Pierre Rodner

License

This project is under the CC BY-NC 4.0 License. Check the licence file for more info.
License: CC BY-NC 4.0

About

This project aims to analyze the determinants for a woman being the primary source of revenue in a household using data from the "Conditions de Travail 2013" survey. The analysis will be focused on individuals living in couples in the same household. The project uses python along with multiple data science and statistics libraries. and

Topics

Resources

License

Stars

Watchers

Forks