Skip to content

Implementation of the replicated SeeDB to visualize and extract knowledge from the 'Mental Health in Tech' dataset. Includes Exploratory Data Analysis and various pruning techniques for optimization.

Notifications You must be signed in to change notification settings

AyusheGangal/Mental-Health-in-Tech

Repository files navigation

Mental-Health-in-Tech

Implementation of the replicated SeeDB to visualize and extract knowledge from the 'Mental Health in Tech' dataset.

This is the readme file for implementing the extension of SeeDB, and it contains the instructions to run the extension iPython notebook. Prerequisites before running the code:

  1. Python 3.7
  2. Jupyter Notebook
  3. PostgreSQL
  4. Psycopg2

Follow the below steps to run the extension iPython notebook:

  1. Download the 'Mental Health in Tech' dataset from https://www.kaggle.com/datasets/osmi/mental-health-in-tech-survey.
  2. Ensure that the PostgreSQL Database is running and the files are in the correct database.
  3. Run the SQL queries given in the “Mental-Health-in-Tech/SQLQueries_extension.sql” file. Properly update the file paths in this SQL file.
  4. Run the extension iPython notebook.

File paths need to be cautiously and correctly set in the functions to run the iPython notebook “Mental-Health-in-Tech/main_extension.ipynb” successfully:

  1. Give the file path where the original unclean dataset is downloaded to import it using the read_csv() function.
  2. Give the correct file paths while loading the table's data from the dataset using the SQL queries and importing the tables after obtaining the desired target and reference views.
  3. Give the folder's file path where the cleaned dataset is imported in the split_csv() function.

About

Implementation of the replicated SeeDB to visualize and extract knowledge from the 'Mental Health in Tech' dataset. Includes Exploratory Data Analysis and various pruning techniques for optimization.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published