This project uses SQL to analyze a real-world employees database, extract the most useful information from the dataset, pre-process the data using Python for improved performance and uses structured query language to retrieve useful information from the database.
The repository consists of
- Uncleaned Dataset
- MODULE 1 containing the dataset I cleaned and stored in the form of a database table.
- MODULE 2 containing SQL queries that I created to analyse data
- Image gallery of project
This Module involves working on handling null values, deletion or transformation of irrelevant values, data type transformation, removing duplicates and data validations. Objective of this task is to get a refined and cleaner data set for further analysis.
This module involves working on performing data analysis on the pre-processed data from the previous module and conducting Data Analysis using SQL. Objective of this task is to generate queries for given problem statements.
I would like to thank HiCounselor for arranging this real-world data-set and a sandbox to work online.