KPMG Data Analytics Consulting Virtual Internship
Updated Dec 31, 2021 - Jupyter Notebook
SQL-based data profiling and data quality checks that you can run against a SQL database at both the table level and the database level.
Ramblings of a curious mind
The dbt Datasphere Plugin integrates multiple open-source data quality frameworks into your dbt projects. It unifies Soda SQL, Great Expectations, and Datafold behind a single interface for configuring and running data quality checks.
Run greatexpectations.io on any SQL engine through a REST API. Built with FastAPI, Pydantic, and SQLAlchemy.
Data quality validations for PySpark DataFrames
Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.
🦆 Blazing-fast and highly customizable GitHub Action to set up a DuckDB runtime
This project extracts and cleans raw YouTube data from Excel/CSV files (Kaggle) using SQL and identifies the top-performing UK-based YouTubers. Data Stack: Excel | Microsoft SQL Server | Power BI
An Apache Airflow data pipeline that performs ELT operations using Amazon S3 and Amazon Redshift Serverless.
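Several of the projects listed here revolve around table-level profiling: counting rows, nulls, and distinct values per column. As a rough illustration of that idea only, here is a minimal sketch using Python's standard `sqlite3` module. It is not taken from any of the repositories above; the function name `profile_table` and the `customers` table are hypothetical, and the table name is assumed to come from trusted input because it is interpolated into the SQL text.

```python
import sqlite3


def profile_table(conn, table):
    """Return the row count plus per-column null and distinct counts.

    Hypothetical helper for illustration; `table` must be a trusted name
    because it is interpolated into the SQL statements below.
    """
    cur = conn.cursor()
    # PRAGMA table_info yields one row per column; index 1 is the column name.
    columns = [row[1] for row in cur.execute(f'PRAGMA table_info("{table}")')]
    total = cur.execute(f'SELECT COUNT(*) FROM "{table}"').fetchone()[0]

    report = {"row_count": total, "columns": {}}
    for col in columns:
        # In SQLite, "col IS NULL" evaluates to 0 or 1, so SUM counts nulls.
        # COUNT(DISTINCT ...) ignores NULL values.
        nulls, distinct = cur.execute(
            f'SELECT SUM("{col}" IS NULL), COUNT(DISTINCT "{col}") FROM "{table}"'
        ).fetchone()
        # SUM returns NULL on an empty table, so fall back to 0.
        report["columns"][col] = {"nulls": nulls or 0, "distinct": distinct}
    return report


# Small in-memory example with one missing email and one repeated id.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER, email TEXT)")
conn.executemany(
    "INSERT INTO customers VALUES (?, ?)",
    [(1, "a@example.com"), (2, None), (2, "a@example.com")],
)
report = profile_table(conn, "customers")
print(report)
```

A report like this can feed simple quality rules, for example flagging a column whose null count is above a threshold or an identifier column whose distinct count is lower than the row count.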