Curated list of tools and frameworks assisting in monitoring data quality

kwanUm/awesome-data-quality

awesome-data-quality

A curated list of awesome tools for testing and monitoring data quality - typically at the data warehouse/lake or within running data pipelines.

If you want to contribute to this list (please do), send me a pull request or contact me.

Table of Contents

TBD

Frameworks and Libraries

Open source

Geared for ML
  • deepchecks - a tool for validating machine learning models and data. It includes test suites tailored to ML models, their datasets, and their outputs.
  • evidently - analyzes and tracks the quality of data and ML model outputs.
Pipelines with data quality included
  • dbt, dataform - ELT tools that come with a handy utility for defining tests as SQL queries.
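
To illustrate the "tests as SQL queries" idea, here is a minimal sketch in plain Python with the standard-library `sqlite3` module. It does not use dbt or dataform; it only mimics the common convention that a test query selects the rows violating an expectation, and the test passes when the query returns no rows. The `orders` table, the test names, and the `run_sql_test` helper are hypothetical and exist only for this example.

```python
import sqlite3


def run_sql_test(conn, name, failing_rows_query):
    """Run a query that selects violating rows; the test passes when none come back."""
    rows = conn.execute(failing_rows_query).fetchall()
    return {"test": name, "passed": len(rows) == 0, "failures": rows}


# Hypothetical sample data: one negative amount and one missing amount.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?)",
    [(1, 10.0), (2, -5.0), (3, None)],
)

# Each test is just a SQL query describing what a bad row looks like.
results = [
    run_sql_test(conn, "amount_not_null",
                 "SELECT id FROM orders WHERE amount IS NULL"),
    run_sql_test(conn, "amount_non_negative",
                 "SELECT id FROM orders WHERE amount < 0"),
    run_sql_test(conn, "id_unique",
                 "SELECT id FROM orders GROUP BY id HAVING COUNT(*) > 1"),
]

for result in results:
    print(result)
```

Writing each check as a query for failing rows keeps the offending records available for debugging, rather than reducing the check to a bare pass/fail flag.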

Paid

Offerings range from data testing to pipeline testing, with a focus on real-time monitoring, automated test creation and threshold setting, and additional enterprise features.

TODOs

  • Add tools for unstructured data (Arthur, Robust)
