Skip to content
Change the repository type filter

All

    Repositories list

    • deequ

      Public
      Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
      Scala
      Apache License 2.0
      536101Updated Jul 24, 2024Jul 24, 2024
    • airflow

      Public
      Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
      Python
      Apache License 2.0
      14k100Updated Jun 17, 2024Jun 17, 2024
    • yap-csv

      Public
      Node.js package that manages CSV to JSON conversion supporting complex field mapping
      JavaScript
      Apache License 2.0
      263026Updated Apr 16, 2024Apr 16, 2024
    • Serve your fastText models for text classification and word vectors
      Python
      Apache License 2.0
      112300Updated Feb 12, 2024Feb 12, 2024
    • Contains resources that can be exposed to different applications
      CSS
      0000Updated Dec 28, 2023Dec 28, 2023
    • delta

      Public
      An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
      HTML
      Apache License 2.0
      1.7k001Updated Dec 12, 2023Dec 12, 2023
    • Python API for Deequ
      Python
      Apache License 2.0
      134001Updated Dec 5, 2023Dec 5, 2023
    • docs

      Public
      Guidelines & Best Practices documentation shared by Nielsen with the Open Source community
      JavaScript
      Apache License 2.0
      7401126Updated Feb 8, 2023Feb 8, 2023
    • superset

      Public
      Apache Superset is a Data Visualization and Data Exploration Platform
      TypeScript
      Apache License 2.0
      14k3011Updated Jan 20, 2023Jan 20, 2023
    • superset-viz-plugins

      Public template
      TypeScript
      173310Updated Mar 8, 2022Mar 8, 2022
    • Python
      Other
      0000Updated Aug 22, 2021Aug 22, 2021
    • Python
      Other
      0700Updated Jun 25, 2021Jun 25, 2021
    • 0000Updated Jun 1, 2021Jun 1, 2021
    • A reverse proxy that provides authentication with Google, Github or other providers.
      Go
      MIT License
      1.6k100Updated Apr 3, 2021Apr 3, 2021
    • Apache Superset UI packages
      TypeScript
      Apache License 2.0
      273000Updated Feb 7, 2021Feb 7, 2021
    • A load balancer / proxy / gateway for prestodb
      JavaScript
      Apache License 2.0
      156000Updated Jan 27, 2021Jan 27, 2021
    • VS Code in the browser
      TypeScript
      MIT License
      5.6k000Updated Dec 7, 2020Dec 7, 2020
    • presto

      Public
      Home of the community managed version of Presto, the distributed SQL query engine for big data, under the auspices of the Presto Software Foundation.
      Java
      Apache License 2.0
      3k000Updated Nov 20, 2020Nov 20, 2020
    • Helm Chart & Documentation for deploying JupyterHub on Kubernetes
      Python
      Other
      796000Updated Nov 9, 2020Nov 9, 2020
    • JupyterHub proxy implementation with traefik
      Jupyter Notebook
      BSD 3-Clause "New" or "Revised" License
      29000Updated Nov 7, 2020Nov 7, 2020
    • Node.js CLI tool to manage a pact manifest
      JavaScript
      MIT License
      1200Updated Oct 7, 2018Oct 7, 2018
    • Pact Manifest is a Node.js library to publish pact contracts to a broker using a manifest
      JavaScript
      Apache License 2.0
      1400Updated Sep 16, 2018Sep 16, 2018
    • Isolate (reuse and share) React-Redux-Containers (Components)
      Apache License 2.0
      02000Updated Sep 13, 2018Sep 13, 2018