Repositories list
23 repositories
herd-mdl (Public)
Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.

MegaSparkDiff (Public)
A Spark-based data comparison tool at scale that helps software development engineers compare a plethora of pair combinations of possible data sources. Multiple execution modes in multiple environments enable the user to generate a diff report as a Java/Scala-friendly DataFrame or as a file for future use. Comes with out of the box Spa…

model-validation-toolkit (Public)
Model Validation Toolkit is a collection of tools to assist with validating machine learning models prior to deploying them to production and monitoring them after deployment to production.

finraos.github.com (Public)

MLiy (Public)
MLiy (pronounced “Emily”) is a machine-learning platform that allows data scientists to provision and manage processing power in the cloud. It provides an easy-to-use website to install customizable sets of machine learning software for use in data analysis and exploration. This allows data scientists to focus on data analysis rather than how to…

herd (Public)
Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabytes of data and make it accessible for data processing and analytical purposes by any cloud compute platform.

CodeSamples (Public)

DataGenerator (Public)
DataGenerator is a Java library for systematically producing large volumes of data. DataGenerator frames data production as a modeling problem, with a user providing a model of dependencies among variables and the library traversing the model to produce relevant data sets.

jenkins-build-collector (Public)

MSL (Public)

aphelion (Public)

yum-nginx-api (Public)
yum-nginx-api is a Go API for uploading RPMs to yum repositories and configurations for running NGINX to serve them. It is a deployable solution with Docker or a single 8MB statically linked Linux binary. yum-nginx-api enables CI tools to be used for uploading RPMs and managing yum repositories.

JTAF-ExtWebDriver (Public archive)
Extensions for WebDriver is an enhancement to the powerful WebDriver API, with robust features that keep your browser automation running smoothly. It includes a widget library, improved session management and extended functions over the existing WebDriver API.

HiveQLUnit (Public archive)
Test your Hive scripts inside your favorite IDE with HiveQLUnit! Increase your developers' productivity by testing on all operating systems including Windows, Linux and Mac OSX. Build continuous integration and delivery tests to control the releases of your big data products.

JTAF-XCore (Public archive)

CTGrazer (Public archive)

karma-msl (Public)

Elasticd (Public archive)

UMD-Bitcamp-2015 (Public archive)