Pull request accepted by RedHat into main repo on 06/11/2020.
- Michael Clifford, RedHat Inc.
- Anand Sanmukhani, RedHat Inc.
- Yousif Khaireddin [email protected]
- Zhuofa (Marco) Chen [email protected]
- Krishna Palle [email protected]
- Fangya Xu [email protected]
- Nihar Dwivedi [email protected]
- Ojasvi Jhamb [email protected]
Click here for a comprehensive Project Report
Our GitHub repo containing our Prometheus Anomaly Detector can be found here
The Final presentation video for our project can be found here.
The slides for this presentation can be found here.
-
Understanding the current ML models (SARIMA and Prophet) and workings of Prometheus Anomaly Detection (PAD), an existing open-source project developed by the AI team at RedHat to analyze time-series data generated by cloud infrastructure
-
Developing and deploying a new ML model (LSTM) onto DataHub, a blueprint for building an AI application as a service platform on the OpenShift Container Platform.
-
Testing the workings of DataHub on the MOC and reporting any problems for future ramifications.
-
The open-source community
-
MOC users that want to analyze time-series data generated by cloud infrastructure and send alerts for potential anomalies
-
Site Reliability Engineers that would like to monitor the health of their applications
-
Ensuring the deployment and functionality of DataHub on the MOC
-
Extension of the current Prometheus Anomaly Detection tool with a new model(s)
-
Exploring/reporting any issues and bugs we encounter along with the project for future ramifications
-
Installing OpenShift client and datahub on our locally for familiarization
-
Developing and testing PAD ML algorithms on JupiterHub
-
Testing the current models on JupyterHub
-
Building a simple LSTM model on JupyterHub
-
Experimenting with different architectures, optimizers, and layers in our model
-
Automating the tuning of our model
-
Comparing our model’s performance with that of the current deployment
-
Deploying our models onto the DataHub
Developing at least one new ML model (LSTM) and having it monitor any metric while running on DataHub.
-
Project familiarization (02/05)
-
Deploy DataHub onto MOC (03/01)
-
Set up Data Hub on Openshift (03/15)
-
Extend Prometheus with a new model(s) using Jupyter notebooks (04/15)
-
Test the models on Prometheus using data provided by the mentor (05/01)
-
Migrate built ML Algorithms and models onto DataHub (05/05)