Dynamic Risk Assessment System

This repository holds the fourth project completed towards Udacity's Machine Learning DevOps Engineer Nanodegree

Data ingestion: Automatically check a database for new data that can be used for model training. Compile all training data to a training dataset and save it to persistent storage. Write metrics related to the completed data ingestion tasks to persistent storage.
Training, scoring, and deploying: Write scripts that train an ML model that predicts attrition risk, and score the model. Write the model and the scoring metrics to persistent storage.
Diagnostics: Determine and save summary statistics related to a dataset. Time the performance of model training and scoring scripts. Check for dependency changes and package updates.
Reporting: Automatically generate plots and documents that report on model metrics. Provide an API endpoint that can return model predictions and metrics.
Process Automation: Create a script and cron job that automatically run all previous steps at regular intervals.

The project can be run by running the fullprocess.py file using the below commands in the project's root directory

pip install -r requirements.txt # to install all requirements in the environment

python src/app.py # To start the app

python src/fullprocess.py # To run all steps at once

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
ingesteddata		ingesteddata
models		models
practicedata		practicedata
practicemodels		practicemodels
production_deployment		production_deployment
sourcedata		sourcedata
src		src
testdata		testdata
.gitignore		.gitignore
README.md		README.md
config.json		config.json
cronjob.txt		cronjob.txt
requirements.txt		requirements.txt

Provide feedback