Ingest data from https://data.cityofnewyork.us/Health/DOHMH-New-York-City-Restaurant-Inspection-Results/43nn-pn8j and stage data to S3
Create schema in Postgres
Read CSV-File from S3
Sanitize/transform
Save data into Fact/Dimension tables in Postgres
A remote PostgresDB hosted at elephantsql.com is used.
The password for PostgresDB and S3 needs to be plugged into this python file: src/main/python/config_framework.py (Line 7 and Line 9)
cd src/main/scripts
./run_driver.sh
The Airflow script is unfinished and under construction