assamese/restaurant-instections

Restaurant Inspection Data Pipeline

Ingest the DOHMH New York City Restaurant Inspection Results dataset from https://data.cityofnewyork.us/Health/DOHMH-New-York-City-Restaurant-Inspection-Results/43nn-pn8j and stage it in S3

Create the schema in Postgres

Read the CSV file from S3

Sanitize and transform the data

Load the data into fact and dimension tables in Postgres
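
The schema, sanitize, and load steps above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: it uses an in-memory SQLite database as a stand-in for Postgres so that it runs without credentials, the table names dim_restaurant and fact_inspection and the functions sanitize and load are hypothetical, the sample rows are made up, and the CSV column names are assumed to match the public dataset's export and should be checked against the real file.

```python
import csv
import io
import sqlite3
from datetime import datetime

# Made-up sample rows for illustration only. The column names are an
# assumption about the dataset's CSV export.
SAMPLE_CSV = """CAMIS,DBA,BORO,CUISINE DESCRIPTION,INSPECTION DATE,SCORE,GRADE
1001,  joe's pizza ,Manhattan,Pizza,03/14/2019,12,A
1001,  joe's pizza ,Manhattan,Pizza,09/02/2019,,
1002,Noodle Bar,Queens,Chinese,01/01/1900,,
"""


def sanitize(row):
    """Return a cleaned copy of one CSV row, or None if the row is skipped."""
    inspected = datetime.strptime(row["INSPECTION DATE"], "%m/%d/%Y").date()
    if inspected.year == 1900:
        # Assumption: 01/01/1900 marks a restaurant with no inspection yet.
        return None
    score = row["SCORE"].strip()
    return {
        "camis": int(row["CAMIS"]),
        "name": row["DBA"].strip().upper(),
        "boro": row["BORO"].strip(),
        "cuisine": row["CUISINE DESCRIPTION"].strip(),
        "inspection_date": inspected.isoformat(),
        "score": int(score) if score else None,
        "grade": row["GRADE"].strip() or None,
    }


def load(conn, csv_text):
    """Create the schema, then load cleaned rows into dimension and fact tables."""
    conn.executescript(
        """
        CREATE TABLE IF NOT EXISTS dim_restaurant (
            camis INTEGER PRIMARY KEY, name TEXT, boro TEXT, cuisine TEXT);
        CREATE TABLE IF NOT EXISTS fact_inspection (
            camis INTEGER REFERENCES dim_restaurant (camis),
            inspection_date TEXT, score INTEGER, grade TEXT);
        """
    )
    for raw in csv.DictReader(io.StringIO(csv_text)):
        row = sanitize(raw)
        if row is None:
            continue
        # One dimension row per restaurant; repeat inspections reuse it.
        conn.execute(
            "INSERT OR IGNORE INTO dim_restaurant "
            "VALUES (:camis, :name, :boro, :cuisine)", row)
        conn.execute(
            "INSERT INTO fact_inspection "
            "VALUES (:camis, :inspection_date, :score, :grade)", row)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(conn, SAMPLE_CSV)
print(conn.execute("SELECT COUNT(*) FROM fact_inspection").fetchone()[0])
```

Keeping restaurant attributes in a dimension table keyed by CAMIS means each inspection row in the fact table only carries the key plus its own measurements. Against Postgres the same idea would need a Postgres driver and its own placeholder and upsert syntax.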

Configuration

The pipeline uses a remote PostgreSQL database hosted at elephantsql.com.

The passwords for the PostgreSQL database and for S3 must be entered in this Python file: src/main/python/config_framework.py (lines 7 and 9)
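
As an alternative to typing secrets into a source file, a config module could read them from environment variables. The sketch below is only an illustration of that idea and does not reflect the actual contents of config_framework.py. The names PipelineConfig, load_config, PG_PASSWORD, and S3_SECRET are hypothetical.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class PipelineConfig:
    """Secrets needed by the pipeline (hypothetical field names)."""
    postgres_password: str
    s3_secret: str


def load_config(env=None):
    """Build the config from environment variables (hypothetical names).

    Raises RuntimeError listing every missing setting, so a misconfigured
    run fails before any data is moved.
    """
    env = os.environ if env is None else env
    missing = [key for key in ("PG_PASSWORD", "S3_SECRET") if not env.get(key)]
    if missing:
        raise RuntimeError("missing required settings: " + ", ".join(missing))
    return PipelineConfig(
        postgres_password=env["PG_PASSWORD"],
        s3_secret=env["S3_SECRET"],
    )
```

With this approach the values would be exported in the shell before starting the pipeline, and nothing secret would need to be committed to the repository.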

Usage

cd src/main/scripts
./run_driver.sh

Airflow

The Airflow script is unfinished and still under construction.

License

MIT
