Skip to content

This project analyze licenses issued by DCWP to businesses and individuals so that they may legally operate in New York City.

Notifications You must be signed in to change notification settings

Lucy0906/NYC-Open-Data-Business-ETL

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

44 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

CIS9440 HW1&2

HW1 Data Sourcing

DataSource

Here is my data source: https://data.cityofnewyork.us/Business/Legally-Operating-Businesses/w7w3-xahh/about_data This link directs to the NYC Open Data portal where the dataset can be accessed directly. Data Provided By Department of Consumer and Worker Protection (DCWP)

Metadata

This dataset features licenses issued by DCWP to businesses and individuals so that they may legally operate in New York City.

Datasize

This data has 281K rows 27 Columns and each row is aDCA-Issued License

Description

This dataset reflects data as of 7/21/2023. The Department of Consumer and Worker Protection (DCWP) is working on an updated version of this dataset. This dataset features licenses issued by DCWP to businesses and individuals so that they may legally operate in New York City. This dataset is maintained by the City of New York and contains comprehensive information about businesses that are legally licensed to operate within the city limits. It includes details such as business names, addresses, industry types, license numbers, and status.

Data dictionary

Here is the data dictionary link: https://data.cityofnewyork.us/Business/Legally-Operating-Businesses/w7w3-xahh/about_data

HW1 Storage

I use Azure Blob Storage to store data.

HW1 Data Modeling

I use supabase to create the following diagram. Dimensional modeling for DCWP data involves creating a structure that facilitates analysis and reporting. This includes defining dimensions such as business type and date. image

HW2 Transformation

I use ETL tools to do the transformation and creat the data mapping.

HW2 Data Modeling

I use supabase to create the following diagram. Dimensional modeling for DCWP data involves creating a structure that facilitates analysis and reporting. This includes defining dimensions such as business type and date. image

HW2 Serving Data

I use the tableau to do data visualization. Visualizations:https://public.tableau.com/app/profile/lu.chen2788/viz/HW1_17156589017020/Dashboard1?publish=yes

About

This project analyze licenses issued by DCWP to businesses and individuals so that they may legally operate in New York City.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published