Simple, yet cool project.
This repository is used to experiment with data-driven technologies and to test new data service providers.
Leveraging CloudAMQP's free RabbitMQ plan, a data ingestion layer was built: a Python producer sends fake data through a RabbitMQ queue, and a consumer listens to that queue and writes the incoming messages into Parquet files.
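A minimal sketch of what the producer and consumer could look like, using pika, Faker, and pyarrow. The connection URL, queue name, record fields, and batch size below are illustrative placeholders, not values taken from this repository.

```python
import json

import pika
import pyarrow as pa
import pyarrow.parquet as pq
from faker import Faker

# Placeholders: replace with the URL from your CloudAMQP console and your own queue name.
AMQP_URL = "amqps://user:password@host.cloudamqp.com/vhost"
QUEUE = "ingest.fake_data"
BATCH_SIZE = 100

fake = Faker()


def publish_fake_records(n: int = 1000) -> None:
    """Producer side: push n fake JSON records onto the RabbitMQ queue."""
    connection = pika.BlockingConnection(pika.URLParameters(AMQP_URL))
    channel = connection.channel()
    channel.queue_declare(queue=QUEUE, durable=True)
    for _ in range(n):
        record = {"name": fake.name(), "email": fake.email(), "created_at": fake.iso8601()}
        channel.basic_publish(exchange="", routing_key=QUEUE, body=json.dumps(record))
    connection.close()


def consume_to_parquet() -> None:
    """Consumer side: listen on the queue and flush messages to Parquet in batches."""
    buffer: list[dict] = []
    batches_written = 0

    def on_message(ch, method, properties, body):
        nonlocal batches_written
        buffer.append(json.loads(body))
        if len(buffer) >= BATCH_SIZE:
            table = pa.Table.from_pylist(buffer)
            pq.write_table(table, f"data/batch_{batches_written:05d}.parquet")
            batches_written += 1
            buffer.clear()

    connection = pika.BlockingConnection(pika.URLParameters(AMQP_URL))
    channel = connection.channel()
    channel.queue_declare(queue=QUEUE, durable=True)
    channel.basic_consume(queue=QUEUE, on_message_callback=on_message, auto_ack=True)
    channel.start_consuming()  # blocks until interrupted (Ctrl+C)
```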
After that, DuckDB is used to analyze the Parquet files.
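For the analysis step, a query like the one below could be run directly against the Parquet output. The file path and column names are assumptions that match the sketch above, not the repository's actual schema.

```python
import duckdb

con = duckdb.connect()

# DuckDB can query a whole directory of Parquet files via a glob pattern.
result = con.execute(
    """
    SELECT COUNT(*)              AS total_records,
           COUNT(DISTINCT email) AS unique_emails
    FROM read_parquet('data/*.parquet')
    """
).df()

print(result)
```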
Planned next steps:
Create more queues in RabbitMQ and simulate a live data platform ingestion scenario.
Implement ETL processes with Apache Beam streaming pipelines (see the sketch after this list).
Use dbt, Airflow and Streamlit for data processing, orchestration, and live-streaming data to dashboards.
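For the Apache Beam item, here is a rough sketch of a windowed transform chain. Note that Beam's Python SDK has no built-in RabbitMQ connector, so `beam.Create` stands in for a real unbounded source here, and the record fields follow the illustrative schema used earlier; a real streaming version would swap in an unbounded source and run with streaming pipeline options.

```python
import json

import apache_beam as beam
from apache_beam.transforms.window import FixedWindows


def run():
    # Stand-in input; a streaming deployment would read from an unbounded source instead.
    sample = [
        '{"name": "Ada", "email": "ada@example.com"}',
        '{"name": "Bob", "email": "bob@example.org"}',
    ]
    with beam.Pipeline() as p:
        (
            p
            | "Read" >> beam.Create(sample)
            | "Parse" >> beam.Map(json.loads)
            | "KeyByDomain" >> beam.Map(lambda r: (r["email"].split("@")[-1], 1))
            | "Window" >> beam.WindowInto(FixedWindows(60))  # 60-second windows
            | "CountPerDomain" >> beam.CombinePerKey(sum)
            | "Print" >> beam.Map(print)
        )


if __name__ == "__main__":
    run()
```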