The German Bundesnetzagentur (Federal Network Agency) publishes data on the German electricity market. This project downloads and analyzes that data.
- Bundesnetzagentur Download-Center: https://www.smard.de/home/downloadcenter/download-marktdaten
- SMARD API: https://smard.api.bund.dev
- conda create -n smard python=3.8
- conda activate smard
- pip install -r requirements.txt
Run ./download_all.py to download the data, then run ./merge_parquet.py or ./merge_parquet_partitioned.py to merge it into a single Parquet file. The latter creates a partitioned Parquet file, which DuckDB does not support at the moment but Spark does.
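For orientation, the download step can be sketched roughly as follows. This is not the code of download_all.py. It is a minimal illustration that assumes the URL layout given in the public SMARD API description linked above (an index of start timestamps per filter, region and resolution, plus one JSON file per timestamp). The helper names, the filter id 410 (assumed to be total grid load), the region `DE` and the resolution `hour` are illustrative assumptions and should be checked against the API description.

```python
import json
from urllib.request import urlopen

# Assumed base URL and path layout, based on the public SMARD API description.
BASE_URL = "https://www.smard.de/app/chart_data"


def index_url(filter_id: int, region: str, resolution: str) -> str:
    """Build the URL of the index that lists the available start timestamps."""
    return f"{BASE_URL}/{filter_id}/{region}/index_{resolution}.json"


def series_url(filter_id: int, region: str, resolution: str, timestamp: int) -> str:
    """Build the URL of the time series chunk that starts at `timestamp` (ms since epoch)."""
    return (
        f"{BASE_URL}/{filter_id}/{region}/"
        f"{filter_id}_{region}_{resolution}_{timestamp}.json"
    )


def fetch_json(url: str, timeout: float = 30.0) -> dict:
    """Download a URL and parse the response body as JSON."""
    with urlopen(url, timeout=timeout) as response:
        return json.load(response)


def fetch_latest_series(filter_id: int = 410, region: str = "DE", resolution: str = "hour"):
    """Fetch the most recent chunk; the keys "timestamps" and "series" are assumptions."""
    timestamps = fetch_json(index_url(filter_id, region, resolution))["timestamps"]
    latest = timestamps[-1]
    return fetch_json(series_url(filter_id, region, resolution, latest))["series"]


if __name__ == "__main__":
    # Requires network access; prints how many rows the latest chunk contains.
    rows = fetch_latest_series()
    print(f"downloaded {len(rows)} rows")
```

Keeping the URL construction separate from the network call makes the path layout easy to check without contacting the server.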
Use the spark-starter.ipynb notebook to analyze the data with Apache Spark.
Use the duckdb-starter.ipynb notebook to analyze the data using SQL queries with DuckDB.