Skip to content

Commit

Permalink
Merge pull request #3 from eurostat/dev
Browse files Browse the repository at this point in the history
Dev
  • Loading branch information
mmatyi authored Feb 13, 2025
2 parents 57fc5af + aee5134 commit 5765c0a
Show file tree
Hide file tree
Showing 18 changed files with 2,980 additions and 16 deletions.
9 changes: 8 additions & 1 deletion CDSE.md
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,13 @@ Here are the links to the Jupyter notebooks showcased during the platform webina
- [SentinelHub data visualisation](cdse-notebooks/sentinelhub-data-visualisation.ipynb)
- [OpenEO data processing](cdse-notebooks/openeo-example.ipynb)


There are many other notebooks with examples how to use the CDSE services. They are available here:
- [https://github.com/eu-cdse/notebook-samples/](https://github.com/eu-cdse/notebook-samples/)
- [https://github.com/Open-EO/openeo-community-examples/tree/main/python](https://github.com/Open-EO/openeo-community-examples/tree/main/python).
- [https://github.com/Open-EO/openeo-community-examples/tree/main/python](https://github.com/Open-EO/openeo-community-examples/tree/main/python).


:::{Important}
When you use the JupyterLab in the CDSE save your data under the ```/mystorage``` folder, because this is the storage which is persistent between different JuputerLab sessions. All other folders are erased at the reluanch of the JupyterLab server.
![Persistant storage in CDSE](img/mystorage.png)
:::
118 changes: 109 additions & 9 deletions CF.md
Original file line number Diff line number Diff line change
Expand Up @@ -8,14 +8,114 @@ On the [Horizon Dashboard](https://horizon.cloudferro.com/) the participants hav

![Horizon dashboard login screen](img/horizon-keystone.png)

After the first login the participants should be able to see the pre-launched server with GPU support. The participants from the Dashboard can terminate the preconfigured server and can launch additional service till the allowed overall limit per team:
After the first login, the participants should be able to see the pre-launched server with GPU support. The participants from the Dashboard can terminate the preconfigured server and can launch additional service till the allowed overall limit per team, which are:

- max 20 instances
- max 20 CPU cores
- max 50 GB RAM
- max 10 disk with a max 1000 GB storage in total

- **max 20 instances**
- **max 20 CPU cores** total in all instances
- **max 50 GB RAM** total in all instances
- **max 10 disks** with a **max 1000 GB** storage in total

From the virtual servers you can access the CDSE resources and the additional datasets which are mentioned in the [Data Catalogue](data-catalogue.md).

:::{Note}
The sample notebooks to access the datasets together with some description of the data are available in these folders:
- [https://github.com/eurostat/eubd2025_docs/tree/main/cdse-notebooks](https://github.com/eurostat/eubd2025_docs/tree/main/cdse-notebooks)
- [https://github.com/eurostat/eubd2025_docs/tree/main/cf-notebooks](https://github.com/eurostat/eubd2025_docs/tree/main/cf-notebooks)
- [https://github.com/eurostat/eubd2025_docs/tree/main/docs](https://github.com/eurostat/eubd2025_docs/tree/main/docs)
:::

From the virtual servers you can access the CDSE resources and the additional dataset which are mentioned in the Data Catalogue.
The sample notebooks are accessible under this folder:

## Step by step instructions to recreate the customized image with additional services (JupyterHub,Rstudio,Shiny,Superset)

1. After login in to the [Horizon dashboard](https://horizon.cloudferro.com/) go to the ***Instances*** in the left menu and choose ***Launch Instance***.

![Horizon dashboard instances](img/launch-instance.png)

2. In the new window first you give a name to your instance

![Instance name](img/instance-name.png)

3. Then select the source of the image: ***OSGeoLive 16.0***

![Image source](img/source.png)

4. The next step is to select the flavor of the instance e.g. size of CPU, RAM and disk.

:::{Important}
The instance should have at least 40GB of disk!
:::

![Flavor](img/flavor.png)

5. Under the networks add both networks

![Networks](img/networks.png)

6. Under the Security Groups enable the following ports for incoming connections:

- 22 (SSH)
- 3389 (RDP)
- 3838 (Shiny)
- 8000 (JupyterHub)
- 8088 (Superset)
- 8787 (Rstudio)

:::{Note}
In the account there can be already predefined security groups contain these port. If not, you have to create it following [these instructions](https://creodias.docs.cloudferro.com/en/latest/cloud/How-to-use-Security-Groups-in-Horizon-on-Creodias.html)
:::

![Security Groups](img/security-groups.png)

7. Finally you select or create an SSH key pair that will be used to connect to the instance for the first time.

![SSH key](img/key-pair.png)

8. Launch the instance and wait until it will be running

9. Create and associate an external IP following [these steps](https://creodias.docs.cloudferro.com/en/latest/networking/How-to-Add-or-Remove-Floating-IPs-to-your-VM-on-Creodias.html)

![Add IP](img/add-ip.png)

10. Then you can access your instance with an SSH client, and the first step will be to create a password for the default `eouser`. After creating the password the connection will be automatically closed.

![Initialize password](img/pwd-change.png)

11. To enable RDP and add the additional services you have to run the following shell script: [add_services_nogpu.sh](scripts/add_services_nogpu.sh)
For that, you have to upload/copy (e.g. WinSCP) to the server. Make it executable and run in a terminal.
```{bash}
chmod +x add_services_nogpu.sh
./add_services_nogpu.sh
```
:::{Important}
Before you run the script replace in the script the Superset admin password `<your_superset_password>` on line 136 with your own password.
:::
:::{Note}
The script runs and install all the services and libraries. It restarts several times the servers during this process and can last up to 1 hour.
You can check the installation status with 'more ~/.service_install_status' in the terminal. When it finishes the status will be `everything went well`.
![Instal finished](img/install-finished.png)
:::
## Accessing the services
**RDP**
The service is available under the external IP:3389 created under point 9 above in standard RDP clients. In case you are behind proxy you can tunnel the internal IP:3389 through SSH to your localhost and connect to your localhost port with the RDP client.
**Shiny**
The service is available in a browser under the external IP:3838 created under point 9 above.
**JupyterHub**
The service is available in a browser under the external IP:8000 created under point 9 above. The username is `eouser` and password what is defined under point 10.
**Apache Superset**
The service is available in a browser under the external IP:8088 created under point 9 above. The username is `eouser` and password what is defined under point 11 defined in the `add_services_nogpu.sh`.
**Rstudio**
The service is available in a browser under the external IP:8787 created under point 9 above. The username is `eouser` and password what is defined under point 10.
10 changes: 8 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,12 @@
# 2025 EU Big Data hackathon infrastructure
# 2025 EU Big Data hackathon documentation

The 2025 EU Big Data hackathon IT infrastructure has 2 main parts:
![hackathon logo](img/EU-Big-Data-Hackathon-2025_horiz_0.jpg)

The theme of the 2025 EU Big Data hackathon is the use of Earth Observation (EO) data for statistics.

**[The data catalogue](data-catalogue.md)** provides an overview of datasets that can be used during the hackathon.

The participants will be provided with a cloud environment to use the datasets mentioned above. The cloud platform has 2 main components:

- [Copernicus Data Space Ecosystem (CDSE)](CDSE.md)
- [Cloud servers](CF.md) in the [CREODIAS platform](https://creodias.eu/) provided by [CloudFerro](https://cloudferro.com/)
1 change: 1 addition & 0 deletions _toc.yml
Original file line number Diff line number Diff line change
@@ -1,5 +1,6 @@
format: jb-book
root: README
chapters:
- file: data-catalogue
- file: CDSE
- file: CF
Loading

0 comments on commit 5765c0a

Please sign in to comment.