-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Ingestion of common/shared data #61
Comments
all UCs have access to a folder located at /shared/fairicube or /home//.shared/fairicube
every UC has its own s3 bucket associated. This is only accessible by the UC
all UCs have access to a folder located at /shared/fairicube or /home//.shared/fairicube The respective kernels (to be selected on the beginning of a session) influences the software tools available (as requested by each UC), but has no influence on the availability of the shared/private foldera (see above) |
@eox-cs1 where is this documented? |
Hi @eox-cs1, Thank you for your responses! Thanks in advance, Best regards, -Bachir. |
A direct cross-UC access is not foreseen - would somehow contradict the separation. However, you can either use the common /shared/fairicube folder for data exchange/access |
@mari-s4e Because data there is not located under shared... |
Thanks @eox-cs1! Thanks in advance. |
@BachirNILU I believe in your case the simplest would be to add another profile option to UC4. Note that this per se does not incur costs, only when you run a session there are costs. If you want to use data from a use case specific bucket at any other place you can retrieve the required details like access keys, etc. from the env variable for example with a command like |
Thank you @Schpidi for your response! |
@sonjastndl sorry, there is a little misunderstanding that I might have caused in the last call. We offer two types of storage which "File Storage" and "Object Storage" which have slightly different capabilities. Per default in a JupyterLab session you get your personal workspace as well as a shared folder ( In addition we provide Object Storage to each Use Case separately for example to use with Sentinel Hub. This Object Storage is for convenience mounted to On top of this we were asked to provide shared Object Storage accessible to all Use Cases. This is the I hope this clarifies your questions. |
@BachirNILU we'll roll out the required configuration tomorrow and enable a large profile for UC4. |
@BachirNILU the big UC4 profile (Server Option) is now available |
With this I believe we can come back to the original question 😉 From a technical point of view the questions to answer are:
How to organize data on the bucket doesn't matter from a technical point of view but ,I agree, should be agreed on within a UC team or all FAIRiCUBE users and documented. |
@Schpidi many thanks for the clarification, but I fear now I understand exactly nothing :( I read about I then find either 3 or 4 options of where to put data:
vs.
On common bucket I've found some documentation in the FAIRiBOOK, requirement to install the s3browser TL;DR; the more I read, the less I see. When can we expect clear documentation on this? |
@KathiSchleidt it is always 4 options:
The different capabilities specific to FiC are also mentioned: "... to use with Sentinel Hub." In general the difference is: "... used via the s3 protocol." from anywhere vs. normal file system only available in JupyterLab. What are you missing? |
@Schpidi what I'm missing is a clean description of these various dimensions (objects vs. files, buckets vs filesystem) and options for providing and using this data. I admit I'm exceptionally confused due to my being less active in FAIRiCUBE the last months, but based on discussions with UC partners, seems I'm not the only one. I have the impression that the applicability of APIs is also somehow related (still waiting on that answer, now close to 5 months waiting :( ), please clarify where we can apply APIs When can we expect this to be clearly explained in RTD? |
Thanks! It works. |
@Schpidi am I correct that there is no ambition to document this? I just checked the adding datasets section on RTD, nothing there. |
Added a first guide on storage to RTD for review either at https://fairicube--8.org.readthedocs.build/en/8/guide/storage/ or FAIRiCUBE/collaboration-platform#8 Happy to read your feedback. |
Hi everyone,
I am curious about the modalities of the shared s3 bucket on EOX Hub.
If I access data (Example: Requesting a .nc on Surface Map for the year 2019 via the CDS API) at the EOX Hub, I am currently storing that data in our UC specific bucket. I can imagine that happening for other Use Cases and data as well and I am wondering, if we are having a lot of redundant data in the UC specific buckets.
Are there any recommendations on how to handle this from EOX side @Schpidi @eox-cs1? Just copying files to the common s3 might cause a bit of a chaos due to naming and other reasons, that does not avoid redundancy.
I assume we will talk about this in our next weeks meeting, still it would be good to know if there are already some foreseen "best practices" and/or restrictions that need to be considered?
@KathiSchleidt @Susannaioni
The text was updated successfully, but these errors were encountered: