Compare streaming vs file upload #41

Open
katilp opened this issue Aug 8, 2024 · 4 comments

katilp commented Aug 8, 2024

Compare runtime and total time between streaming and file upload.

This does not depend on whether GCS or NFS is used, as the input files are uploaded to the container's local disk.

katilp commented Aug 30, 2024

To read with the `root:` protocol locally, change `file:/...` to `root://localhost//` in metadata-template:

        awk -F/ '{ printf "root://localhost//code/files/"; print $NF }' bucket/files_{{inputs.parameters.recid}}.txt > bucket/files_local_{{inputs.parameters.recid}}.txt;
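As a rough illustration of what that awk line does, the sketch below runs the same program on a small made-up file list. The temporary directory, the `demo` suffix (standing in for the record id) and the two sample entries are assumptions for the example, not values from the workflow.

```shell
# Hypothetical sample input; in the workflow the list is bucket/files_<recid>.txt
tmpdir=$(mktemp -d)
printf '%s\n' \
  'file:/code/files/sample_1.root' \
  'root://example.org//store/data/sample_2.root' > "$tmpdir/files_demo.txt"

# Same awk program as above: split each line on "/", keep only the last
# field (the file name) and prefix it with the local xrootd path
awk -F/ '{ printf "root://localhost//code/files/"; print $NF }' \
  "$tmpdir/files_demo.txt" > "$tmpdir/files_local_demo.txt"

cat "$tmpdir/files_local_demo.txt"
# root://localhost//code/files/sample_1.root
# root://localhost//code/files/sample_2.root
```

Whatever the original prefix is, only the file name survives, so every entry ends up pointing at `/code/files/` on the local server.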

To set up a local xrootd server, add the following before the sed statements:

        xrootd -l - -b /code

tomcordruw commented Sep 20, 2024

I did a test using the root: protocol as instructed and the workflow completed successfully, but it didn't seem to have a big impact.

For reference:
argo_bucket_upload.yaml (standard way):

  • e2-standard-4: 4 hours 55 minutes (5 hours 17 minutes)

argo_bucket_upload.yaml (using root:):

  • e2-standard-4: 5 hours 5 minutes (5 hours 28 minutes)

Edit:
argo_bucket_run.yaml (streaming):

  • e2-standard-4: 4 hours 33 minutes (4 hours 57 minutes)
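To put the figures above side by side, here is a small sketch that converts the first time of each entry to minutes and prints the difference from the standard upload run. The variable names are made up for the example, and the times in parentheses are left out because the thread does not label them.

```shell
# Convert "H hours M minutes" to minutes
to_min() { echo $(( $1 * 60 + $2 )); }

upload_file=$(to_min 4 55)   # argo_bucket_upload.yaml, standard way
upload_root=$(to_min 5 5)    # argo_bucket_upload.yaml, using root:
streaming=$(to_min 4 33)     # argo_bucket_run.yaml, streaming

echo "root: vs standard upload: $(( upload_root - upload_file )) min"
echo "streaming vs standard upload: $(( streaming - upload_file )) min"
# root: vs standard upload: 10 min
# streaming vs standard upload: -22 min
```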

katilp commented Sep 20, 2024

Thanks! I saw the same in a shorter test. Somewhat unexpected but now confirmed.
Do you also have the time for the same run using streaming (i.e. without the file upload)?

tomcordruw commented

Ah yes, it was otherwise identical to the run in the cost comparison in issue #46, so I have now added the time for streaming to my comment above.
