Adding cache test WDL and GitHub Action #66

tefirman · 2025-01-16T05:03:55Z

Description

- Call-caching is only triggered when the same workflow is called more than once.
- This PR adds a new GitHub Action that calls a very simple workflow twice.
- A custom Cromwell configuration is necessary to enable call-caching.
- The Action checks the Cromwell logs of the second run to ensure call-caching was performed.
- A third run uses slightly different inputs to ensure that call-caching is not used in that case.
- The Action again checks the Cromwell logs to confirm this for the third run.
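
As a rough illustration of the log checks described above, here is a minimal Python sketch. It is not the verification step used in this PR's yml. The cache-hit marker string, the task name `cacheTest.hello`, and the sample log excerpts are all assumptions made for illustration; the pattern should be adjusted to whatever your Cromwell version actually prints.

```python
import re

# Assumption: this marker is what Cromwell prints when a call's results come
# from the cache. Adjust it to match the log lines your version emits.
CACHE_HIT_PATTERN = re.compile(r"Job results retrieved \(CallCached\)")


def count_cache_hits(log_text: str, pattern: re.Pattern = CACHE_HIT_PATTERN) -> int:
    """Return the number of log lines that match the cache-hit pattern."""
    return sum(1 for line in log_text.splitlines() if pattern.search(line))


def check_run(log_text: str, expect_cached: bool) -> bool:
    """Return True when the log agrees with the expectation for this run."""
    hits = count_cache_hits(log_text)
    return hits > 0 if expect_cached else hits == 0


# Hypothetical log excerpts, for illustration only.
second_run_log = "... Job results retrieved (CallCached): 'cacheTest.hello' ...\n"
third_run_log = "... job id: 1234\n... Status change from - to Done\n"

second_ok = check_run(second_run_log, expect_cached=True)
third_ok = check_run(third_run_log, expect_cached=False)
```

The same function covers both cases: the second run should contain at least one hit, and the third run (different inputs) should contain none.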

Related Issues

Testing

Executed the GitHub Action on this branch multiple times; it works as expected.

tefirman · 2025-01-16T22:51:50Z

Added @sitapriyamoorthi for review of the Cromwell/WDL aspects.
Added @sckott for review of the GitHub Action yml.
Tagging @seankross for visibility.

If we feel like this shouldn't be its own GitHub Action, we can definitely explore other options, just let me know.

sckott

looking great

.github/workflows/test-cromwell-cache.yml

cacheTest/README

.github/workflows/test-cromwell-cache.yml

sitapriyamoorthi

@tefirman sorry, looking for some clarification here:

Why do we need to modify the cromwell.conf? Can we not do this more simply by having three WDLs with different options.json files run in sequence with different caching configurations?
Wouldn't modifying the cromwell.conf affect the other unit tests?

Sorry, just trying to gain some clarity on why we need to reconfigure the cromwell.conf file.

tefirman · 2025-01-27T23:58:39Z

@sitapriyamoorthi -- No worries, the Cromwell setup is super confusing honestly.

The call-caching capability first needs to be enabled in your underlying config, and only then can it actually be triggered via the options json. Call-caching is disabled by default, so without providing that cromwell.conf file, the caching arguments in the options json get ignored and caching is never invoked. See Cromwell docs here.
The way the GitHub Actions are set up, cromwell.conf is only utilized for this caching unit test, so all the other unit tests will remain unchanged.
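
To make the two layers concrete, here is a small Python sketch that holds a caching stanza for the config and generates the per-run options. It assumes the `call-caching.enabled` config key and the `read_from_cache` / `write_to_cache` workflow options from the Cromwell docs; the stanza shown is only a guess at the minimum, and the actual cromwell.conf in this PR may contain more (for example, a database section so that separate runs can share a cache).

```python
import json

# Config-level switch, written in HOCON. Without it, the per-run options
# below are ignored and caching is never invoked.
# Assumption: this is only the caching stanza, not the PR's full cromwell.conf.
CROMWELL_CONF = """\
call-caching {
  enabled = true
}
"""


def make_options(read_from_cache: bool, write_to_cache: bool) -> str:
    """Serialize the two per-run caching workflow options as JSON."""
    return json.dumps(
        {"read_from_cache": read_from_cache, "write_to_cache": write_to_cache},
        indent=2,
    )


# Example: a run that populates the cache but does not read from it.
write_only_options = make_options(read_from_cache=False, write_to_cache=True)
```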

sitapriyamoorthi

My suggestions

Write two separate unit tests:
- One for validating that outputs are properly written to the cache.
- Another for ensuring the workflow reads outputs from the cache when appropriate.
Optionally, include a third combined test for end-to-end behavior to validate the full caching cycle.
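
One possible shape for the two separate checks, sketched in Python. Note that this swaps in Cromwell's workflow metadata JSON instead of the log scan used in this PR. The `callCaching` field names (`hit`, `allowResultReuse`) reflect my understanding of the metadata layout and should be treated as assumptions, as should the task name and sample data.

```python
def call_caching_blocks(metadata: dict) -> list:
    """Collect the callCaching block of every call attempt in a metadata dict."""
    blocks = []
    for attempts in metadata.get("calls", {}).values():
        for attempt in attempts:
            if "callCaching" in attempt:
                blocks.append(attempt["callCaching"])
    return blocks


def wrote_to_cache(metadata: dict) -> bool:
    """Write check: every call ran fresh and left a reusable result."""
    blocks = call_caching_blocks(metadata)
    return bool(blocks) and all(
        not b.get("hit", False) and b.get("allowResultReuse", False) for b in blocks
    )


def read_from_cache(metadata: dict) -> bool:
    """Read check: every call was satisfied from the cache."""
    blocks = call_caching_blocks(metadata)
    return bool(blocks) and all(b.get("hit", False) for b in blocks)


# Hypothetical metadata excerpts, for illustration only.
first_run = {
    "calls": {"cacheTest.hello": [{"callCaching": {"hit": False, "allowResultReuse": True}}]}
}
second_run = {
    "calls": {"cacheTest.hello": [{"callCaching": {"hit": True, "allowResultReuse": True}}]}
}
```

Keeping the two predicates apart means a failure points at either cache population or cache retrieval, which is the separation suggested above.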

Why Separate Tests?

Clarity: By isolating the read and write behaviors, you can clearly identify which part of the caching functionality is working or failing.
- A failure in a combined test could make it harder to pinpoint whether the issue lies in writing to the cache, reading from it, or both, especially since cluster permission issues can also affect this.
Modular Testing: Unit tests should ideally focus on a single functionality or behavior.
- A separate test for writing ensures that the cache is being populated correctly.
- A separate test for reading ensures that previously cached results are being accessed as expected.
Debugging: If something goes wrong, separate tests make it easier to debug. For example:
- If the write-to-cache test fails, you know there's an issue with how the cache is being created or populated.
- If the read-from-cache test fails, it’s likely an issue with cache retrieval or matching logic.
Future Scalability: Separate tests make it easier to expand coverage for more complex caching scenarios in the future (e.g., testing caching with different runtime conditions or inputs).

tefirman · 2025-01-28T20:25:18Z

I definitely agree with splitting these unit tests in concept. I will say that in practice, that becomes kinda difficult due to fresh environments for each GitHub Action, i.e. I can't read from the cache without writing to the cache first. It's possible in theory though, let me give it a shot. I'm thinking the general approach will be:

- Write both jobs in the same yml
- Have the read-from-cache check rely on the write-to-cache check
- Pass the cache metadata from write-to-cache to read-from-cache
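
A minimal Python sketch of the hand-off in the last step, under stated assumptions: the write job records which run populated the cache in a small JSON file, and the read job checks that its cache hit points back at that run. The file name, the `source_workflow_id` key, and the format of the `result` string are all hypothetical. In GitHub Actions the file would presumably travel between jobs as an artifact, with the read job declaring `needs:` on the write job.

```python
import json
import tempfile
from pathlib import Path


def save_handoff(path: Path, workflow_id: str) -> None:
    """Write job: record which run populated the cache."""
    path.write_text(json.dumps({"source_workflow_id": workflow_id}))


def hit_came_from_handoff(path: Path, call_caching: dict) -> bool:
    """Read job: confirm the cache hit points back at the recorded run."""
    source_id = json.loads(path.read_text())["source_workflow_id"]
    return bool(call_caching.get("hit", False)) and source_id in call_caching.get("result", "")


# Hypothetical IDs and result string, for illustration only.
with tempfile.TemporaryDirectory() as tmp:
    handoff = Path(tmp) / "cache-handoff.json"
    save_handoff(handoff, "run-1")
    ok = hit_came_from_handoff(
        handoff, {"hit": True, "result": "Cache Hit: run-1:cacheTest.hello:-1"}
    )
    miss = hit_came_from_handoff(handoff, {"hit": False})
```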

Although, now I'm wondering if this would just be easier as one of the python-based "api-tests"... @sckott, penny-for-your-thoughts here?

sckott · 2025-01-28T21:07:32Z

> I'm wondering if this would just be easier as one of the python-based "api-tests"

happy to think about it with you but don't totally understand - maybe we could chat about it on a call

Adding cache test WDL and GitHub Action

8a9c7ae

tefirman linked an issue Jan 16, 2025 that may be closed by this pull request

Cache invalidation on environmental changes #19

Open

tefirman added 20 commits January 15, 2025 21:13

Adding options.json to cacheTest

89b5331

Identifying output.txt file after each task

459177f

Identifying output.txt file after each task

4666037

Identifying output.txt file after each task

1914d57

Adding initial version of config file for cache testing

cae2d06

Updating cache test cromwell config

41f824c

Updating cache test cromwell config

8de27fa

Updating cache test cromwell config

9781f4b

Updating cache test cromwell config

30aaad2

Finally have a successfully caching WDL

a2de077

Updating Cromwell cache GitHub Action

9514874

Updating Cromwell cache GitHub Action

9a4e8e9

Updating cache test Cromwell configuration

ebdfbc1

Updating Cromwell cache GitHub Action

54040b2

Fixing cache verification step in GitHub Action

ab11b82

Fixing cache verification step in GitHub Action

66011cc

Fixing cache verification step in GitHub Action

1154365

Fixing cache verification step in GitHub Action

5eb0f42

Adding third run to ensure no cache usage during cacheTest GitHub Action

9824149

Adding cacheTest README and fixing typo in cache test yml

6642f25

tefirman requested review from sckott and sitapriyamoorthi January 16, 2025 22:49

tefirman added the "unit test" and "infrastructure" labels Jan 16, 2025

tefirman mentioned this pull request Jan 16, 2025

Caching issue when deploying unit tests #26

Open

sckott requested changes Jan 17, 2025

.github/workflows/test-cromwell-cache.yml (outdated, resolved)

.github/workflows/test-cromwell-cache.yml (outdated, resolved)

cacheTest/README (outdated, resolved)

.github/workflows/test-cromwell-cache.yml (resolved)

sckott mentioned this pull request Jan 17, 2025

use cromwell and womtool 87 instead of 86 #72

Merged

tefirman added 2 commits January 24, 2025 15:44

Switching from Cromwell 86 to 87 and commenting out time check

9254c5b

Deleting time checks entirely

ff15737

sitapriyamoorthi reviewed Jan 27, 2025

sitapriyamoorthi reviewed Jan 28, 2025
