MultiStateReporter variable pos/vel save frequency #712

richardjgowers · 2023-07-13T12:50:35Z

Description

allows MultiStateReporter to write positions and velocities and different frequency to energies data.

Todos

Implement feature / fix bug
Add tests
Update documentation as needed
Update changelog to summarize changes in behavior, enhancements, and bugfixes implemented in this PR

Status

Ready to go

Changelog message

MultiStateReporter now takes optional `position_interval` and `velocity_interval` keyword args to control the frequency these are saved at

…rent frequency to energies data.

mikemhenry · 2023-07-13T18:39:41Z

Will we run into key errors if you try and use this with an .nc file from an older version?

codecov · 2023-07-13T19:01:35Z

Codecov Report

Merging #712 (d8f80b9) into main (cbff4c8) will increase coverage by 0.00%.
The diff coverage is 92.85%.

❗ Current head d8f80b9 differs from pull request most recent head 981fe1b. Consider uploading reports for the commit 981fe1b to get more accurate results

Additional details and impacted files

richardjgowers · 2023-07-17T15:46:27Z

@mikemhenry it shouldn't affect reading old files, I've only played with the writing code.

In terms of old code reading newer nc files, unless you've changed the defaults from writing positions/velocities on every energy write you should get the same. If you read files with missing data you now get:

>>> ds.variables['positions'][1, :, :]

masked_array(
  data=[[--, --, --],
        [--, --, --],
        [--, --, --],
        [--, --, --],
        [--, --, --],
        [--, --, --],
        [--, --, --],

so it looks like netcdf isn't erroring but instead giving you the lack of data.

I have added two attributes to the DataSet, so if you write code that expects these you're going to have a bad time.

Is this what ncfile.ConventionVersion = '0.2' is made to handle? Should I be bumping that?

ijpulidos

Thanks for this contribution! This looks good to me.

I do have questions about the motivation for this, since it adds a complexity that I don't seem to get why it is needed, can you expand on this?

Also, we probably want to write tests for this, since it's changing the behavior of the reporter and that can be critical for our users.

richardjgowers · 2023-08-23T19:32:37Z

@ijpulidos the filesizes for MultiStateReporter can get pretty large, since you're saving ~11 trajectories. You typically want to save the energies at a high frequency (since you're very interesting in seeing correlation times) but positions you can have these at a lower frequency, velocities you might not even care about.

Can you point me towards the existing tests so I can make sure the tests I write fit in?

ijpulidos · 2023-08-25T15:03:29Z

I see, yeah I had the feeling it was a storage issue. We probably need to clean things up in general but I think this will help. Thanks for clarifying the motivation.

As per the tests, I think having them as part of the TestReporter class should be the place, we might consider moving this class to its own test module, but we don't need that right now. Thanks!

mikemhenry · 2023-11-30T16:10:29Z

In addition to adding tests, we also want to add

energy frequency

And double check what gets saved when

mikemhenry · 2023-11-30T16:21:18Z

We want to figure out what should be saved in the checkpoint and what should be saved in simulation.nc

ijpulidos · 2023-12-01T18:23:51Z

@mikemhenry Yes, I agree we probably need to discuss first what we want to store and when. And which of these are needed when resuming simulations and similar situations.

If we want to have all of these independently we probably want to sync them with the checkpointing. Such that the required values are up to date when resuming or extending simulations.

mikemhenry · 2023-12-01T18:41:00Z

Yah I think we need to also make sure we figure out what our analysis tools expect. My hunch is that we need to save a lot of stuff to resume a simulation, but that should be a checkpoint, the time series data we save for analysis should be user configurable.

ijpulidos · 2023-12-01T19:33:19Z

the time series data we save for analysis should be user configurable.

Yes, decoupling the checkpointing with the user-configurable time series data would probably work better here. I'd vote for that route if it makes sense.

As far as I can tell these are coupled. We would need to think a little bit further about this, since I'm not sure how big of a refactor this could be. There's a lot of moving back and forth between the MultiStateReporter and the MultiStateSampler.

currently no pos test is failing...

richardjgowers · 2024-01-12T16:04:57Z

openmmtools/tests/test_sampling.py

+            for state, restored_state in zip(sampler_states, restored_sampler_states):
+                # missing values are returned as numpy masked array
+                # so we check that these arrays are all masked
+                assert restored_state.positions._value.mask.all()


this is something that I'm not 100% sure of currently. netCDF will return a numpy masked array when you access data that isn't present (here we're saving vels every other frame, so accessing velocities here has no data). I hadn't ever encountered these, so maybe it's not the best thing to return? (or maybe it is if it's the netCDF normal return).

richardjgowers · 2024-01-12T16:05:32Z

openmmtools/tests/test_sampling.py

+                # so we check that these arrays are all masked
+                assert restored_state.positions._value.mask.all()
+                assert restored_state.velocities._value.mask.all()
+                assert restored_state.box_vectors is None  # not periodic


this is something I need to double check, I'm a little confused why the checkpoint has a box and future frames don't, so this is still WIP till I figure that out

richardjgowers · 2024-01-12T16:06:30Z

@ijpulidos @mikemhenry this is much closer to ready, I've added some tests so might be a good time for you both to read through

mikemhenry · 2024-02-26T17:07:14Z

@richardjgowers getting some errors in CI

Traceback (most recent call last):
  File "/home/runner/micromamba-root/envs/openmmtools-test/lib/python3.9/site-packages/nose/case.py", line 197, in runTest
    self.test(*self.arg)
  File "/home/runner/work/openmmtools/openmmtools/openmmtools/tests/test_sampling.py", line 523, in test_write_sampler_states_no_pos
    assert restored_state.positions._value.mask.all()
AttributeError: 'numpy.ndarray' object has no attribute 'mask'

See the CI log here: https://github.com/choderalab/openmmtools/actions/runs/8052365133

Honestly if you make a commit to bump this PR, you should get the same results, I was having an issue re-starting CI

ijpulidos · 2024-03-21T16:50:00Z

I wonder if this is what's biting us here, though I don't know why are we expecting these arrays to be masked? #701

ijpulidos · 2024-03-21T16:56:09Z

Now, to be fair, I don't think the changes in that linked PR have anything to do with the serialized/deserialized positions or velocities. So I doubt it.

allow MultiStateReporter to write positions and velocities at a diffe…

4805553

…rent frequency to energies data.

MultiStateReporter use 0 for do not write

59ca4d5

mikemhenry requested a review from ijpulidos August 1, 2023 15:19

Merge branch 'main' into multistatereporter_variable_pos_frequency

7a3bd40

mikemhenry approved these changes Aug 23, 2023

View reviewed changes

ijpulidos reviewed Aug 23, 2023

View reviewed changes

Merge branch 'main' into multistatereporter_variable_pos_frequency

d8f80b9

ijpulidos assigned richardjgowers Dec 4, 2023

richardjgowers added 4 commits January 11, 2024 15:36

WIP of multistatereporter tests

8fba23f

test for variable position saving

e0555c5

more tests for smaller nc files

2d88a33

currently no pos test is failing...

catch IndexError when file had no position/velocity data ever stored

9a24479

richardjgowers commented Jan 12, 2024

View reviewed changes

richardjgowers changed the title ~~MultiStateReporter variable pos/vel save frequency~~ [wip] MultiStateReporter variable pos/vel save frequency Jan 12, 2024

Merge branch 'main' into multistatereporter_variable_pos_frequency

981fe1b

mikemhenry mentioned this pull request Feb 26, 2024

Test pr #725

Closed

mikemhenry changed the title ~~[wip] MultiStateReporter variable pos/vel save frequency~~ MultiStateReporter variable pos/vel save frequency Feb 26, 2024

Merge branch 'main' into multistatereporter_variable_pos_frequency

542f341

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

MultiStateReporter variable pos/vel save frequency #712

MultiStateReporter variable pos/vel save frequency #712

richardjgowers commented Jul 13, 2023 •

edited by mikemhenry

Loading

mikemhenry commented Jul 13, 2023

codecov bot commented Jul 13, 2023 •

edited

Loading

richardjgowers commented Jul 17, 2023

ijpulidos left a comment •

edited

Loading

richardjgowers commented Aug 23, 2023

ijpulidos commented Aug 25, 2023

mikemhenry commented Nov 30, 2023

mikemhenry commented Nov 30, 2023

ijpulidos commented Dec 1, 2023

mikemhenry commented Dec 1, 2023

ijpulidos commented Dec 1, 2023

richardjgowers Jan 12, 2024

richardjgowers Jan 12, 2024

richardjgowers commented Jan 12, 2024

mikemhenry commented Feb 26, 2024

ijpulidos commented Mar 21, 2024

ijpulidos commented Mar 21, 2024

MultiStateReporter variable pos/vel save frequency #712

Are you sure you want to change the base?

MultiStateReporter variable pos/vel save frequency #712

Conversation

richardjgowers commented Jul 13, 2023 • edited by mikemhenry Loading

Description

Todos

Status

Changelog message

mikemhenry commented Jul 13, 2023

codecov bot commented Jul 13, 2023 • edited Loading

Codecov Report

richardjgowers commented Jul 17, 2023

ijpulidos left a comment • edited Loading

Choose a reason for hiding this comment

richardjgowers commented Aug 23, 2023

ijpulidos commented Aug 25, 2023

mikemhenry commented Nov 30, 2023

mikemhenry commented Nov 30, 2023

ijpulidos commented Dec 1, 2023

mikemhenry commented Dec 1, 2023

ijpulidos commented Dec 1, 2023

richardjgowers Jan 12, 2024

Choose a reason for hiding this comment

richardjgowers Jan 12, 2024

Choose a reason for hiding this comment

richardjgowers commented Jan 12, 2024

mikemhenry commented Feb 26, 2024

ijpulidos commented Mar 21, 2024

ijpulidos commented Mar 21, 2024

richardjgowers commented Jul 13, 2023 •

edited by mikemhenry

Loading

codecov bot commented Jul 13, 2023 •

edited

Loading

ijpulidos left a comment •

edited

Loading