[QUESTION] Memory management suggestion #393

yswhynot · 2024-12-12T21:38:44Z

Dear amazing warp team,

I'm working on a design optimization project for tendon-driven soft fingers, adapted from the example_walker.py code. Although everything works fine for the forward and backward gradients, I noticed very heavy memory usage of the cuda graph. To get my soft structures to work reliably with the euler integrator, I had to set super large frame rate (1000) and substeps (300). Otherwise, the contacts between soft and rigid bodies get unreasonable. This makes it only feasible to run a short period of time without running into memory issue on RTX 4090.

I noticed the memory usage happens mostly on creating the cuda graph. May I know is there any suggestions or pointers over what's a better option?

My soft mesh is already the simplest possible, ~4k tetrahedron.

Thank you so much for your help!

The text was updated successfully, but these errors were encountered:

shi-eric · 2024-12-12T23:22:36Z

Hi @yswhynot, are you sure that it's the use of CUDA graphs specifically that's causing the memory issue? It seems to me that it's more likely that the number of simulation states you need to keep in memory in order to run the backward pass is probably the real culprit. Have you tried running the problem with CUDA graph capture disabled?

If it's simply that you can't fit all the simulation states in memory, you would have to implement something like a gradient checkpointing strategy.

yswhynot · 2024-12-13T00:48:48Z

Hi Eric,

Oh yes I'm sorry, you are absolutely right. It is the list of states causing the memory issue.

If this is the case, do you have any recommendations of gradient checkpointing with warp? Or are there better ways to either turn something off to save the memory each states need, or better tuning to simulate soft-rigid contacts more stably with larger steps?

Thank you so much for your help!

eric-heiden · 2024-12-13T17:09:15Z

Hi @yswhynot,

We currently don't have an example for gradient checkpointing, we will consider adding this in the future.

Another thing worth trying is to improve the discretization of the mesh to make sure the tetrahedra are not degenerate (too small, almost planar, etc.). I noticed the voxelized discretization that we have in the bear model from example_walker.py seems to be particularly efficient to simulate for this reason.

yswhynot · 2024-12-13T20:44:50Z

Thank you so much for your suggestion!

My tetrahedra are indeed quite small because I am simulating the tendon's tension forces along the soft finger, so I need to put vertices all along the way and add forces to those vertices. Or if there is better ways of simulating tendons, I'd be happy to know!

yswhynot added the question The issue author requires information label Dec 12, 2024

shi-eric assigned eric-heiden Dec 19, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[QUESTION] Memory management suggestion #393

[QUESTION] Memory management suggestion #393

yswhynot commented Dec 12, 2024

shi-eric commented Dec 12, 2024

yswhynot commented Dec 13, 2024 •

edited

Loading

eric-heiden commented Dec 13, 2024

yswhynot commented Dec 13, 2024

[QUESTION] Memory management suggestion #393

[QUESTION] Memory management suggestion #393

Comments

yswhynot commented Dec 12, 2024

shi-eric commented Dec 12, 2024

yswhynot commented Dec 13, 2024 • edited Loading

eric-heiden commented Dec 13, 2024

yswhynot commented Dec 13, 2024

yswhynot commented Dec 13, 2024 •

edited

Loading