Breaking a stalled simulation #4074

lefile · 2024-05-08T06:57:24Z

Hi, I am running a few hundred simulations in a loop as part of a parameter sweep. Because some parameter combinations are not feasible, the solver sometimes becomes very slow. Occasionally it gets stuck entirely and neither finishes nor crashes for a few hours, e.g.
```
At t = 0.460958 and h = 1.50237e-20, the corrector convergence failed repeatedly or with |h| = hmin.
At t = 0.460958 and h = 1.01936e-14, the corrector convergence failed repeatedly or with |h| = hmin.
At t = 0.460958 and h = 2.74873e-18, the corrector convergence failed repeatedly or with |h| = hmin.
At t = 0.460958 and h = 4.07547e-16, the corrector convergence failed repeatedly or with |h| = hmin.
At t = 0.0822078 and h = 1.30316e-14, the corrector convergence failed repeatedly or with |h| = hmin.
At t = 0.0822074 and h = 2.22506e-22, the corrector convergence failed repeatedly or with |h| = hmin.
At t = 0.0822079 and h = 9.38556e-15, the corrector convergence failed repeatedly or with |h| = hmin.
At t = 0.0348644 and h = 5.10754e-28, the corrector convergence failed repeatedly or with |h| = hmin.
At t = 0.0111925 and h = 2.81322e-15, the corrector convergence failed repeatedly or with |h| = hmin.
At t = 0.0111924, repeated recoverable residual errors.
At t = 0.00527447, repeated recoverable residual errors.
```
Is there a way to, for example, monitor the execution time of a simulation.solve() call and abort it if it exceeds some threshold? Or is there some other way to deal with such cases?

I'm using "safe" mode, by the way, and have already tried adjusting dt_min and the tolerances.

Saransh-cpp (Collaborator) · 2024-05-08T08:20:01Z

Hi, if you want to abort a simulation after a fixed time limit, BattBot uses the following snippet to break out of stuck simulations:

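```python
import multiprocessing
import random
from multiprocessing import Process

# `random_plot_generator`, `choice`, and `testing` are defined elsewhere
# (not shown in this excerpt).
while True:
    # Shared dict that the child process can write its results into
    manager = multiprocessing.Manager()
    return_dict = manager.dict()

    choice_list = [
        "degradation comparison",
        "model comparison",
        "parameter comparison",
    ]
    if choice is None:
        choice = random.choice(choice_list)

    # Run the simulation in a separate process so it can be killed if it stalls
    p = Process(
        target=random_plot_generator, args=(return_dict, choice, None, testing)
    )

    p.start()
    # time-out: wait at most 1200 seconds (20 minutes) for the process
    p.join(1200)

    if p.is_alive():  # pragma: no cover
        # Still running after the time-out: kill it and try again
        print(
            "Simulation is taking too long, "
            + "KILLING IT and starting a NEW ONE."
        )
        p.kill()
        p.join()
    else:  # pragma: no cover
        # Finished in time: leave the retry loop
        break
```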

lefile (Author) · May 8, 2024

Thank you for the answer, Saransh. I'm afraid I don't understand much of the code, though. How can I integrate it into code that solves a model with simulation.solve()?
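
One way to adapt the pattern above to a parameter sweep is sketched below, with the BattBot-specific parts removed. This is only a minimal sketch under assumptions: `solve_case`, `run_sweep`, and the `cases` dictionaries are hypothetical names that do not appear in the thread, and `time.sleep` stands in for the real solve. In practice the body of `solve_case` would build the simulation for one parameter set and call simulation.solve().

```python
import multiprocessing
import time


def solve_case(params, return_dict):
    # Hypothetical worker. In real use, build the simulation for `params`
    # here and call simulation.solve(); the sleep only imitates a slow solve.
    time.sleep(params["seconds"])
    return_dict["result"] = f"solved {params['name']}"


def run_sweep(cases, timeout_s):
    """Run each case in its own process; kill any that exceed timeout_s."""
    outcomes = {}
    with multiprocessing.Manager() as manager:
        for params in cases:
            # Fresh shared dict per case so results cannot leak between runs
            return_dict = manager.dict()
            proc = multiprocessing.Process(
                target=solve_case, args=(params, return_dict)
            )
            proc.start()
            proc.join(timeout_s)  # wait at most timeout_s seconds
            if proc.is_alive():
                proc.kill()  # stalled: stop the child process
                proc.join()  # reap it before moving on
                outcomes[params["name"]] = "timed out"
            else:
                # "failed" covers a child that exited without storing a result
                outcomes[params["name"]] = return_dict.get("result", "failed")
    return outcomes


if __name__ == "__main__":
    cases = [
        {"name": "feasible", "seconds": 0.1},
        {"name": "stalled", "seconds": 60},
    ]
    print(run_sweep(cases, timeout_s=2))
```

Unlike the snippet above, this sketch does not retry a stalled case; it records the time-out and moves on to the next parameter set, which is usually what a sweep needs. The `if __name__ == "__main__":` guard matters on platforms where child processes are started by re-importing the main module.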
