Description
Minimal code to reproduce the error in a local Jupyter notebook:
import distributed
import adaptive
adaptive.notebook_extension()
cluster = distributed.LocalCluster()
cluster.adapt(minimum=0, maximum=5)  # the error does not occur with manual scaling, i.e. cluster.scale(5)
client = distributed.Client(cluster)
learner = adaptive.Learner1D(lambda x: x, bounds=(-1, 1))
runner = adaptive.Runner(learner, executor=client, goal=lambda l: l.loss() < 0.01)
runner.live_info()
cluster.close()
This produces the following error:
Task exception was never retrieved
future: <Task finished name='Task-327' coro=<live_info.<locals>.update() done, defined at /opt/conda/lib/python3.9/site-packages/adaptive/notebook_integration.py:217> exception=AssertionError()>
Traceback (most recent call last):
  File "/opt/conda/lib/python3.9/site-packages/adaptive/notebook_integration.py", line 226, in update
    status.value = _info_html(runner)
  File "/opt/conda/lib/python3.9/site-packages/adaptive/notebook_integration.py", line 258, in _info_html
    ("elapsed time", datetime.timedelta(seconds=runner.elapsed_time())),
  File "/opt/conda/lib/python3.9/site-packages/adaptive/runner.py", line 658, in elapsed_time
    assert self.task.cancelled()
AssertionError
The same error occurs on a cluster with manual scaling when the runner is started before the workers have had enough time to connect. It seems that adaptive does not see any workers and terminates the process.
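As a possible stopgap for the manual-scaling case, one could wait until at least one worker has connected before creating the runner. The sketch below is only an illustration: `wait_until_workers` and its parameters are hypothetical names, not part of adaptive or distributed, and I have not confirmed that this avoids the AssertionError in every setup.

```python
import time
from typing import Callable


def wait_until_workers(
    count_workers: Callable[[], int],
    minimum: int = 1,
    timeout: float = 30.0,
    poll_interval: float = 0.1,
) -> int:
    """Block until count_workers() reports at least `minimum` workers.

    Hypothetical helper: `count_workers` is any zero-argument callable
    that returns the number of currently connected workers.
    Raises TimeoutError if the minimum is not reached within `timeout` seconds.
    """
    deadline = time.monotonic() + timeout
    while True:
        n = count_workers()
        if n >= minimum:
            return n
        if time.monotonic() >= deadline:
            raise TimeoutError(
                f"only {n} of {minimum} workers connected after {timeout} s"
            )
        time.sleep(poll_interval)


# Assumed usage with the reproduction above (manual scaling):
#   cluster.scale(5)
#   wait_until_workers(lambda: len(client.scheduler_info()["workers"]))
#   runner = adaptive.Runner(learner, executor=client, goal=...)
```

distributed also provides `Client.wait_for_workers(n_workers)`, which should serve the same purpose without a custom helper. With `cluster.adapt(minimum=0, ...)` there may be no workers to wait for until work is submitted, so this only applies to the manually scaled case.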