MultiprocessVectorEnv blocked forever #389

Open
oribarel opened this issue Jan 27, 2019 · 5 comments

Comments

@oribarel

Hi, I've noticed that sometimes, when running more than one multiprocessed agent, the program blocks forever at the line `results = [remote.recv() for remote in self.remotes]` in `MultiprocessVectorEnv`. Do you have a workaround for that?

@muupan
Member

muupan commented Jan 28, 2019

I don't think I've experienced this. Can you provide more details?

@oribarel
Author

oribarel commented Jan 28, 2019

It has happened three times so far. In all cases the system was under heavy load and I was running more than one agent concurrently. The blocked agent was using at least 8 environments, and all training was CPU-only. If you need more specific details, don't hesitate to ask.

@muupan
Member

muupan commented Jan 31, 2019

I have no idea how it can happen. Can you provide code that can reproduce this issue?

@oribarel
Author

Unfortunately, I can't. In addition, the issue does not reproduce deterministically.

@oribarel
Author

oribarel commented Feb 4, 2019

Update: it also happened when training a single agent with 16 envs on a 24-core CPU.
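No workaround was posted in this thread. As a diagnostic aid only, the blocking `remote.recv()` calls reported above could be guarded with a timeout, so that a silent hang turns into an error naming the unresponsive worker. The sketch below assumes that the entries of `self.remotes` are `multiprocessing` connection objects (consistent with the `recv()` call, but not confirmed in the thread). The helper name `recv_all_with_timeout` and its `timeout` parameter are hypothetical and not part of the library. It does not explain or fix the underlying cause.

```python
import multiprocessing as mp
import time


def recv_all_with_timeout(remotes, timeout):
    """Receive one message from each connection, or raise TimeoutError.

    `remotes` stands in for `self.remotes` (assumed to be multiprocessing
    Connection objects); `timeout` is the total budget in seconds shared
    by all connections. Both names are hypothetical.
    """
    deadline = time.monotonic() + timeout
    results = []
    for i, remote in enumerate(remotes):
        remaining = max(0.0, deadline - time.monotonic())
        # Connection.poll(t) returns False if nothing is readable within t seconds.
        if not remote.poll(remaining):
            raise TimeoutError(f"no reply from worker {i} within {timeout} s")
        results.append(remote.recv())
    return results


if __name__ == "__main__":
    # Two pipes in one process stand in for two worker connections.
    parent_a, child_a = mp.Pipe()
    parent_b, child_b = mp.Pipe()
    child_a.send("obs-a")  # "worker" a replies; "worker" b stays silent
    try:
        recv_all_with_timeout([parent_a, parent_b], timeout=0.5)
    except TimeoutError as err:
        print(err)
```

Knowing which worker index stops replying, together with a stack dump of that worker process, may help narrow down a reproducible case, which is what the maintainer asked for.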

2 participants