-
Notifications
You must be signed in to change notification settings - Fork 53
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Flaky] Exception in GraphListener thread: rcl_wait unexpectedly timed out #527
Comments
This can also be reproduced while running navigation2 test: |
Looking at the code:
So the issue seems to be some race condition between the insertion of an event, the setting of |
I did some testing of this using the jazzy backport and the foxglove bridge. Was getting the same issue. I'll do some more testing tomorrow to confirm in a more isolated environment. |
I am also experiencing the same issue even with #528. |
@mgraoikerlan we are still investigating the issue but we weren't able to reproduce it from our side. Do you have a minimal example that would trigger it on your side? |
You can use the example @evshary provided:
I'm getting the error with that too. |
If I run
on my side I get the same error as @evshary (see #527 (comment)). I.e. I do NOT see any timed out error
That error was solved by #528 on my side and on the CI. Did you manage to trigger the rcl_wait unexpectedly timed out error in any other way? |
I confirm that this test does not give this error after #528. In the next few days I will do more tests with the code of my project. |
Closing as the original rcl_wait error has been resolved. |
Despite #528 reduces the frequency of this error, it's still happening.
And another user reported it in the launch phase of its robot:
|
I did some "printf debugging" and here is what I found: When the bug occurs (i.e. I think the issue is that during the previous loop iteration:
@Yadunund , @clalancette , what do you think ? |
In #575 we make sure that when entering in Still, I'm not clear on 2 points in
|
Is
I don't think there is anything that specifies that the user of the waitset handles every ready entity in the waitset. You could (in theory) wait for a ready condition for a specific entity and ignore the rest. This would cause a lot more churn/spin, but is a "legal" use of the pattern. Specifically though in the case of the GraphListener, this may indicate a bug if the notifications aren't propagating. |
The same error log has been reported by several users in different contexts, using latest release, or commit
The text was updated successfully, but these errors were encountered: