-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Timeout Warnings #165
Comments
The first part is very reasonable. The second part is harder since by the time we recognize that we are approaching the timeout, it's too late to turn on the call stack recording to find out which source/factory/processor is taking too long. I'm still thinking through the design implications, but here goes, ordered from "Nathan is optimistic" to "Nathan is pessimistic":
I'm going to start by implementing (1) and see what happens when I try for (2). |
As of #385, the supervisor thread uses the USR2 signal to obtain a full backtrace from any timed-out workers. It now also logs the arrow name and the event number. |
Question from Wouter in ePIC compSW MatterMost channel:
Wouter Deconinck (he/him)
8:49 PM
@david Lawrence On the EICrecon timeout and warmup timeout, is it technically possible to print a warning for instances when we hit, say, over 80% of the time out but still succeed, including some info from the 'recovered' thread about what it was doing that took so long? That would be very helpful in debugging timeouts.
The text was updated successfully, but these errors were encountered: