Skip to content

Qmaster / Execd Daemon Randomly Crashes with got nullptr element for Jb_owner #64

@eddiewang927

Description

@eddiewang927

We are experiencing an intermittent issue where both Qmaster and some execd hosts randomly crash.
In the logs, we repeatedly see the following fatal message right before the daemon stops:
XXX|C|!!!!!!!!!! got nullptr element for Jb_owner !!!!!!!!!!
Observed Behavior
• The message appears randomly on different hosts (Qmaster or execd).
• Once the message is printed, the daemon immediately exits.
• Restarting the service temporarily resolves the issue, but the crash eventually reoccurs.

Metadata

Metadata

Assignees

Labels

questionFurther information is requested

Type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions