Restart model service process without reloading the container #3376

Open
kyujin-cho opened this issue Jan 5, 2025 — with Lablup-Issue-Syncer · 0 comments
Motivation  

In the early bootstrap phase, a model service needs to be restarted frequently to find working and performant parameters. In such cases, following the original model service restart procedure (terminating the session and creating a fresh one) becomes a bottleneck, especially compared to just restarting the inference framework process itself. By extending the current kernel runner spec a little more, we can restart the model service process itself without touching the container lifecycle.

Required Features

  • Create a new REST API to trigger a restart of the model service process
  • Create a new RPC handler to handle the model service process restart request
  • Add a new kernel runner interface which shuts down the running model service process
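
As a rough illustration of the third item, the shutdown-then-respawn flow could look something like the sketch below. This is not the existing kernel runner API: `ModelServiceProcess`, its methods, and the `grace_period` parameter are hypothetical names made up for this example, and it assumes the model service runs as a plain child process that exits on SIGTERM. The container itself is never touched; only the child process is replaced.

```python
import subprocess
import sys
from typing import Optional, Sequence


class ModelServiceProcess:
    """Hypothetical owner of the model service child process (illustration only)."""

    def __init__(self, command: Sequence[str], grace_period: float = 10.0) -> None:
        self.command = list(command)
        self.grace_period = grace_period
        self.proc: Optional[subprocess.Popen] = None

    def start(self) -> int:
        # Spawn the inference framework as a child of the runner.
        self.proc = subprocess.Popen(self.command)
        return self.proc.pid

    def shutdown(self) -> Optional[int]:
        # Stop the running process; returns its exit code, or None if nothing ran.
        if self.proc is None:
            return None
        if self.proc.poll() is None:
            self.proc.terminate()  # ask politely first (SIGTERM on POSIX)
            try:
                self.proc.wait(timeout=self.grace_period)
            except subprocess.TimeoutExpired:
                self.proc.kill()  # force-stop if the grace period expires
                self.proc.wait()
        code = self.proc.returncode
        self.proc = None
        return code

    def restart(self, command: Optional[Sequence[str]] = None) -> int:
        # Shut down the old process, optionally swap in a new command
        # (e.g. one rebuilt from an updated model definition), and start again.
        self.shutdown()
        if command is not None:
            self.command = list(command)
        return self.start()


if __name__ == "__main__":
    # A sleeping Python process stands in for the inference framework.
    svc = ModelServiceProcess([sys.executable, "-c", "import time; time.sleep(60)"])
    svc.start()
    svc.restart()
    svc.shutdown()
```

In this sketch the RPC handler would only need to call `restart()`, passing a new command when the start command has changed.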

Impact  

Testing Scenarios  

  • Trigger a restart of the model service process and check that the inference worker loads as before
  • Repeat step 1, this time after modifying the model-definition.yaml file. Check that the change is reflected in the runner.
@kyujin-cho kyujin-cho self-assigned this Jan 5, 2025