[db] Request database optimizations #7602

SeungjinYang · 2025-10-13T19:05:51Z

The PR introduces a series of changes to make accesses to request DB more efficient.

General changes

~~The database is given an index on created_at. This is because several queries in requests.py have ORDER BY created_at statements, which can be accelerated by an index.~~
RequestTaskFilter now has a sort parameter that can toggle the inclusion of ORDER BY created_at statement. This allows callers that do not need the results to be sorted to benefit from not sorting the result.
RequestTaskFilter had fields parameter introduced 2 days ago. This PR adds appropriate fields parameter to callers that do not need all of the parameters (especially request / response bodies) to perform their tasks.
get_request_tasks_with_fields_async is merged with get_request_tasks_async, allowing the latter function to handle an optional fields parameter if provided. Similarly, get_request_tasks is modified to handle an optional fields parameter.
exact_match fields are added to some queries that acts on a request given a request_id. Since the API server wants to handle clients submitting a request ID prefix, query functions use WHERE request_id LIKE <prefix>% statement to handle prefixes. However, in cases where we know an exact request ID is supplied, using WHERE request_id = <id> is more efficient.

Case studies of specific codepaths

sky api cancel -a:

Uses kill_requests. Since no request IDs are specified, the request IDs are retrieved from DB. This DB call now only returns request IDs (instead of whole requests) and does not sort. I expect this to be the bulk of the efficiency improvement.
Since the request IDs are retrieved from DB, we can set exact_match to True on update_request. This uses an exact match query (WHERE request_id = <id> instead of WHERE request_id LIKE <prefix>%) making the operation more efficient.

sky logs

uses _tail_log_file. We now establish an exact_request_id at the start of _tail_log_file, and use exact match query making the operation more efficient.

Tested (run the relevant ones):

Code formatting: install pre-commit (auto-check on commit) or bash format.sh
Any manual or new tests for this PR (please specify below)
All smoke tests: /smoke-test (CI) or pytest tests/test_smoke.py (local)
Relevant individual tests: /smoke-test -k test_name (CI) or pytest tests/test_smoke.py::test_name (local)
Backward compatibility: /quicktest-core (CI) or pytest tests/smoke_tests/test_backward_compat.py (local)

SeungjinYang · 2025-10-13T23:06:17Z

/quicktest-core --base-branch v0.10.1
/smoke-test --kubernetes --no-resource-heavy

SeungjinYang · 2025-10-14T00:26:17Z

/smoke-test --aws --no-resource-heavy

SeungjinYang · 2025-10-14T15:25:01Z

/quicktest-core --base-branch v0.10.1
/smoke-test --kubernetes --no-resource-heavy
/smoke-test --aws --no-resource-heavy

SeungjinYang · 2025-10-14T19:10:39Z

There's a lot of stuff going on in this PR, so I'll separate the changes out and submit incremental PRs as recommended by @rohansonecha

optimization2 consolidate funcs more consolidation more optimizations add index fix unit tests address TODO cancel optimizations 1 revert count testfix

SeungjinYang force-pushed the request-db-optimizatons branch 2 times, most recently from 491bd65 to fe85ad9 Compare October 13, 2025 23:02

SeungjinYang marked this pull request as ready for review October 13, 2025 23:21

SeungjinYang requested review from cg505 and rohansonecha October 14, 2025 00:22

SeungjinYang requested a review from kyuds October 14, 2025 00:52

SeungjinYang force-pushed the request-db-optimizatons branch from fe85ad9 to 2df7191 Compare October 14, 2025 15:24

SeungjinYang marked this pull request as draft October 16, 2025 19:02

SeungjinYang mentioned this pull request Oct 16, 2025

[db] Add indices on requests db columns #7642

Merged

5 tasks

SeungjinYang force-pushed the request-db-optimizatons branch 5 times, most recently from bc520c8 to baf7335 Compare October 19, 2025 02:10

optimize delete requests

e26f57e

optimization2 consolidate funcs more consolidation more optimizations add index fix unit tests address TODO cancel optimizations 1 revert count testfix

SeungjinYang force-pushed the request-db-optimizatons branch from baf7335 to e26f57e Compare October 19, 2025 19:26

SeungjinYang closed this Oct 20, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[db] Request database optimizations #7602

[db] Request database optimizations #7602

Uh oh!

SeungjinYang commented Oct 13, 2025 •

edited

Loading

Uh oh!

SeungjinYang commented Oct 13, 2025

Uh oh!

SeungjinYang commented Oct 14, 2025

Uh oh!

SeungjinYang commented Oct 14, 2025

Uh oh!

SeungjinYang commented Oct 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

[db] Request database optimizations #7602

[db] Request database optimizations #7602

Uh oh!

Conversation

SeungjinYang commented Oct 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

SeungjinYang commented Oct 13, 2025

Uh oh!

SeungjinYang commented Oct 14, 2025

Uh oh!

SeungjinYang commented Oct 14, 2025

Uh oh!

SeungjinYang commented Oct 14, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

SeungjinYang commented Oct 13, 2025 •

edited

Loading