enhance: Log error instead of panicking if load lock wait timeout #39308

congqixia
Contributor

Related to #39205
Previous PR #39206

This PR changes the wait-timeout behavior to log an error and return instead of panicking, so that a deadlock affecting only some collections does not cause reads on other collections to fail.
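
As a rough illustration of the behavior described above, here is a minimal sketch of a wait-with-timeout that logs and returns an error rather than panicking. It is not the code from this PR: `stateLock`, `waitUntilUnpinned`, `errWaitTimeout`, and `demoTimeout` are hypothetical names, and the condition-variable design is an assumption made for the example.

```go
package main

import (
	"errors"
	"fmt"
	"log"
	"sync"
	"time"
)

// errWaitTimeout is a hypothetical sentinel error, not an actual Milvus error.
var errWaitTimeout = errors.New("load state lock: wait timed out")

// demoTimeout is an arbitrary value chosen for this sketch.
const demoTimeout = 50 * time.Millisecond

// stateLock is a simplified, hypothetical stand-in for a load state lock.
type stateLock struct {
	mu     sync.Mutex
	cv     *sync.Cond
	refCnt int // number of readers currently pinning the segment
}

func newStateLock() *stateLock {
	l := &stateLock{}
	l.cv = sync.NewCond(&l.mu)
	return l
}

func (l *stateLock) pin() {
	l.mu.Lock()
	l.refCnt++
	l.mu.Unlock()
}

func (l *stateLock) unpin() {
	l.mu.Lock()
	l.refCnt--
	l.mu.Unlock()
	l.cv.Broadcast()
}

// waitUntilUnpinned blocks until no reader holds a pin or the timeout elapses.
// On timeout it logs and returns an error instead of panicking, so the caller
// can give up on this one segment without taking the whole process down.
func (l *stateLock) waitUntilUnpinned(timeout time.Duration) error {
	deadline := time.Now().Add(timeout)
	// Wake any waiter once the deadline passes so it can observe the timeout.
	timer := time.AfterFunc(timeout, func() {
		l.mu.Lock()
		defer l.mu.Unlock()
		l.cv.Broadcast()
	})
	defer timer.Stop()

	l.mu.Lock()
	defer l.mu.Unlock()
	for l.refCnt > 0 {
		if !time.Now().Before(deadline) {
			log.Printf("wait timed out with %d pins still held", l.refCnt)
			return errWaitTimeout
		}
		l.cv.Wait()
	}
	return nil
}

func main() {
	l := newStateLock()
	l.pin() // never unpinned: simulates a stuck reader
	if err := l.waitUntilUnpinned(demoTimeout); err != nil {
		fmt.Println("release skipped:", err)
	}
}
```

With a panic in place of the `return errWaitTimeout`, one stuck segment would crash the node and fail reads for every collection it serves; returning the error keeps the failure local to the affected segment.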

Signed-off-by: Congqi Xia <[email protected]>
@sre-ci-robot sre-ci-robot added approved size/M Denotes a PR that changes 30-99 lines. labels Jan 15, 2025
@mergify mergify bot added dco-passed DCO check passed. kind/enhancement Issues or changes related to enhancement labels Jan 15, 2025
Contributor

mergify bot commented Jan 15, 2025

@congqixia The E2E Jenkins job failed; comment /run-cpu-e2e to trigger the job again.

@xiaofan-luan
Collaborator

/lgtm
/approve

@sre-ci-robot
Contributor

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: congqixia, xiaofan-luan

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@congqixia
Contributor Author

/run-cpu-e2e

codecov bot commented Jan 15, 2025

Codecov Report

Attention: Patch coverage is 55.55556% with 4 lines in your changes missing coverage. Please review.

Project coverage is 81.04%. Comparing base (657550c) to head (3408d2f).
Report is 2 commits behind head on master.

| Files with missing lines | Patch % | Lines |
|---|---|---|
| internal/querynodev2/segments/segment.go | 0.00% | 4 Missing ⚠️ |

Additional details and impacted files

@@             Coverage Diff             @@
##           master   #39308       +/-   ##
===========================================
+ Coverage   69.63%   81.04%   +11.40%     
===========================================
  Files         299     1406     +1107     
  Lines       26807   198421   +171614     
===========================================
+ Hits        18668   160806   +142138     
- Misses       8139    31945    +23806     
- Partials        0     5670     +5670     

| Components | Coverage Δ |
|---|---|
| Client | 79.50% <ø> (∅) |
| Core | 69.63% <ø> (ø) |
| Go | 82.95% <55.55%> (∅) |

| Files with missing lines | Coverage Δ |
|---|---|
| ...rnal/querynodev2/segments/state/load_state_lock.go | 82.78% <100.00%> (ø) |
| internal/querynodev2/segments/segment.go | 67.67% <0.00%> (ø) |

... and 1105 files with indirect coverage changes

sre-ci-robot pushed a commit that referenced this pull request Jan 15, 2025
Cherry-pick from master
pr: #39206 #39308
Related to #39205

This PR merges `RLock` and `PinIfNotReleased` into a single `PinIf` function,
preventing a segment from being released before in-flight read operations finish.

---------

Signed-off-by: Congqi Xia <[email protected]>
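
The idea of folding the state check and the pin into one call, as the cherry-picked commit message describes, can be sketched as follows. This is a hedged illustration only: the actual signature of `PinIf` in `load_state_lock.go` is not shown in this conversation, and `segmentLock`, `pinIf`, `tryRelease`, and `notReleased` are hypothetical names invented for the example.

```go
package main

import (
	"fmt"
	"sync"
)

// loadState is a simplified, hypothetical state enum for this sketch.
type loadState int

const (
	stateLoaded loadState = iota
	stateReleased
)

// segmentLock is a hypothetical stand-in, not the Milvus LoadStateLock type.
type segmentLock struct {
	mu     sync.Mutex
	state  loadState
	refCnt int
}

// pinIf checks the predicate and takes the pin under the same lock, so a
// release cannot slip in between the check and the reference count increment.
func (l *segmentLock) pinIf(pred func(loadState) bool) bool {
	l.mu.Lock()
	defer l.mu.Unlock()
	if !pred(l.state) {
		return false
	}
	l.refCnt++
	return true
}

func (l *segmentLock) unpin() {
	l.mu.Lock()
	defer l.mu.Unlock()
	l.refCnt--
}

// tryRelease marks the segment released only when no reader holds a pin.
func (l *segmentLock) tryRelease() bool {
	l.mu.Lock()
	defer l.mu.Unlock()
	if l.refCnt > 0 {
		return false
	}
	l.state = stateReleased
	return true
}

func notReleased(s loadState) bool { return s != stateReleased }

func main() {
	l := &segmentLock{}
	fmt.Println(l.pinIf(notReleased)) // true: segment is loaded, pin taken
	fmt.Println(l.tryRelease())       // false: a reader still holds a pin
	l.unpin()
	fmt.Println(l.tryRelease())       // true: no pins remain
	fmt.Println(l.pinIf(notReleased)) // false: segment already released
}
```

Doing the check and the pin as two separate calls would leave a window in which a release could complete after the check passed but before the pin was taken; a single critical section closes that window.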
Contributor

mergify bot commented Jan 15, 2025

@congqixia The E2E Jenkins job failed; comment /run-cpu-e2e to trigger the job again.

@congqixia
Contributor Author

/run-cpu-e2e

@mergify mergify bot added the ci-passed label Jan 15, 2025
@sre-ci-robot sre-ci-robot merged commit 57e5652 into milvus-io:master Jan 15, 2025
19 of 20 checks passed
gifi-siby pushed a commit to gifi-siby/milvus that referenced this pull request Jan 16, 2025
…lvus-io#39308)

Related to milvus-io#39205
Previous PR milvus-io#39206

This PR changes the wait-timeout behavior to log an error and return
instead of panicking, so that a deadlock affecting only some collections
does not cause reads on other collections to fail.

Signed-off-by: Congqi Xia <[email protected]>
Labels
approved ci-passed dco-passed DCO check passed. kind/enhancement Issues or changes related to enhancement lgtm size/M Denotes a PR that changes 30-99 lines.