Update retry and timeout logic around Publish / Unpublish requests #23

DMajrekar · 2024-06-05T06:44:43Z

When working with 150+ volumes publishing / unpublishing at the same time, the current timeout logic causes grpc timeout errors to be seen by the csi-attacher process

This PR updates the timeout down to 5 seconds (from effectively 100s in waitForVolumeStatus) and hands off retry logic to the csi-attacher process.

In addition, the fixes a few logic bugs around unpublish when NodeID is passed as an arg in NodeUnpublishVolumeRequest and set to a different node to the currently attached node. This should safely clear out dangling VolumeAttachments.

DMajrekar added 8 commits May 30, 2024 21:03

updates

739bd82

fix

1cc0694

updates

584147a

Fixes

9e7cc9e

Fixes

a5309e9

fix

2a309e5

tidy changes

8b7b06b

Fix

5b74ed2

RealHarshThakur approved these changes Jun 5, 2024

View reviewed changes

DMajrekar merged commit 3d6898a into master Jun 5, 2024
4 checks passed

DMajrekar deleted the dm-fix branch June 5, 2024 07:01

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update retry and timeout logic around Publish / Unpublish requests #23

Update retry and timeout logic around Publish / Unpublish requests #23

DMajrekar commented Jun 5, 2024

Update retry and timeout logic around Publish / Unpublish requests #23

Update retry and timeout logic around Publish / Unpublish requests #23

Conversation

DMajrekar commented Jun 5, 2024