Preheating a 20G image is prone to failure,"error decoding response body" #3675

Open
WaterAndBread opened this issue Nov 28, 2024 · 4 comments
Labels: enhancement (New feature or request)

Comments

@WaterAndBread

config:
seedClient resources: requests 2c4G, limits 8c16G; version v0.1.118
client resources: requests 1c2G, limits 4c8G; version v0.1.118
client download rateLimit: 20GiB; upload rateLimit: 20GiB

problem 1
When preheating an image of about 20G, the manager console web UI frequently shows "rpc error: code = Internal desc = error decoding response body".

The preheat type is single_preheat.
Dragonfly was installed via Helm, version 1.2.23; Dragonfly version v2.1.63.

The seedClient error log is:
":download:download_partial_from_source:download_from_source:download_piece_from_source_finished:write_piece: dragonfly-client-storage/src/content.rs:317: copy "/var/lib/dragonfly/content/tasks/a7b/a7bb6d3a74392ff0e00f158f6639953f2f00ad8a65d367a0837d04ca6a98f6f8" failed:"
download_task:download:download_partial_from_source: [dragonfly-client/src/resource/task.rs:1594]
See the attached error log: problem1-dfdaemon.log

problem 2
Once problem 1 occurs, later preheat attempts seem to keep failing.

After preheating the 20G image, a pod still needs about 11 minutes to reach Running, which is longer than without preheating. Is there any configuration that can be changed to improve this?
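
For scale, here is a rough calculation of the average throughput that 11 minutes implies. It is only a sketch under two assumptions that are not confirmed above: that "20G" means 20 GiB, and that the whole 11 minutes is spent transferring data (in practice part of it is layer extraction and pod startup). The function name is just for illustration.

```python
def throughput_mib_s(size_gib, minutes):
    """Average throughput in MiB/s for moving size_gib GiB in the given minutes."""
    return size_gib * 1024 / (minutes * 60)


# 20 GiB in 11 minutes
print(f"{throughput_mib_s(20, 11):.1f} MiB/s")  # prints "31.0 MiB/s"
```

That is far below the configured 20GiB rateLimit, so the rate limit itself is unlikely to be the bottleneck.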

@WaterAndBread
Author

I enabled debug logging; I hope that helps.

Uploading dfdaemon-debug.zip…

@gaius-qi
Member

gaius-qi commented Nov 28, 2024


I think it is a network problem. The connection is broken while reading a piece. You need to check the storage service.

@gaius-qi gaius-qi self-assigned this Nov 28, 2024
@gaius-qi gaius-qi added the enhancement New feature or request label Nov 28, 2024
@gaius-qi gaius-qi modified the milestone: v2.2.0 Nov 28, 2024
@WaterAndBread
Author

> I think it is a network problem. The connection is broken while reading a piece. You need to check the storage service.

A normal image download works, and only the preheat fails. Is there a command in Dragonfly that can confirm the network problem, so that I can test it separately? Thanks.
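
Independently of Dragonfly, the path to the source can be checked by streaming one large blob to the end and seeing whether the body arrives complete. The following is a minimal sketch of such a check, not a Dragonfly command. It uses only the Python standard library; `drain` and `check_url` are hypothetical helper names, and the URL and headers passed to `check_url` are placeholders to fill in for your own registry or storage service.

```python
import http.client
import urllib.request


def drain(stream, expected_len, chunk_size=4 * 1024 * 1024):
    """Read stream to EOF in chunks and return (bytes_read, error_or_None)."""
    total = 0
    try:
        while True:
            chunk = stream.read(chunk_size)
            if not chunk:
                break
            total += len(chunk)
    except (OSError, http.client.HTTPException) as exc:
        # Connection reset, timeout, or incomplete read in the middle of the body.
        return total, f"read failed after {total} bytes: {exc}"
    if expected_len is not None and total != expected_len:
        # The body ended early without an explicit error.
        return total, f"short body: got {total} of {expected_len} bytes"
    return total, None


def check_url(url, headers=None, timeout=60):
    """Stream url without storing it and compare against Content-Length if present."""
    req = urllib.request.Request(url, headers=headers or {})
    with urllib.request.urlopen(req, timeout=timeout) as resp:
        length = resp.headers.get("Content-Length")
        return drain(resp, int(length) if length else None)
```

Running `check_url` a few times against the largest layer blob, from the same node as the seedClient, would show whether the connection also breaks outside of preheating. If it completes every time, the problem is more likely on the Dragonfly side than in the network or storage service.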

@gaius-qi
Member

gaius-qi commented Dec 4, 2024

@WaterAndBread I can't reproduce it locally. Is there a public container registry I can use to reproduce it?

2 participants