Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

请问8张A100 80GB怎么设置参数 #30

Open
HypherX opened this issue Nov 6, 2023 · 1 comment
Open

请问8张A100 80GB怎么设置参数 #30

HypherX opened this issue Nov 6, 2023 · 1 comment
Assignees

Comments

@HypherX
Copy link

HypherX commented Nov 6, 2023

您好,感谢您的工作。我想请问一下8张A100 80GB上微调flan-t5-11B原论文是如何设置各项参数的。例如deepspeed选择什么模式,batch_size等等参数

@nitwtog
Copy link
Collaborator

nitwtog commented Nov 8, 2023

deepspeed使用zero3, 设置每张卡batch_size为2,梯度累计到为16,训练5个epoch

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants