How much GPU memory does fine-tuning a large model require? #51

Open
Crayon-s7 opened this issue Apr 20, 2023 · 3 comments

Comments

@Crayon-s7

No description provided.

@HarderThenHarder
Owner

It depends on your `max_source_length` and `max_target_length` settings.
In my experiments, a full training sentence (source + target) of 800 tokens in total needed roughly 31 GB of GPU memory (on a V100).
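
To make the dependence on sequence length concrete, here is a rough back-of-the-envelope sketch, not a measurement from this repo. It splits training memory into a part that does not change with sequence length (weights, gradients, optimizer state) and treats whatever remains of a measured total as the part driven by `source + target` length. The function names, the byte sizes (fp16 weights and gradients, two fp32 Adam states) and the parameter count in the usage lines are all assumptions chosen for illustration.

```python
GIB = 1024 ** 3


def fixed_memory_gib(num_params, trainable_fraction=1.0,
                     weight_bytes=2, grad_bytes=2, optimizer_bytes=8):
    """Estimate memory that does not depend on sequence length.

    Assumed defaults (hypothetical): fp16 weights and gradients (2 bytes each)
    and two fp32 Adam states (8 bytes) per trainable parameter.
    """
    weights = num_params * weight_bytes
    trainable = num_params * trainable_fraction
    grads = trainable * grad_bytes
    optimizer = trainable * optimizer_bytes
    return (weights + grads + optimizer) / GIB


def sequence_dependent_gib(measured_total_gib, fixed_gib):
    """Whatever a measured total leaves after the fixed part.

    This remainder is dominated by activations, which grow with
    batch size and with max source + target length.
    """
    return max(measured_total_gib - fixed_gib, 0.0)


# Hypothetical usage: a 6e9-parameter model with 1% of parameters trainable.
fixed = fixed_memory_gib(6_000_000_000, trainable_fraction=0.01)
remainder = sequence_dependent_gib(31.0, fixed)
print(f"fixed ~ {fixed:.1f} GiB, sequence-dependent ~ {remainder:.1f} GiB")
```

Only the second number shrinks when the length limits are lowered. If the fixed part alone is close to a card's capacity, reducing the sequence length will not be enough.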

@cheney369

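A question: if I have two 16 GB cards (T4), how should I run this experiment? I tried the multi_gpu method, and every attempt failed with an out-of-memory error.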

@Tungsong

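> A question: if I have two 16 GB cards (T4), how should I run this experiment? I tried the multi_gpu method, and every attempt failed with an out-of-memory error.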

@HarderThenHarder I have the same GPUs. Even with the length lowered to 200, both cards still run out of memory. Same question here.
