a bug to use 16 * A10(16 * 23g) to inference llama2-70b #4

babytdream · 2023-08-25T02:22:55Z

I have 16 GPUs in one machine.

Hello! When I use 16 * A10 (16 * 23 GB) to run inference on llama2-70b, I get an error:

I have asked many people for help with this problem, but nobody could solve it.
I know it works with 8 GPUs! But I need to make the llama2 prompt longer, and 8 GPUs are not enough for that.
Do you have any ideas? Thanks!
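
For context, here is a rough estimate of the memory involved. It is only a sketch under assumptions that are not measured on this machine: exactly 70e9 parameters, fp16 weights (2 bytes per parameter), and 1 GB counted as 1e9 bytes. The `headroom_gb` helper is a hypothetical name used for illustration.

```python
# Back-of-the-envelope memory budget; every constant here is an assumption.
PARAMS = 70e9          # assumed parameter count for llama2-70b
BYTES_PER_PARAM = 2    # assumed fp16/bf16 weights
GPU_MEM_GB = 23        # memory per A10, as reported in this issue


def headroom_gb(num_gpus: int) -> float:
    """Memory left after loading the weights, in GB (1 GB = 1e9 bytes)."""
    weights_gb = PARAMS * BYTES_PER_PARAM / 1e9
    return num_gpus * GPU_MEM_GB - weights_gb


for n in (8, 16):
    print(f"{n} GPUs: about {headroom_gb(n):.0f} GB left for KV cache and activations")
```

Under these assumptions the weights alone take about 140 GB, so 8 GPUs leave little room for a longer prompt, which is why I want to use all 16.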

babytdream changed the title ~~a buig to use 16 * A10(16 * 23g) to inference llama2-70b~~ a bug to use 16 * A10(16 * 23g) to inference llama2-70b Aug 25, 2023
