About Training Settings #2

baifanxxx · 2024-10-28T05:43:47Z

Hi,

Thank you for your great work for the community. Could you share the settings used to train VILA-U, for example the GPU type, the number of GPUs, and the training time in days?

Best regards,
BAI Fan

zhuoyang20 · 2024-10-29T21:00:24Z

Hi @baifanxxx,

Thank you for your interest!

We trained VILA-U on 16 nodes of A100 GPUs for around 20K GPU hours. This includes training the unified vision tower, multi-modal pre-training, and SFT. Please feel free to reach out if you have additional questions.

Best,
Zhuoyang
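
For readers who want a figure in days: the reply gives the node count and total GPU hours but not the number of GPUs per node. A minimal sketch of the conversion, assuming 8 GPUs per node (a common A100 configuration, but an assumption here, not something stated in the thread) and treating the helper name `wall_clock_days` as purely illustrative:

```python
def wall_clock_days(gpu_hours: float, nodes: int, gpus_per_node: int) -> float:
    """Convert a total GPU-hour budget into approximate wall-clock days.

    Assumes every GPU is busy for the whole run, so the result is a lower
    bound on the real elapsed time.
    """
    total_gpus = nodes * gpus_per_node
    return gpu_hours / total_gpus / 24


# 20K GPU hours and 16 nodes come from the reply above;
# gpus_per_node=8 is an assumption, not a confirmed detail.
days = wall_clock_days(gpu_hours=20_000, nodes=16, gpus_per_node=8)
print(f"{days:.1f} days")  # prints "6.5 days"
```

Under that assumption the full pipeline (vision tower, pre-training, and SFT) would take roughly a week of wall-clock time; with a different number of GPUs per node the estimate scales inversely.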

John-Ge · 2024-11-21T12:59:47Z

I am not sure whether the details of the SFT data are in the paper; I could not find them. Could this be released? Thank you!
