[Question] How do you fine-tune LLaVA-NeXT on video data? #1795

DrVictorBenjamin · 2024-12-11T04:21:47Z

Question

I have a collection of videos and annotations. How do I fine-tune one of the LLaVA-NeXT models? I see the instructions for how to do so with traditional LLaVA but the directions for LLaVA-NeXT with video data are unclear. Thank you very much

DrVictorBenjamin · 2024-12-11T05:21:18Z

Ay after spending some time digging around, I came across this tutorial in case anyone else is searching for an answer: https://github.com/NielsRogge/Transformers-Tutorials/blob/master/LLaVA-NeXT-Video/Fine_tune_LLaVa_NeXT_Video_with_HFTrainer.ipynb

I haven't tried it yet but I will

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Question] How do you fine-tune LLaVA-NeXT on video data? #1795

[Question] How do you fine-tune LLaVA-NeXT on video data? #1795

DrVictorBenjamin commented Dec 11, 2024

DrVictorBenjamin commented Dec 11, 2024

[Question] How do you fine-tune LLaVA-NeXT on video data? #1795

[Question] How do you fine-tune LLaVA-NeXT on video data? #1795

Comments

DrVictorBenjamin commented Dec 11, 2024

Question

DrVictorBenjamin commented Dec 11, 2024