Did you use the temporal multi-frame inputs when finetuning? #2

SeaBird-Go · 2024-10-09T12:48:52Z

Hi, thanks for sharing this wonderful work. Since you use the multi-frame multi-view inputs during pretraining stage, I want to know whether did you still use the temporal multi-frame inputs during fine-tune stage?

If you did not use the temporal multi-frame inputs in the downstream tasks, did it mean you discard the voxel decoder in the finetune stage and only load the pre-trained voxel encoder?

Doctor-James · 2024-10-18T07:06:12Z

Thank you for your interest in our work. Whether we used temporal multi-frame inputs during fine-tuning depended on whether the methods we were comparing against did so. You can roughly understand it as us providing a pre-trained backbone (such as ResNet50), and during fine-tuning, we adopted exactly the same training strategy as the baseline.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Did you use the temporal multi-frame inputs when finetuning? #2

Did you use the temporal multi-frame inputs when finetuning? #2

SeaBird-Go commented Oct 9, 2024

Doctor-James commented Oct 18, 2024

Did you use the temporal multi-frame inputs when finetuning? #2

Did you use the temporal multi-frame inputs when finetuning? #2

Comments

SeaBird-Go commented Oct 9, 2024

Doctor-James commented Oct 18, 2024