Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

some questions about freezing training #39

Open
yan1617262965 opened this issue May 15, 2023 · 1 comment
Open

some questions about freezing training #39

yan1617262965 opened this issue May 15, 2023 · 1 comment

Comments

@yan1617262965
Copy link

Hello author, I have some questions about freezing training and hope to get your reply
I noticed that you mentioned freezing the backbone network and detector, I want to ask what the specific purpose of this is, in order to understand that you can give some specific instructions for freezing the backbone and detector, when reading the code I found that the backbone is included in the model, what is the purpose of freezing the backbone in this case

@davidnvq
Copy link
Owner

Thanks for your question.
There are two main purposes:

  1. To show that trade-off between finetuning the entire model (including the backbone) and finetuning the model partially (excluding the backbone). It is seen that finetuning the entire model improves the overall performance.
  2. Although freezing the backbone decreases the captioning scores, the decrease is acceptable or the overall performance is acceptable (~140 CIDEr). Freezing the backbone also SPEED UP the training phase, which is suitable when you don't have so much compute.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants