differences vs. original paper #102
Comments
Hi, great to see you went deep into the paper.
I guess you can interpret this as x, y being the result of a logistic regression on the center coordinate offset. You
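To make the "logistic regression on the center offset" reading concrete, here is a minimal sketch of the box decoding as formulated in the YOLOv3 paper: the raw outputs t_x, t_y go through a sigmoid and are added to the grid cell offset, while width and height scale an anchor prior. The function and argument names are hypothetical, chosen to mirror the paper's symbols; this is not the repository's code.

```python
import math


def sigmoid(t):
    """Logistic function, squashing a raw prediction into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-t))


def decode_box(t_x, t_y, t_w, t_h, c_x, c_y, p_w, p_h):
    """Decode one raw prediction into a box, in grid-cell units.

    (c_x, c_y) is the top-left corner of the responsible grid cell and
    (p_w, p_h) is the anchor prior. All names are illustrative assumptions.
    """
    b_x = sigmoid(t_x) + c_x   # center x stays inside the cell
    b_y = sigmoid(t_y) + c_y   # center y stays inside the cell
    b_w = p_w * math.exp(t_w)  # width rescales the anchor prior
    b_h = p_h * math.exp(t_h)  # height rescales the anchor prior
    return b_x, b_y, b_w, b_h
```

With all raw outputs at zero, the center lands in the middle of the cell and the box equals the anchor prior.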
@zzh8829 Speaking of which, two questions
Thanks! :)
Hi @zzh8829,
Thanks for your work. I must admit I never dug into the original darknet implementation, but after going back to the original paper I noticed two inconsistencies (?):
box loss: in your code
[image]
[image]
[image]
According to the paper
see
scales of anchors: in your code all anchors are on the same 13x13 grid, but the model has 3 scales. Shouldn't we use finer grids for the smaller anchors (similar to RetinaNet)?
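As a rough illustration of the multi-scale idea in question, the sketch below groups the nine anchor sizes listed in the YOLOv3 paper onto three grids, assuming a 416x416 input and strides of 32, 16 and 8 (giving 13x13, 26x26 and 52x52 grids). The helper name and the sort-by-area grouping are assumptions for illustration, not the repository's implementation.

```python
# Anchor (width, height) pairs in pixels, as listed in the YOLOv3 paper.
ANCHORS = [(10, 13), (16, 30), (33, 23), (30, 61), (62, 45),
           (59, 119), (116, 90), (156, 198), (373, 326)]

# Assumed downsampling strides of the three detection heads.
STRIDES = (32, 16, 8)


def anchors_per_scale(input_size=416):
    """Map each grid size to the three anchors it would be responsible for.

    Larger anchors go to the coarser grid, smaller anchors to the finer one.
    """
    ordered = sorted(ANCHORS, key=lambda wh: wh[0] * wh[1], reverse=True)
    result = {}
    for i, stride in enumerate(STRIDES):
        grid = input_size // stride  # 13, 26, 52 for a 416 input
        result[grid] = ordered[3 * i:3 * i + 3]
    return result


if __name__ == "__main__":
    for grid, anchors in anchors_per_scale().items():
        print(f"{grid}x{grid}: {anchors}")
```

Under these assumptions the smallest anchors end up on the 52x52 grid, which is the finer placement the question asks about.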
Regards,