-
Notifications
You must be signed in to change notification settings - Fork 9
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Over 450 Generated Images. FID 271.254 . What's Wrong.... #11
Comments
I think you misunderstand here. To compute FID, following standard practice, you should generate 50_000 images for statistical significance. |
50_000 is the number of output images that you use your trained model at epoch 450 |
Also, I tried to use the 475.pth file from the original GitHub repository to test, but I found that it only generated one image and couldn't calculate the FID score. Tôi đã thử sử dụng tệp 475.pth từ kho lưu trữ GitHub gốc để kiểm tra, nhưng tôi thấy rằng nó chỉ tạo ra một hình ảnh và không thể tính điểm FID được. |
You‘ve not indicate the appropriate epoch to evaluate the fid of celeba-256-adm. Can you show the number? |
Hi, I'm understanding that you retrain our model and get 9.21. Is it correct ? |
Please note that: our stat file is computed using jpg images. If the generated image is png image, it leads to very high fid. |
I'm sure that I generate jpg images ,because I used your code directly, and I checked that moments ago. Maybe you can provide the pth file, I don't have idea about the concrete epoch to stop, but I'm sure that the outcome of 475 epoch is 9.21. |
I trained the model for 600 epochs and evaluate at 475 for CelebHQ-256 |
I've found that the model has more fluctuation after 500 epochs(just fids), do you think so? |
I began to test DiT. I think that wouldn't cause doubt. |
Yes, the model seems unstable after 500 epoch. In our paper, we use Cosine Learning rate decay and it depends on the total epoch. To be more stable, we suggest to use ema model and you could use ema code from DiT repo. Ema model is more stable and have better FID. Please, consider to use dropout if model converge to fast, you could have a look at https://arxiv.org/pdf/2102.09672 appendix section about overfitting on CIFAR10 |
OK, thanks. I'm trying again with ema. |
When I use your EMA.py, I find that “AttributeError: 'EMA' object has no attribute '_optimizer_state_dict_pre_hooks'. Did you mean: 'register_state_dict_pre_hook'?” What do you mean by "ema code from DiT repo" ? |
Should I use the file DiT/train.py to revise your code? I've made the revision, but I'm not sure about it. Why you have an EMA.py, but I still need to use the ema in DiT? |
Yes, you should use DiT/train.py to revise my code. I found it is more easier and compact when following DiT repo. |
So, by running the code |
Yes, I think you run it correctly, I wonder what environment you use to run model. I found that the architecture is more stable with torch 1.x version. I retrained our model on torch 2.x, the result is around 5.8 to 6.1, same to you. |
Thank you. |
Hi, thanks for your great work. I ran your pre-trained ImageNet kept and reproduced it as 21.01, which is evaluated on 50k samples. |
Hi! I have generated 450 images, and the facial features are already clear. However, I'm not sure why the FID value is 270. Should I keep only the 'model_450.pth' and 'image_epoch_450.png' checkpoints for testing?
Tôi đã tạo ra 450 hình ảnh, và các đặc điểm khuôn mặt đã rõ ràng. Tuy nhiên, tôi không chắc chắn tại sao giá trị FID lại là 270. Tôi có nên chỉ giữ lại các điểm kiểm tra 'model_450.pth' và 'image_epoch_450.png' để kiểm tra không?
The text was updated successfully, but these errors were encountered: