-
Notifications
You must be signed in to change notification settings - Fork 460
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Doubt on Train Prototext in this repo. Anyone can repeat their training ever? #130
Comments
Hi, Thanks for your contribution!
|
@bittnt Hey i was wondering if you could let us know how many iterations it took for you to get your results. I have read the paper a couple of times and i cannot see any mention about the number of iterations you used (If it is mentioned there somewhere and i missed it, i'm very sorry). Also as a side note, i am currently training the crf-rnn on 6 channel satellite images and it is quite the hassle. Is it recommended to train an fcn-8s first on this 6-channel data before plugging in the meanfield iteration layer? I copied weights for the first 3 channels of the fcn-8s caffemodel, then i randomized the last three channels of the model. This new model is what i am using for training/finetuning. Are there any concerns, tips or issues you can give me on this. Thanks!! |
@bittnt
I tried to repeat the training process on PASCAL VOC 2012 (20classes plus background).
Here's my documents: https://github.com/KleinYuan/train-crfasrnn
And here's the prototext I used for training, which is exact same as what you posted in here.
I trained with fcn-8s model and this caffe version and after around 200k iterations. The result was pretty bad (even much worse than fcn-8s) as attached.
Then I thought that I may miss something, so I used this tool to extract info about layers on your pre-trained caffemodel and the result kinda surprised me that I cannot find the
MultiStageMeanfield
as well asmulti_stage_meanfield_param
on 57th layer. Were u training with a different architecture or ?I am quite confused now and it will be great for you to give me some hint and potentially share the actual training prototext?
Note:
At beginning I realized that I used a newer caffe than this repo and therefore I need to add crop_param in here, here and here. After adding those, with your pre-trained model, the demo script output expected images. Therefore, I think the training pipeline and test scripts are ok.
Attached Code, 57th layer of the Caffe -> Json on your pre-trained model:
Where as, here's my 57th layer:
The text was updated successfully, but these errors were encountered: