Cannot reproduce the results reported in the Paper (CD=2.723) #8

Open
AlphaPav opened this issue Jul 29, 2020 · 24 comments

Comments

AlphaPav (Author) commented Jul 29, 2020

> You need to train the whole network with Chamfer Distance first. It reaches a CD of ~0.40 on ShapeNet.
> Then, you need to fine-tune the network with Gridding Loss + Chamfer Distance on the coarse point cloud.
> Finally, you fine-tune the network with Chamfer Distance only. Chamfer Distance is the evaluation metric, so you cannot get a lower CD without using Chamfer Distance as a loss.

Originally posted by @hzxie in #3 (comment)
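Condensed into code, the quoted recipe looks roughly like the sketch below. This is a minimal sketch, not the repository's actual API: train_stage(), chamfer_dist(), gridding_loss(), and the sparse/dense output names are hypothetical stand-ins; only the loss compositions follow the quoted description.

# Stage 1: train the whole network with Chamfer Distance only.
def cd_loss(sparse_pt, dense_pt, gt):
    # CD on both the coarse (sparse) and dense outputs.
    return chamfer_dist(sparse_pt, gt) + chamfer_dist(dense_pt, gt)

train_stage(model, loss_fn=cd_loss)

# Stage 2: fine-tune with Gridding Loss + CD on the coarse point cloud.
def cd_plus_gridding(sparse_pt, dense_pt, gt):
    return cd_loss(sparse_pt, dense_pt, gt) + gridding_loss(sparse_pt, gt)

train_stage(model, loss_fn=cd_plus_gridding)

# Stage 3: fine-tune again with Chamfer Distance only.
train_stage(model, loss_fn=cd_loss)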

hzxie (Owner) commented Jul 29, 2020

So, what's the problem you are facing now?
Please provide more details.

@hzxie hzxie changed the title can't reproduce the 2.723 Chamfer Distance result Cannot reproduce the results reported in the Paper (CD=2.723) Jul 29, 2020
AlphaPav (Author) commented Jul 29, 2020

Hi author, thanks for the amazing work.

With your released pre-trained model, I get an F-score of 0.7082 and a CD of 2.722.
However, when I train from scratch, I run into the problems listed below:

"You need to train the whole network with Chamfer Distance." --- It reaches 4.588 CD, 0.6133 F-score, which is similar with Table 7&Not Used&CD&Complete = 4.460 in your paper.

"Then .. fine-tune the network with Gridding Loss + Chamfer Distance on the Coarse Point Cloud." ---- It reaches 4.536 CD, 0.6255 F-score. It was supposed to be about ~2.7, right?

sparse_loss = chamfer_dist(sparse_ptcloud, data['gtcloud'])  # CD on the coarse output
dense_loss = chamfer_dist(dense_ptcloud, data['gtcloud'])    # CD on the dense output
grid_loss = gridding_loss(sparse_ptcloud, data['gtcloud'])   # Gridding Loss on the coarse output
_loss = sparse_loss + dense_loss + grid_loss                 # total fine-tuning loss

__C.NETWORK.GRIDDING_LOSS_SCALES = [128]
__C.NETWORK.GRIDDING_LOSS_ALPHAS = [0.1]
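
If it helps anyone reading along: the natural reading of those two lists is one gridding term per scale, weighted by the matching alpha, so the config above reduces to a single 128^3 term weighted by 0.1. A sketch, where gridding_distance() is a hypothetical per-resolution helper rather than the repository's actual function:

def multi_scale_gridding_loss(pred, gt, scales=(128,), alphas=(0.1,)):
    # One gridding term per resolution, weighted by the matching alpha.
    total = 0.0
    for scale, alpha in zip(scales, alphas):
        total = total + alpha * gridding_distance(pred, gt, resolution=scale)
    return total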

"Finally, you fine-tune the network with Chamfer Distance." --- the CD didn't decrease below 4.536.

I'm wondering at which steps I'm making mistakes (e.g., the learning rate, or the loss weight of the Gridding Loss)?

AlphaPav (Author) commented

Your processed ShapeNet dataset has 28,974 training samples, while the PCN dataset has 231,792 training samples.

Is it because your provided dataset is incomplete?

hzxie (Owner) commented Aug 5, 2020

@AlphaPav
Sorry for the late reply. I haven't had time to check this issue these days.
But I'm sure there is nothing wrong with the released dataset: 231792 / 28974 = 8, which indicates that there are 8 partial input point clouds for each model in ShapeNet.

AlphaPav (Author) commented Aug 5, 2020

> @AlphaPav
> Sorry for the late reply. I haven't had time to check this issue these days.
> But I'm sure there is nothing wrong with the released dataset: 231792 / 28974 = 8, which indicates that there are 8 partial input point clouds for each model in ShapeNet.

The PCN dataset is about 48 GB, while the released dataset is about 10 GB. Do you mean that you randomly augment each point cloud 8 times during training?

hzxie (Owner) commented Aug 6, 2020

No. I think the difference may be caused by different compression ratios.
You can also generate the ShapeNet dataset from PCN with this script.
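
For readers puzzled by the 48 GB vs. 10 GB gap: compression settings alone can plausibly account for it. A rough sketch of what such a conversion might look like, assuming open3d and h5py; the actual script and the released file layout may differ:

import h5py
import numpy as np
import open3d as o3d

def pcd_to_h5(pcd_path, h5_path):
    # Load a PCN-style .pcd partial scan.
    points = np.asarray(o3d.io.read_point_cloud(pcd_path).points, dtype=np.float32)
    # Store with gzip compression; different compression ratios are one
    # plausible reason the same data occupies far less space in one release.
    with h5py.File(h5_path, 'w') as f:
        f.create_dataset('data', data=points, compression='gzip')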

SarahChane98 commented

Hi! I also cannot reproduce the results. The best (lowest) CD I got across three training runs was 5.2. May I know how many epochs you trained for each round, respectively (i.e., CD only, CD + Gridding Loss, CD only)?

hzxie (Owner) commented Aug 12, 2020

@SarahChane98
I cannot report the exact number of epochs for each round.
For each round, I train several times until the loss no longer decreases.
Try fine-tuning the network again with the previous weights (from the last training run).
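
A minimal sketch of resuming from the previous weights, assuming a checkpoint dict saved with a 'model' key and an already-constructed network; adjust the path and key names to whatever your training script actually saves:

import torch

checkpoint = torch.load('output/checkpoints/ckpt-best.pth')  # hypothetical path
model.load_state_dict(checkpoint['model'])                   # assumed key name

# Rebuild the optimizer and continue training from these weights,
# using the learning rate from your config.
optimizer = torch.optim.Adam(model.parameters(), lr=5e-5)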

Repository owner deleted a comment from AlphaPav Sep 3, 2020
paulwong16 commented Oct 9, 2020

Hi there, I just tested your pretrained model on the test set, and the result is close to the value reported in the paper. However, when I tested on the validation set, it reported a dense CD of around 7.177. I was wondering why there is such a huge gap between the CDs on the val set and the test set?

The pretrained model also gives a dense CD of around 5.087 on the training set (which should be the same as the training dense loss, if I understand correctly).

hzxie (Owner) commented Oct 10, 2020

@paulwong16
If the reported results are correct, one possible reason the pretrained model performs worse on the validation and training sets is that we chose the best model based on the test set rather than the validation or training set.

paulwong16 commented

> @paulwong16
> Because we choose the best model based on the test set instead of the validation set.

But why would the CD on the test set be so much lower than even on the training set?

hzxie (Owner) commented Oct 10, 2020

@paulwong16
Because the pretrained model best fits the distribution of the testing set.
The distributions of the training and validation sets may differ from that of the testing set.

paulwong16 commented

> @paulwong16
> Because the pretrained model best fits the distribution of the testing set.
> The distributions of the training and validation sets may differ from that of the testing set.

Well... I believe the best model should not be chosen according to the test result (it should be chosen according to the validation result). Also, in the best results I could reproduce, the training loss was a little lower than the val and test losses, and the test loss was close to the val loss.

Anyway, thanks for your kind reply. I will keep trying to reproduce the result.

hzxie (Owner) commented Oct 10, 2020

@paulwong16
Yes, choosing models based on the testing set is not a good practice.
For the Completion3D benchmark, the best model is chosen on the validation set (because we don't have the ground truth for the testing set).
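
In other words, model selection should look like the sketch below: track the validation CD each epoch and keep the checkpoint that minimizes it. train_one_epoch(), validate(), and save_checkpoint() are hypothetical helpers:

best_val_cd = float('inf')
for epoch in range(n_epochs):
    train_one_epoch(model, train_loader, optimizer)
    val_cd = validate(model, val_loader)  # mean Chamfer Distance; lower is better
    if val_cd < best_val_cd:
        best_val_cd = val_cd
        save_checkpoint(model, 'ckpt-best-val.pth')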

wangyida commented

@hzxie Hi, I'm wondering how you incorporate gridding_loss in training? I haven't found it in the training script. Thanks!

hzxie (Owner) commented Nov 1, 2020

@wangyida

You can use the Gridding Loss here:

sparse_loss = chamfer_dist(sparse_ptcloud, data['gtcloud'])

when fine-tuning the network.

wangyida commented Nov 1, 2020

@hzxie Thank you, I tried it out and the results seem to follow the expected trend. Thanks for your inspiring work ;)

Lillian9707 commented

Hi, I'm wondering how to fine-tune the network with the previous weights? I've tried the same configuration as in your paper, but my best model gets CD=4.538 and F-Score=6.206, while your pre-trained model gets CD=2.723 and F-Score=7.082.

Also, I checked the log and found that the network had already converged within 20 epochs. Why did you set 150 epochs as the default?

hzxie (Owner) commented Nov 4, 2020

@Lillian9707

In my experiments, the loss continues to decrease after 20 epochs.
Moreover, you need to fine-tune the network with the Gridding Loss.

Lillian9707 commented Nov 10, 2020

Hi, I still cannot reproduce the result. Can you provide more details?

I've tried fine-tuning the framework with the Gridding Loss and a lower learning rate, but the CD and F-score got worse.

hzxie (Owner) commented Nov 10, 2020

@Lillian9707
Keep the learning rate unchanged during fine-tuning.
According to AlphaPav's experimental results above, the CD and F-Score got better after applying the Gridding Loss.

Lillian9707 commented

Thank you for your reply!
But AlphaPav only got 4.536 CD and 0.6255 F-score after fine-tuning, which looks more like stochastic variation than a real improvement.
So the fine-tuning process is to train with 1× CD on both the sparse and dense point clouds + 1× Gridding Loss on the sparse point cloud? And the learning rate is always 5e-5?

Lillian9707 commented Nov 17, 2020

Hi, sorry to bother you. I still cannot reproduce the results in the paper.
I have tried fine-tuning the network several times, including using lr = 5e-5, 1e-5, and 1e-6, using MultiStepLR, and training with CD + Gridding Loss on the sparse or dense cloud, and so on. But the results are always around CD=4.5 and F-Score=6.2. Can you provide more details about fine-tuning?
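
For reference, a MultiStepLR schedule of the kind mentioned above can be set up as follows in PyTorch; the milestones and gamma here are illustrative values, not the paper's, and model/train_one_epoch/n_epochs are stand-ins from the earlier sketches:

import torch
from torch.optim.lr_scheduler import MultiStepLR

optimizer = torch.optim.Adam(model.parameters(), lr=5e-5)
scheduler = MultiStepLR(optimizer, milestones=[50, 100], gamma=0.5)

for epoch in range(n_epochs):
    train_one_epoch(model, train_loader, optimizer)  # hypothetical helper
    scheduler.step()  # decays the LR by gamma at each milestone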

hzxie (Owner) commented Nov 18, 2020

@Lillian9707

Try fine-tuning the network with and without the Gridding Loss several times.
During fine-tuning, the top-10 (not necessarily the best) weights from the previous training should be loaded.
The initial learning rate for fine-tuning should be 1e-4.
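
A sketch of that procedure, assuming the ten checkpoints with the lowest validation CD from the previous run sit in one directory and a fine_tune() helper runs a fine-tuning round and returns the resulting CD; all names here are hypothetical:

import glob
import torch

results = {}
for path in sorted(glob.glob('output/checkpoints/top10/*.pth')):
    checkpoint = torch.load(path)
    model.load_state_dict(checkpoint['model'])  # assumed key name
    # Fine-tune with the suggested initial learning rate of 1e-4.
    results[path] = fine_tune(model, init_lr=1e-4)

best_start = min(results, key=results.get)  # lowest CD wins
print('Best starting checkpoint:', best_start)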

@hzxie hzxie pinned this issue Apr 1, 2021
@hzxie hzxie mentioned this issue Mar 10, 2023