Question when reproducing experiment #10
Hi @g-lopes, thank you for your kind words! I'm glad I can help :) It looks like your plots are converging much more slowly than mine. May I ask exactly which setting you are running? For example, the msmarco or the xorqa dataset? Are you running DSI-QG or the original DSI? How many GPUs are you using, and what batch size?
That was quick :) First, I generated the queries using the following command:
After that, I started training the model running:
I am running DSI-QG with 8 Nvidia A100 GPUs and batch_size equal to 32.
Hi @g-lopes, according to your scripts, I think in step 2 you made the mistake of using Or, you can try our trained cross-lingual QG models, which I recently uploaded to Hugging Face: https://huggingface.co/ielabgroup/xor-tydi-docTquery-mt5-large
😃 Thank you very much for your help.
@ArvinZhuang I've double-checked the scripts that I was using, and I now get much better results. I am trying to convert my hits@10 scores to absolute values so that I can compare my results with the ones in your paper. Can you please explain how the hits scores in Table 1 were computed? I am multiplying my score by the number of documents in the evaluation dataset. In my case, the evaluation dataset has Thanks for your help one more time :)
@g-lopes Hi, the code for computing the hits scores is in this function (Line 43 in 479d8d7).
If you want to run inference with saved model checkpoints, you can try something like this: #1 (comment)
I saw this function. My idea of multiplying by the number of documents in the evaluation dataset came from line 61 (Line 61 in 479d8d7).
Did the results in Table 1 of your paper come directly from this function, or did you also multiply by the size of the evaluation set? Which test dataset did you use to generate these results?
Ah, I see what you mean. Yes, my results come directly from this function. The numbers in the table are simply percentages: you just need to multiply the values returned by the function by 100, not by the size of the evaluation set. @g-lopes
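For anyone following along, here is a minimal sketch of how a hits@k score relates to the percentages in the table. It is not the function from the repository: the name hits_at_k, its arguments, and the toy docids are all assumptions made up for illustration. It only shows that the metric is a fraction of queries, so scaling by 100 (not by the number of documents) gives the percentage.

```python
from typing import Sequence


def hits_at_k(
    ranked_docids: Sequence[Sequence[str]],
    gold_docids: Sequence[str],
    k: int = 10,
) -> float:
    """Return the fraction of queries whose gold docid is in the top-k list.

    Hypothetical helper for illustration only; not the repository's code.
    """
    if len(ranked_docids) != len(gold_docids):
        raise ValueError("need exactly one ranked list per gold docid")
    if not gold_docids:
        return 0.0
    # Count a hit when the gold docid appears among the first k predictions.
    hits = sum(
        1 for ranked, gold in zip(ranked_docids, gold_docids) if gold in ranked[:k]
    )
    return hits / len(gold_docids)


# Toy data (made up): four queries, three ranked docids each.
rankings = [
    ["d3", "d1", "d7"],
    ["d2", "d9", "d4"],
    ["d5", "d6", "d8"],
    ["d0", "d1", "d2"],
]
gold = ["d1", "d4", "d0", "d0"]

score = hits_at_k(rankings, gold, k=3)
# The score is a fraction in [0, 1]; multiply by 100 to report a percentage.
print(f"Hits@3 = {100 * score:.1f}%")
```

Because the score is already averaged over the queries, its value does not need to be rescaled by the dataset size to be comparable across evaluation sets.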
Aha! Great! Thank you very much :D
First of all, I would like to thank you for making the code for your work available, and I would also like to say that I really liked your paper. It is very interesting.
I am currently writing my master's thesis and I would like to use part of your code to build my own mathematical IR system. So, the first thing I've done is to try to run your scripts and see if my results match yours.
So what I did was run the get_data.sh script and then the scripts from steps 2 and 3 of the README file. The model is still training, but I am skeptical about the results I have gotten so far. Can you please confirm whether the graphs below are consistent with your results?
Thank you for your attention :)