EXAQT on CronQuestions dataset #4

Open
apoorvumang opened this issue Apr 4, 2022 · 4 comments

@apoorvumang

Hi Zhen

Have you had any luck running EXAQT on the CronQuestions dataset (https://github.com/apoorvumang/CronKGQA)? If not, what kind of dataset processing do you think is needed to get EXAQT to work on CronQuestions?

Thanks
Apoorv

@zhenjia2017
Owner

Hi Apoorv, I tried EXAQT on CronQuestions last year but did not complete it, partly because the number of CronQuestions questions is large and partly because I was busy with other things. If you are trying EXAQT on CronQuestions, you will need TagMe (I use WAT, an improved TagMe: https://sobigdata.d4science.org/web/tagme/wat-api) and ELQ (https://github.com/facebookresearch/BLINK) for NERD. CLOCQ (https://clocq.mpi-inf.mpg.de/) is used to extract one-hop facts for the NERD entities of each question, and is also used to extract two-hop temporal facts for the completed GST subgraphs. To obtain question-relevant facts, we need to create a training dataset and train the BERT classifier. The pipeline of EXAQT is a little bit long so I think I can restart the work of trying EXAQT on CronQuestions and share the data with you.

@zhenjia2017
Owner

Since CronQuestions dataset has gold topic entities, I think the NERD step can be removed from the EXAQT pipeline.
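
A minimal sketch of that idea, assuming the stage order as read from the comment above. The stage names and the `plan_stages` helper are hypothetical labels for illustration and are not identifiers from the EXAQT code base:

```python
from typing import List

# Illustrative labels for the steps described in this thread (assumed order);
# none of these names come from the EXAQT repository.
STAGES = [
    "nerd",                    # entity linking with WAT / ELQ
    "one_hop_facts",           # CLOCQ: one-hop facts for the question entities
    "relevant_fact_filter",    # BERT classifier for question-relevant facts
    "gst",                     # compute the GST subgraphs
    "two_hop_temporal_facts",  # CLOCQ: two-hop temporal facts for the GSTs
]


def plan_stages(has_gold_entities: bool) -> List[str]:
    """Return the stages to run, skipping NERD when gold topic entities exist."""
    if has_gold_entities:
        return [stage for stage in STAGES if stage != "nerd"]
    return list(STAGES)


if __name__ == "__main__":
    # CronQuestions ships gold topic entities, so NERD is dropped from the plan.
    print(plan_stages(has_gold_entities=True))
```

With gold entities, the one-hop fact retrieval would start directly from the annotated topic entities instead of from NERD output.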

@apoorvumang
Author

Hi Zhen, thanks for the response

> Since CronQuestions dataset has gold topic entities, I think the NERD step can be removed from the EXAQT pipeline.

Yes, we can probably do away with the NERD step in the pipeline.

> The pipeline of EXAQT is a little bit long so I think I can restart the work of trying EXAQT on CronQuestions and share the data with you.

That would be extremely useful! Please let me know if you make any progress on this front. I am also trying to understand the steps of EXAQT, so even data from the intermediate steps would be useful.

@zhenjia2017
Owner

You're welcome. When the pipeline is done, I will let you know.

2 participants