GraphQuestions

GraphQuestions [1] is a QA dataset that specifies many characteristics of each question, including structural complexity, answer cardinality, function, commonness, and paraphrasing. It contains over 5,000 question-logical form pairs, and each question is associated with a ground-truth answer from the knowledge base. The dataset was constructed with a semi-automated framework in which an intermediate graph query representation, automatically generated from a knowledge base, is used to formalize the characteristics of a question. Human annotators converted these graph queries into natural language questions and then produced many more questions with the same meaning by paraphrasing at two levels: entity and question.

This dataset can be downloaded via the link

Leaderboard

| Model / System | Year | Accuracy | F1 | Reported by |
|---|---|---|---|---|
| ChatGPT | 2023 | 53.10 | - | Tan et al. |
| GPT-3.5v3 | 2023 | 47.95 | - | Tan et al. |
| GPT-3.5v2 | 2023 | 40.85 | - | Tan et al. |
| GPT-3 | 2023 | 38.32 | - | Tan et al. |
| FLAN-T5 | 2023 | 32.27 | - | Tan et al. |
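
The leaderboard has Accuracy and F1 columns, and because questions in this dataset vary in answer cardinality, a set-based score is a natural fit. The following is a minimal sketch of how such scores could be computed, assuming each prediction and each ground-truth answer is a list of answer strings. The function names `answer_f1` and `evaluate` and the toy data are hypothetical, and the systems listed above may have been scored with a different definition of accuracy.

```python
def answer_f1(predicted, gold):
    """Set-based F1 between predicted and gold answers for one question."""
    predicted, gold = set(predicted), set(gold)
    # If either side is empty, score 1.0 only when both are empty.
    if not predicted or not gold:
        return 1.0 if predicted == gold else 0.0
    overlap = len(predicted & gold)
    if overlap == 0:
        return 0.0
    precision = overlap / len(predicted)
    recall = overlap / len(gold)
    return 2 * precision * recall / (precision + recall)


def evaluate(predictions, references):
    """Return (exact-match accuracy, macro-averaged F1) as percentages."""
    assert len(predictions) == len(references)
    n = len(references)
    # Exact match: the predicted answer set equals the gold answer set.
    exact = sum(set(p) == set(g) for p, g in zip(predictions, references))
    f1_total = sum(answer_f1(p, g) for p, g in zip(predictions, references))
    return 100.0 * exact / n, 100.0 * f1_total / n


# Hypothetical toy data, not taken from GraphQuestions.
references = [["Paris"], ["A", "B"], ["42"]]
predictions = [["Paris"], ["A", "C"], ["41"]]
accuracy, macro_f1 = evaluate(predictions, references)
```

Under this assumed scoring, a question with several correct answers earns partial F1 credit when only some of them are returned, while exact-match accuracy counts it as wrong.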

References

[1] Su, Yu, Huan Sun, Brian Sadler, Mudhakar Srivatsa, Izzeddin Gür, Zenghui Yan, and Xifeng Yan. On Generating Characteristic-rich Question Sets for QA Evaluation. In Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing, pp. 562-572. 2016.

Go back to the README
