Converting an English dataset to DuEEData #89
Comments
Hi, and thanks for your interest in this project.
|
Here is one of my examples, which I modified to follow the DuEEData format (DuEEData is DuEE-Fin): {
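Understood. You need to reorganize your data into the Doc2EDAG format. For English data, you can refer to the following example (a single record). Note that every span in it is a pair of token indices obtained after tokenizing on whitespace. In addition, if you are using your own data, you need to adjust the template yourself to match your event type schema. For a reference implementation, see this folder: https://github.com/Spico197/DocEE/tree/main/dee/event_types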
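The maintainer's original example record is not reproduced in this thread. As a stand-in, here is a minimal sketch of how whitespace-tokenized span indices could be computed for an argument mention. The helper name `find_token_spans`, the sample sentence, and the end-exclusive index convention are all assumptions made for illustration. They are not taken from DocEE or Doc2EDAG, so check the convention against the actual data files before relying on it.

```python
from typing import List, Tuple


def find_token_spans(sentence: str, mention: str) -> List[Tuple[int, int]]:
    """Return (start, end) token-index pairs where `mention` occurs in `sentence`.

    Both strings are tokenized by splitting on whitespace.
    `end` is exclusive; this is an assumption of the sketch, not a
    confirmed detail of the Doc2EDAG format.
    """
    tokens = sentence.split()
    target = mention.split()
    if not target:
        return []
    spans = []
    # Slide a window of len(target) tokens over the sentence.
    for start in range(len(tokens) - len(target) + 1):
        if tokens[start:start + len(target)] == target:
            spans.append((start, start + len(target)))
    return spans


# Hypothetical sentence, already separated into tokens by spaces.
sentence = "Acme Corp pledged 5 million shares to First Bank ."
print(find_token_spans(sentence, "First Bank"))
```

Because matching is done on whole whitespace-separated tokens, punctuation has to be split off beforehand (as with the final " ." above). Otherwise a token such as "Bank." would not match the mention "Bank".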
|
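Hi, sorry for the late reply. I don't remember the exact numbers, but as far as I recall the model did perform poorly on English. Possible reasons include: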
|
Hello! I have a question. I converted an English dataset to the DuEEData format and set --bert_model='/home/pretrained_model/bert_en' \. In dee_task.py, I changed every self.doc_lang = "zh" to "en" in the following branches:

    elif self.run_mode == "dueefin_wo_tgg":
        self.train_file_name = "dueefin_train_wo_tgg.json"
        self.dev_file_name = "dueefin_dev_wo_tgg.json"
        self.test_file_name = "dueefin_submit_wo_tgg.json"
        self.inference_file_name = "dueefin_submit_wo_tgg.json"
        self.doc_lang = "en"
    elif self.run_mode == "dueefin_w_tgg":
        self.train_file_name = "dueefin_train_w_tgg.json"
        self.dev_file_name = "dueefin_dev_w_tgg.json"
        self.test_file_name = "dueefin_submit_w_tgg.json"
        self.inference_file_name = "dueefin_submit_w_tgg.json"
        self.doc_lang = "en"

After making these changes, I got an error. Is it feasible to run this project on an English dataset that has been converted to the DuEEData format?