Training dependency parser and POS tagger on modified OntoNotes 5.0 #13454
Unanswered
skarokin
asked this question in
Help: Other Questions
Replies: 1 comment 1 reply
-
There's a variety of dependency converters, but we use the ClearNLP one because it makes use of the trace nodes and function tags to get better results. We also aligned the data to raw source texts where available so that we could train the parser on non-sentence-segmented text. The parser doesn't depend on the POS tagger. You can train them separately. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I have OntoNotes 5.0 dataset that I see that spaCy's pretrained dependency parser and POS tagger is trained on. I need to modify this dataset and train both.
I understand that OntoNotes 5.0 is constituency parsed, not dependency parsed. Is there a way to convert between the two? Furthermore, how are there any modifications I need to do on the OntoNotes 5.0 dataset to train the POS tagger?
Finally, to my best beliefs the parser is reliant on the tagger. How can I ensure that in actual usage, my custom tagging model is used when running my own trained parser?
Beta Was this translation helpful? Give feedback.
All reactions