Training NER component using pre-trained tok2vec #13695
Unanswered
mikelgda
asked this question in
Help: Coding & Implementations
Replies: 0 comments
I want to train a NER component from scratch, and I suppose it would be faster to reuse the `tok2vec` component from `en_core_web_lg`. However, I'm having issues defining the configuration file for this.

First, I tried using the recommended configuration from the documentation, but I think this also trains the `tok2vec` component, which includes the static vectors from `en_core_web_lg`.

I also tried modifying the configuration to source the `tok2vec` component from `en_core_web_lg` and then adding a listener for a NER component created from scratch, that is, taking the recommended configuration and replacing the `tok2vec` with the pre-trained component. However, this raises an error, since `components.tok2vec.model.encoder.width` is not accessible when the `tok2vec` component is sourced.

Last, I also tried using the recommended configuration and adding the `tok2vec` to the `frozen_components` and `annotating_components` lists, but I also get an error because the `tok2vec` is not trained, although it has the static vectors from `en_core_web_lg`.

My main question is how to source the `tok2vec` while training a new NER from scratch, without training the `tok2vec`.

Thank you!
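For reference, the setup described above (a sourced, frozen `tok2vec` feeding a new NER through a listener) might be expressed roughly as in the partial config below. This is an untested sketch, not a confirmed solution. It assumes spaCy v3.1 or later (for `annotating_components`), and it assumes that the `tok2vec` in `en_core_web_lg` has an output width of 96; that value should be checked against the installed pipeline's own `config.cfg`. The width is written as a literal because a sourced component has no `model` block in this config to interpolate from. All sections not shown are meant to stay as in the recommended configuration.

```ini
# Partial training config: only the sections that differ from the
# recommended NER configuration are shown.

[components.tok2vec]
# Copy the trained component (including its weights) from the installed
# pipeline instead of creating a new, untrained one.
source = "en_core_web_lg"

[components.ner.model.tok2vec]
@architectures = "spacy.Tok2VecListener.v1"
# ASSUMPTION: 96 is the output width of the sourced tok2vec.
# Verify this against the config.cfg shipped with en_core_web_lg.
width = 96
upstream = "*"

[training]
# Do not update the sourced tok2vec during training ...
frozen_components = ["tok2vec"]
# ... but still run it on each batch so the listener receives its output.
annotating_components = ["tok2vec"]

[initialize]
# ASSUMPTION: the sourced tok2vec uses static vectors, so the new
# pipeline needs the same vectors table it was trained with.
vectors = "en_core_web_lg"
```

The idea is to combine the second and third attempts: the sourced component arrives already trained, so freezing it should no longer leave an uninitialized model, and the hard-coded listener width avoids the interpolation that fails for sourced components.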