pretraining to init_tok2vec (textcat) - ValueError: Cannot deserialize model: mismatched structure #12922
Unanswered
barebra
asked this question in
Help: Model Advice
Replies: 3 comments 6 replies
-
Hi @barebra! Yes, please provide your stack trace/error log and your config file. |
Beta Was this translation helpful? Give feedback.
5 replies
-
Hi @barebra, Sorry that this has been confusing. Just to double check - have you been using the exact same config file for both your pretraining and training steps? You really should only need one file, in which the |
Beta Was this translation helpful? Give feedback.
0 replies
-
Hi Sofie,
yes, I.m using the config.cfg - that was created from spaCy during pretraining (based upon the config_pretrain.cfg) and is stored in the "pretraining" directory together with the model.bins and the log.json - for training.
Yesterday I tried a "positive control" for the pretraining2training task, and it worked: I used the word vectors from spaCy's en_core_web_lg (300 dimensions) in combination with default settings of pretraining (e.g. spacy.PretrainCharacters.v1, hidden_size=300 etc.).
The word vectors I would like to use are from en_core_sci_lg (Allen AI, 200 dimensions). So, I assume I need to change hidden_size to 200 as well? Correct?
But, using the "positive control", I will also change certain parameters, e.g. pretraining.component and pretraining.layer, trying to reproduce the error.
|
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm currently pretraining with 170.000 PubMed-abstracts (component: "textcat", layer: "tok2vec"). Epoch-loss is still decreasing and I just wanted to run a first pilot-experiment (train/evaluate) with an intermediate model (model75.bin). I learned from this forum that the config for training has to resemble the one of pretraining, so I copy/pasted the [pretraining]-section from the config_pretrain.cfg to the config.cfg and started spacy train.
Anyway, there seems to be a problem with mismatched structure (see title). Can you help me? Do you need more information (error log, config.cfg)?
Beta Was this translation helpful? Give feedback.
All reactions