Evaluating word embeddings #1265

Answered by zphang
osotsia asked this question in Q&A
Jan 10, 2021 · 1 comment · 2 replies

Hi osotsia,

My sense is that jiant might not be the appropriate library for your use-case:

  • If you have a set of newly-trained word embeddings, they likely have a specific tokenization that differs from that of the models we currently support.
  • We also do not currently support models such as SVMs and random forests; jiant primarily supports transformer-based models from the transformers library.

Instead, I would recommend you look into the following:

  • Use Hugging Face's datasets library to obtain task data as well as metrics
  • Use scikit-learn's suite of machine learning models for operating on the embeddings
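
To make the second suggestion concrete, here is a minimal sketch of one way to evaluate word embeddings with scikit-learn. Everything in it is an assumption for illustration: the toy `embeddings` table stands in for your own trained vectors, the hard-coded sentences stand in for task data you would obtain through the datasets library, and mean-pooling the word vectors into one feature vector per sentence is just one simple choice, not something jiant or either library prescribes.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.svm import SVC

DIM = 16
rng = np.random.default_rng(0)

# Hypothetical toy embedding table; replace with your own trained vectors.
positive = ["good", "great", "fine"]
negative = ["bad", "awful", "poor"]
neutral = ["movie", "plot", "acting"]
embeddings = {w: rng.normal(size=DIM) for w in positive + negative + neutral}
# Shift one dimension so the toy vectors carry a sentiment signal.
for w in positive:
    embeddings[w][0] += 3.0
for w in negative:
    embeddings[w][0] -= 3.0


def featurize(sentence):
    """Mean-pool the embeddings of whitespace tokens found in the table."""
    vectors = [embeddings[t] for t in sentence.lower().split() if t in embeddings]
    if not vectors:
        # No known tokens: fall back to a zero vector.
        return np.zeros(DIM)
    return np.mean(vectors, axis=0)


# Hypothetical task data (1 = positive, 0 = negative); a real evaluation
# would use a proper train/test split from an actual dataset.
train_texts = ["good movie", "great plot", "fine acting",
               "bad movie", "awful plot", "poor acting"]
train_labels = [1, 1, 1, 0, 0, 0]
test_texts = ["great movie", "awful acting"]
test_labels = [1, 0]

X_train = np.stack([featurize(t) for t in train_texts])
X_test = np.stack([featurize(t) for t in test_texts])

models = {
    "svm": SVC(kernel="linear"),
    "random_forest": RandomForestClassifier(n_estimators=50, random_state=0),
}

scores = {}
for name, model in models.items():
    model.fit(X_train, train_labels)
    scores[name] = accuracy_score(test_labels, model.predict(X_test))
    print(f"{name}: accuracy = {scores[name]:.2f}")
```

Because the features are plain NumPy arrays, you can swap in any other scikit-learn estimator or metric without changing the featurization step.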

Answer selected by osotsia