Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
I changed some minor functions (print, range) in the notebook to make it run in a python 3 kernel. Renamed the notebook, so that python 2 and 3 files can co-exist (not sure if that is best practice / your preferred way of doing it, thought this might create the least problems). environment3.yaml contains my conda environment, which has way more than the necessary dependencies, but lists all versions I use.
As I did not have the original yelp data set, I created a mock one to test. However in my case, the cell containing
did not run, as line 431 in gensim\models\word2vec.py (self.build_vocab(sentences, trim_rule=trim_rule)) seem to set self.vocab to an empy dict (or not set it at all) or as trim_rule might not be set. (This is likely a problem with my mock data, and as all cells following this one should run in python 3, I did not look into it further.)