- https://blogs.lse.ac.uk/polis/2022/09/15/bad-will-hunting-the-story-so-far/
- SparkNLP: https://nlp.johnsnowlabs.com/
- Rosette Text Analytics: https://www.rosette.com/capability/entity-extractor/

Tips from Tobias:
- Dimensionality reduction: https://umap-learn.readthedocs.io/en/latest/basic_usage.html
- One-class classification: https://en.wikipedia.org/wiki/One-class_classification
- Get confidence scores for classification?
- Or: fuzz the names by swapping them for other names and see whether it can still disambiguate
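
A rough sketch of the name-swapping idea, in plain Python. Everything here is an assumption for illustration: `swap_names`, `stability`, the toy texts, and the two stand-in labelling functions are hypothetical and are not part of SparkNLP, Rosette, or any other tool linked above. The point is only to show the shape of the check: swap names, re-run whatever does the disambiguation, and measure how often its answer stays the same.

```python
import re
from typing import Callable, Dict, List


def swap_names(text: str, mapping: Dict[str, str]) -> str:
    """Replace whole-word occurrences of each name with its substitute, in one pass."""
    if not mapping:
        return text
    # Longest names first, so a full name is matched before a shorter name inside it.
    names = sorted(mapping, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(n) for n in names) + r")\b")
    return pattern.sub(lambda m: mapping[m.group(1)], text)


def stability(texts: List[str], mapping: Dict[str, str],
              label_fn: Callable[[str], str]) -> float:
    """Fraction of texts whose label is unchanged after the names are swapped."""
    if not texts:
        return 1.0
    unchanged = sum(label_fn(t) == label_fn(swap_names(t, mapping)) for t in texts)
    return unchanged / len(texts)


# Hypothetical stand-ins for a real model: one keys on context, one on the name itself.
def context_based(text: str) -> str:
    return "politician" if "minister" in text else "other"


def name_based(text: str) -> str:
    return "politician" if "Johnson" in text else "other"


TEXTS = [
    "Johnson met the minister on Monday.",
    "Johnson scored twice in the final.",
]
MAPPING = {"Johnson": "Miller"}

print("context-based stability:", stability(TEXTS, MAPPING, context_based))
print("name-based stability:", stability(TEXTS, MAPPING, name_based))
```

A score near 1.0 suggests the labels come from context; a low score suggests the names themselves are being memorised. With a real model, `label_fn` would wrap its prediction call.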