tAPAS

Assisted Parameter optimization by Approximating neighbourhood Similarity

Bayesian optimization of hyperparameters for t-SNE in the context of word embeddings - i. e.: Input is a word embedding (labels + coordinates in high-dimensional space). The optimization procedure samples the parameter space to generate low-dimensional approximations of the original word embeddig data using t-SNE. The quality/truthfulness of the resulting models are evaluated with several metrics:

Trustworthiness: Measure for proportion of points too close together¹ in the low-dimensional space.
Continuity: Measure for proportion of points too far apart¹ in the low-dimensional space.
Generalization: Generalization error of 1-nearest neighbour classifier (e. g. word embedding is clustered in high-dimensional and low-dimensional space - the higher the similarity between the cluster labels, the lower the generalization error).
Relative word embedding quality: QVEC [1] is used to evalute the intrinsic quality of the original word embedding and its dimensionality-reduced projection. The ratio is referred to as 'relative word embedding quality'.

The first three measures were chosen following [2].

¹: In terms of neighbourhood ranks, not absolute distances.

[1] Y. Tsvetkov, M. Faruqui, W. Ling, G. Lample, and C. Dyer, “Evaluation of Word Vector Representations by Subspace Alignment,” 2015, pp. 2049–2054.

[2] L. J. P. van der Maaten, E. O. Postma, and H. J. van den Herik, Dimensionality Reduction: A Comparative Review. 2008.

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
data		data
doc		doc
setup		setup
source		source
.dockerignore		.dockerignore
.gitignore		.gitignore
.travis.yml		.travis.yml
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

tAPAS

Assisted Parameter optimization by Approximating neighbourhood Similarity

About

Releases

Packages

Languages

License

rmitsch/tAPAS

Folders and files

Latest commit

History

Repository files navigation

tAPAS

Assisted Parameter optimization by Approximating neighbourhood Similarity

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages