
Consistent MinHash hash values across Python versions

@ekzhu ekzhu released this 26 Mar 17:32
· 116 commits to master since this release

MinHash now uses NumPy's random number generator instead of Python's built-in `random` module, so it generates consistent hash values across different Python versions.

The side effect is that MinHash objects created before version 1.1.3 will not work correctly with those created after it; this affects the operations that combine two MinHashes (i.e., jaccard, merge and union).
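
To illustrate why the two generations of MinHash cannot be mixed, here is a minimal sketch of a MinHash signature whose permutation parameters come from a seeded NumPy generator. It is not the library's actual code: the function names `make_permutations`, `signature` and `estimate_jaccard`, the use of `np.random.RandomState`, the SHA-1 based hashing and the constants are all assumptions made for this example. The point it shows is that two signatures are only comparable when they were built from the same permutation parameters.

```python
import hashlib

import numpy as np

# Illustrative constants (assumptions for this sketch).
MERSENNE_PRIME = (1 << 61) - 1
MAX_HASH = (1 << 32) - 1


def make_permutations(seed, num_perm):
    """Draw the (a, b) parameters of num_perm hash functions from a seeded generator."""
    rng = np.random.RandomState(seed)
    a = rng.randint(1, MERSENNE_PRIME, size=num_perm, dtype=np.int64)
    b = rng.randint(0, MERSENNE_PRIME, size=num_perm, dtype=np.int64)
    # Convert to Python ints so the arithmetic below cannot overflow.
    return [int(x) for x in a], [int(x) for x in b]


def signature(tokens, seed=1, num_perm=16):
    """Compute a toy MinHash signature for an iterable of strings."""
    a, b = make_permutations(seed, num_perm)
    sig = [MAX_HASH] * num_perm
    for tok in tokens:
        # 32-bit base hash taken from the first 4 bytes of SHA-1.
        hv = int.from_bytes(hashlib.sha1(tok.encode("utf-8")).digest()[:4], "little")
        for i in range(num_perm):
            v = ((a[i] * hv + b[i]) % MERSENNE_PRIME) & MAX_HASH
            if v < sig[i]:
                sig[i] = v
    return sig


def estimate_jaccard(sig1, sig2):
    """Fraction of positions where the two signatures agree."""
    return sum(x == y for x, y in zip(sig1, sig2)) / len(sig1)


tokens = ["minhash", "is", "a", "probabilistic", "data", "structure"]

# Same seed, same generator: the signatures agree position by position.
same = estimate_jaccard(signature(tokens, seed=1), signature(tokens, seed=1))

# Different permutation parameters (here simulated with another seed):
# the estimate for two identical sets is no longer meaningful.
mixed = estimate_jaccard(signature(tokens, seed=1), signature(tokens, seed=2))
print(same, mixed)
```

In this sketch, changing the seed stands in for changing the random number generator: either way the permutation parameters differ, so comparing or merging the resulting signatures position by position no longer reflects the similarity of the underlying sets.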