Skip to content

Locality-sensitive hashing algorithm for text similarity comparisons

License

Notifications You must be signed in to change notification settings

diffeo/py-nilsimsa

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ce92981 · Oct 12, 2021

History

18 Commits
Jan 31, 2016
Jun 11, 2015
Oct 12, 2021
Jan 23, 2014
Mar 27, 2015
Mar 27, 2015
Mar 27, 2015
Jun 11, 2015

Repository files navigation

This is a implementation of the nilsimsa algorithm, see http://en.wikipedia.org/wiki/Nilsimsa_Hash

An earlier version of this library was a port to Python of nilsimsa.pl (by way of a ruby port), which was GPLed. The reimplementation has an explanation of how these hashes work, and is MIT/X11 licensed.

"A nilsimsa code is something like a hash, but unlike hashes, a small change in the message results in a small change in the nilsimsa code. Such a function is called a locality-sensitive hash." Quoted from: http://ixazon.dynip.com/~cmeclax/nilsimsa.html