-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Function find_near_matches is alternating results for the same situation #40
Comments
Hi Cristiano, thanks for this report! That is indeed very strange. I have been able to reproduce this locally using your script; many thanks for that! I'll look into this. |
Hi @taleinat! I'm facing a similar issue here. Do you have any suggestion on this? |
Hi, Unfortunately I haven't had a chance to pin this down. I will mention, though, that the original question is searching for a pattern of length 11 ( @jcsilva, I suggest opening a separate issue with the relevant details, and I'll try to see what can be done. |
@RagnarCris, apologies for not having followed up your issue. Is this still relevant for you? |
Hi @taleinat , sorry for my late response. To provide a bit more of context, I'm trying to replace @RagnarCris on this topic, as he is now working in other subject here. So, I can open a new issue, but it would be basically the same described here. As you mentioned, the threshold is not used in the example provided. Actually it is just a small snippet of a major function ... this example could be reduced to:
Using this example, we would like to find a pattern of length 11 with The issue we didn't understand is that the index return by |
Hi, my name is Cristiano and I'm using the library to determine how far someone has read, by comparing a transcript to a text to be read.
And strangely, for the same parameters and strings, the function find_near_matches returned two different results, in 5 to 6 times that i runned the script.
I'm using the version 0.7.3 of fuzzysearch.
I'm leaving an example of a script in which this case is happening. If you want, just run around 7 times to see the results changing at least two times.
Thanks in advance!
Example:
P.S.: The text in the script is in Portuguese.
The output that i get by running 7 times this script is:
The text was updated successfully, but these errors were encountered: