You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I believe this is a new bug in the Pinecone Python Client
I have searched the existing issues, and I could not find an existing issue for this bug
Current Behavior
If you query pinecone index with a vector ID with topk=1, the returned vectors sometimes have different id. If you keep top_k > 1, sometimes the correct vector is found in positions k>1.
Expected Behavior
If I search using vector id, the whole point is to get the vector whose id matches the query. Then find other vectors with high similarity scores.
@zackproser Thanks for pointing to the documentation. That was really helpful. I want to suggest two things here
When a user is querying vector by an ID it is expected behaviour to get that particulat vector as the first hit and then the nearestest neighbours as other hits.
If I search by an ID that doesn't exist in the database, we get the response that the ID doesn't exists. That indicates, when you search by an ID that exixts, you should have the ID in the results. Otherwise the product design is not consistent.
When a user is querying vector by an ID it is expected behaviour to get that particulat vector as the first hit and then the nearestest neighbours as other hits.
I think @ayansengupta17 is right, it should be the first hit when query with the ID. Below is what I'm using in another vector database, it return the ID in first hit as I expected
Is this a new bug in the Pinecone Python client?
Current Behavior
If you query pinecone index with a vector ID with topk=1, the returned vectors sometimes have different id. If you keep top_k > 1, sometimes the correct vector is found in positions k>1.
Expected Behavior
If I search using vector id, the whole point is to get the vector whose id matches the query. Then find other vectors with high similarity scores.
Steps To Reproduce
It's hard to provide a reproducible steps, because it happens sometimes. We see it happening a lot in our production environment. So I rather attach some relevant screenshots from the UI.
checkour more examples https://community.pinecone.io/t/bug-pinecone-search-by-id-is-returning-incorrect-result/5554
Relevant log output
Environment
Additional Context
No response
The text was updated successfully, but these errors were encountered: