Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

FR: Reranker customization for paper search #212

Open
phongnguyen-aveksana opened this issue Sep 12, 2024 · 1 comment
Open

FR: Reranker customization for paper search #212

phongnguyen-aveksana opened this issue Sep 12, 2024 · 1 comment
Assignees
Labels
feature request New feature or request

Comments

@phongnguyen-aveksana
Copy link

Hi Semantic Scholar team. I am using Semantic Scholar for our product. One major problem I encounter is that the paper search results are very often empty. Due to the nature of my product, most of the search queries are longer and have a lot of keywords. I suspect this is he main reason why the empty results are frequent.
I have read your blog post about the search algorithm currently used. Although I have found many problems with this approach (one of which is to train the reranker on fairly limited ranking data without training updates), this shouldn't be the sole reason of empty results. I suspect you are using a very low (or high) threshold for the reranker that eliminates all of the results for certain queries.
So I wonder if any of these features can be added for the API:
_ Allowing getting the original search results, bypassing the reranker.
_ Allowing user to adjust the reranker threshold to make it more "lenient".
Thank you so much for your help and the wonderful product.

@cfiorelli
Copy link
Collaborator

@phongnguyen-aveksana Thank you for this feedback. Could you provide some example queries where i can test on what you're seeing? Thanks !

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants