the definition of QPS in internet services is different #24
Comments
In most internet services, QPS means how many requests the server can process in parallel each second. That is affected by more than just the time each request takes.
Are you referring to our existing formula for measuring throughput?
(latency to search a batch of queries) / (number of queries in a batch)
…On Mon, Aug 23, 2021 at 5:06 AM nk2014yj ***@***.***> wrote:
In most internet services, QPS means how many requests the server can
process in parallel each second. That is affected by more than just the
time each request takes.
faiss-lib uses methods like "#pragma omp parallel" to reduce running
time, which does not improve the ability to process requests per
second. The results of our experiment showed that using parallel
instructions may reduce the QPS.
Would you consider using the common definition of QPS, and providing
an RPC service to compare each algorithm's true ability to process
search requests?
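To make the two definitions in this thread concrete, here is a minimal Python sketch. Note that the ratio quoted above, batch latency divided by batch size, is the average time per query; the throughput it implies is the reciprocal. The names `batch_qps`, `served_qps`, and `fake_search` are hypothetical and are not part of the benchmark code. `fake_search` only sleeps, so it does not reproduce the CPU contention that OpenMP parallelism inside each request could cause under concurrent load; a real comparison would need to call the actual index.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def batch_qps(batch_latency_s: float, num_queries: int) -> float:
    """Throughput implied by one batch: queries divided by batch latency."""
    return num_queries / batch_latency_s


def served_qps(handle_request, num_requests: int, num_clients: int) -> float:
    """Requests completed per wall-clock second with concurrent clients."""
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=num_clients) as pool:
        # Each call stands in for one independent single-query request.
        list(pool.map(lambda _: handle_request(), range(num_requests)))
    elapsed = time.perf_counter() - start
    return num_requests / elapsed


def fake_search():
    # Hypothetical stand-in for a single-query search (assumption: 10 ms each).
    time.sleep(0.01)


if __name__ == "__main__":
    # A batch of 10,000 queries answered in 2 s implies 5000 queries per second.
    print(batch_qps(batch_latency_s=2.0, num_queries=10_000))
    # Service-style measurement: 200 single requests from 8 concurrent clients.
    print(served_qps(fake_search, num_requests=200, num_clients=8))
```

The first number is what a batch benchmark reports; the second is closer to what an RPC service would observe, and the two can diverge when each request already uses every core.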