Blog post: Scaling Valkey Cluster to 1 Billion RPS #392

hpatro · 2025-10-13T21:25:40Z

Description

Adds content for "1 Billion RPS blog post"

Issues Resolved

#337

Check List

Commits are signed per the DCO using --signoff

By submitting this pull request, I confirm that my contribution is made under the terms of the BSD-3-Clause License.

Signed-off-by: Harkrishn Patro <[email protected]>

Signed-off-by: Kyle J. Davis <[email protected]>

stockholmux

Over all good blog post. I made a bunch of small edits in a pull request on to your fork @hpatro see: hpatro#1

One thing that needs cleaning up is a collection pronoun issue - there are uses of 'we'/'our' that I think are referring to the project but others that are referring to the group of authors. This needs to be super crisp - take a look through and determine who did what and revise. If it's the project, try to make a cleaner attribution.

content/blog/2025-10-20-1-billion-rps/index.md

stockholmux · 2025-10-14T20:21:08Z

content/blog/2025-10-20-1-billion-rps/index.md

+
+## Closing thoughts
+
+With all these improvements made in Valkey, a cluster can now scale to 1B RPS / 2000 nodes which is quite a remarkable feat to achieve. However, there is plenty of room to improve further. The steady state CPU utilization overhead from the cluster bus message transfer/processing can be reduced further by incorporating [SWIM protocol](https://en.wikipedia.org/wiki/SWIM_Protocol) or move it off the main thread into an independent separate thread. The failover logic can be made smarter as well by incorporating the AZ placement of nodes. We would also like to introduce more observability metrics/logs into the system for better manageability of it. All of these are being linked under [Support Large Cluster](https://github.com/valkey-io/valkey/issues/2281) issue. Feel free to check it out and add in your suggestions.


I would like to have a little better call to action. What can a user do next aside from look at the issue? Is there further reading? Another blog post of interest?

Engaging on the issue should be the next step.

"Feel free to check it out and add in your suggestions." - This statement was my call to action.

There is a detailed spec available here https://valkey.io/topics/cluster-spec/. Do we want to link it here ?

stockholmux · 2025-10-14T20:23:29Z

content/authors/maheshcherukumilli.md

+    github: cherukum-Amazon
+---
+
+Mahesh Cherukumilli is an engineering leader passionate about large-scale distributed systems and high-performance databases. He sets strategy and builds high-performing teams that align architecture with business outcomes and deliver reliable, at-scale infrastructure.


The phrase 'large-scale distributed systems' appears in all three bios. Can we mix it up a bit? Or maybe you all just share a very similar passion? 🤣

This is something quite difficult to update 😂

content/blog/2025-10-20-1-billion-rps/index.md

madolson

I just looked at the technical details and they looked good, will leave it to Kyle to ship.

review pass on 1 billion RPS blog

Signed-off-by: Harkrishn Patro <[email protected]>

hpatro · 2025-10-14T22:19:24Z

Over all good blog post. I made a bunch of small edits in a pull request on to your fork @hpatro see: hpatro#1

Thanks @stockholmux for the help, pulled that in.

One thing that needs cleaning up is a collection pronoun issue - there are uses of 'we'/'our' that I think are referring to the project but others that are referring to the group of authors. This needs to be super crisp - take a look through and determine who did what and revise. If it's the project, try to make a cleaner attribution.

Cleaned up the our and some usage of we. Left the usage of we in the benchmarking section to indicate the authors setup/work.

Please take a look.

content/blog/2025-10-20-1-billion-rps/index.md

Signed-off-by: Harkrishn Patro <[email protected]>

hpatro · 2025-10-15T17:53:59Z

@stockholmux / @makubo-aws please have a look, addressed your feedback.

stockholmux

Two small changes and I'm clear to publish on Monday ahead of the 9.0 launch.

content/blog/2025-10-20-1-billion-rps/index.md

Signed-off-by: Harkrishn Patro <[email protected]>

hpatro · 2025-10-15T21:09:13Z

Two small changes and I'm clear to publish on Monday ahead of the 9.0 launch.

Have addressed them. The folder name and the release date on the post is also set for 10/20. So, we are good to release on Monday. @stockholmux Thanks for the help!

stockholmux

LGTM

Blog post: Scaling Valkey Cluster to 1B RPS

89c8b6d

Signed-off-by: Harkrishn Patro <[email protected]>

hpatro requested review from madolson and stockholmux as code owners October 13, 2025 21:25

hpatro self-assigned this Oct 13, 2025

review pass

e1d33c6

Signed-off-by: Kyle J. Davis <[email protected]>

stockholmux requested changes Oct 14, 2025

View reviewed changes

madolson reviewed Oct 14, 2025

View reviewed changes

content/blog/2025-10-20-1-billion-rps/index.md Outdated Show resolved Hide resolved

madolson reviewed Oct 14, 2025

View reviewed changes

hpatro and others added 5 commits October 14, 2025 14:48

Merge pull request #1 from stockholmux/blog_1b_rps

5e05b37

review pass on 1 billion RPS blog

Address feedback

20eafa1

Signed-off-by: Harkrishn Patro <[email protected]>

Address feedback

07641bc

Signed-off-by: Harkrishn Patro <[email protected]>

Address feedback

25e71de

Signed-off-by: Harkrishn Patro <[email protected]>

Address feedback

76e3ae4

Signed-off-by: Harkrishn Patro <[email protected]>

hpatro requested a review from stockholmux October 14, 2025 22:19

makubo-aws reviewed Oct 14, 2025

View reviewed changes

hpatro added 3 commits October 14, 2025 16:04

Address feedback

2fefb60

Signed-off-by: Harkrishn Patro <[email protected]>

Address feedback

fda0018

Signed-off-by: Harkrishn Patro <[email protected]>

Add a high level intro to the blog

f454552

Signed-off-by: Harkrishn Patro <[email protected]>

hpatro requested a review from makubo-aws October 15, 2025 17:53

stockholmux requested changes Oct 15, 2025

View reviewed changes

content/blog/2025-10-20-1-billion-rps/index.md Outdated Show resolved Hide resolved

content/blog/2025-10-20-1-billion-rps/index.md Outdated Show resolved Hide resolved

Address feedback

c3b6246

Signed-off-by: Harkrishn Patro <[email protected]>

hpatro requested a review from stockholmux October 15, 2025 21:09

stockholmux approved these changes Oct 16, 2025

View reviewed changes

stockholmux merged commit 063c3be into valkey-io:main Oct 20, 2025
3 checks passed


		## Closing thoughts

		With all these improvements made in Valkey, a cluster can now scale to 1B RPS / 2000 nodes which is quite a remarkable feat to achieve. However, there is plenty of room to improve further. The steady state CPU utilization overhead from the cluster bus message transfer/processing can be reduced further by incorporating [SWIM protocol](https://en.wikipedia.org/wiki/SWIM_Protocol) or move it off the main thread into an independent separate thread. The failover logic can be made smarter as well by incorporating the AZ placement of nodes. We would also like to introduce more observability metrics/logs into the system for better manageability of it. All of these are being linked under [Support Large Cluster](https://github.com/valkey-io/valkey/issues/2281) issue. Feel free to check it out and add in your suggestions.

Blog post: Scaling Valkey Cluster to 1 Billion RPS #392

Blog post: Scaling Valkey Cluster to 1 Billion RPS #392

Uh oh!

Conversation

hpatro commented Oct 13, 2025

Description

Issues Resolved

Check List

Uh oh!

stockholmux left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

stockholmux Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

madolson Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

hpatro Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

stockholmux Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

hpatro Oct 14, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

madolson left a comment

Choose a reason for hiding this comment

Uh oh!

hpatro commented Oct 14, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

hpatro commented Oct 15, 2025

Uh oh!

stockholmux left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

hpatro commented Oct 15, 2025

Uh oh!

stockholmux left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants