Introducing Vector Semantic Search To Valkey #276

yairgott · 2025-06-13T21:18:09Z

Introducing Vector Semantic Search To Valkey

Signed-off-by: yairgott <[email protected]>

Signed-off-by: Yair Gottdenker <[email protected]>

Signed-off-by: yairgott <[email protected]>

content/blog/2025-06-13-introducing-valkey-search/index.md

Signed-off-by: Yair Gottdenker <[email protected]>

madolson

I don't have any major issues outside of the lack of benchmarking.

content/blog/2025-06-13-introducing-valkey-search/index.md

madolson · 2025-06-13T22:19:07Z

content/blog/2025-06-13-introducing-valkey-search/index.md

@@ -0,0 +1,173 @@
+The Valkey project is introducing vector similarity search capabilities through [valkey-search](https://github.com/valkey-io/valkey-search) (BSD-3-Clause licensed), an official Valkey module compatible with Valkey versions 8.1.1 and above. 
+
+With valkey-search you can search through billions of vectors with single-digit millisecond latencies and greater than 99% recall. Whether you're building semantic search, fraud detection systems, or conversational AI experiences, valkey-search offers a performant and flexible foundation.


Without a real benchmark to back this up, I don't feel great saying this basically as our tagline.

I second this. Especially given that performance is the key competitive advantage.

Summarizing a meeting discussion with Allen and Madelyn:
Ideally, the benchmarks would include comparative results against leading vector databases. However, due to publishing constraints from both GCP and AWS, a more practical path forward may be to have these numbers shared by an external blogger. As a result, this blog post will focus primarily on the functionality.

I would avoid being specific because the numbers aren't backed up by a benchmark, but you can definitely say something looser but accurate: "you can search through billions of vectors with the kind of performance you expect out of Valkey".


content/blog/2025-06-13-introducing-valkey-search/index.md

madolson · 2025-06-14T00:35:29Z

content/blog/2025-06-13-introducing-valkey-search/index.md

@@ -0,0 +1,152 @@
+++
+title= "Introducing Vector Search To Valkey" 
+description = "Learn how to use valkey-search to search through billions of vectors with single-digit millisecond latencies and greater than 99% recall." 


Suggested change

description = "Learn how to use valkey-search to search through billions of vectors with single-digit millisecond latencies and greater than 99% recall."

description = "Learn how to use valkey-search to implement highly reliable vector similarity search."

Based on another thread, I think we should split the performance from the use case. We can have a followup that details the performance characteristics.

By removing this claim, or not stating something similar, I'm concerned that we leave almost no appetite to try it out...

I find "highly reliable" to be totally bland and a turn-off. It's worse than saying nothing.

Can we leverage some of the Valkey verbiage? Something like "Learn all about doing vector similarity search on Valkey, the world's fastest xxxxx"? Here the claim is for Valkey, and VSS lives in the reflected glory.

Valkey is not the world's fastest datastore. We don't make that claim anywhere.

We sometimes just say "Built for high-performance applications".

content/blog/2025-06-13-introducing-valkey-search/index.md


madolson · 2025-06-19T15:54:45Z

content/blog/2025-06-13-introducing-valkey-search/index.md

@@ -0,0 +1,152 @@
+++
+title= "Introducing Vector Search To Valkey" 
+description = "Learn how to use valkey-search to search through billions of vectors with single-digit millisecond latencies and greater than 99% recall." 


Valkey is not the world's fastest datastore. We don't make that claim anywhere.

madolson · 2025-06-19T15:58:10Z

content/blog/2025-06-13-introducing-valkey-search/index.md

+
+The Valkey project is introducing vector similarity search capabilities through [valkey-search](https://github.com/valkey-io/valkey-search) (BSD-3-Clause licensed), an official Valkey module compatible with Valkey versions 8.1.1 and above. 
+
+With valkey-search you can search through billions of vectors with single-digit millisecond latencies and greater than 99% recall. Whether you're building semantic search, fraud detection systems, or conversational AI experiences, valkey-search offers a performant and flexible foundation.


Suggested change

With valkey-search you can search through billions of vectors with single-digit millisecond latencies and greater than 99% recall. Whether you're building semantic search, fraud detection systems, or conversational AI experiences, valkey-search offers a performant and flexible foundation.

With valkey-search you can easily create indexes to search through billions of vectors stored within your Valkey instances. Whether you're building semantic search, fraud detection systems, or conversational AI experiences, valkey-search offers a flexible foundation for your application.

stockholmux

I have a lot of nits on this, but it's overall a very comprehensive blog post.

The one place that needs a little work that I couldn't really resolve is how the headings are supposed to work. I would flatten the overall structure: you have all the way up to an H5 but the post seems to be written more flatly than this.

stockholmux · 2025-06-19T16:03:35Z

content/authors/allenss.md

+    github: allenss-amazon 
+---
+
+Allen Samuels is a Principal Engineer at AWS.


Ideally, this will have a couple of lines beyond title and affiliation.

Let's add this:

He is passionate about distributed, performant systems. When not travelling the world for pleasure or playing duplicate bridge, Allen can be found in San Jose, California.

stockholmux · 2025-06-19T16:09:52Z

content/blog/2025-06-13-introducing-valkey-search/index.md

+featured_image = "/assets/media/featured/random-08.webp"
+++
+
+The Valkey project is introducing vector similarity search capabilities through [valkey-search](https://github.com/valkey-io/valkey-search) (BSD-3-Clause licensed), an official Valkey module compatible with Valkey versions 8.1.1 and above. 


Valkey project is introducing

Present tense is odd and I'm not sure that this is a great first sentence. I would drop it.

The next line is way more impactful - take the technical aspects here (license, compatibility, link) and put them into the next line.

stockholmux · 2025-06-19T16:26:54Z

content/blog/2025-06-13-introducing-valkey-search/index.md

@@ -0,0 +1,173 @@
+The Valkey project is introducing vector similarity search capabilities through [valkey-search](https://github.com/valkey-io/valkey-search) (BSD-3-Clause licensed), an official Valkey module compatible with Valkey versions 8.1.1 and above. 
+
+With valkey-search you can search through billions of vectors with single-digit millisecond latencies and greater than 99% recall. Whether you're building semantic search, fraud detection systems, or conversational AI experiences, valkey-search offers a performant and flexible foundation.


I would avoid being specific because the numbers aren't backed up by a benchmark, but you can definitely say something looser but accurate: "you can search through billions of vectors with the kind of performance you expect out of Valkey".

stockholmux · 2025-06-19T16:30:15Z

content/blog/2025-06-13-introducing-valkey-search/index.md

+
+## Semantic Search
+
+The ability of AI models to extract semantic meaning enables new classes of searching algorithms, collectively known as semantic search. An AI model can process an input and convert it into a single high-dimension numeric vector – known as an embedding. Inputs with similar meaning will have similar embeddings. Semantic search is the process of converting a query into its embedding and searching a database of embeddings to find the embeddings that are most alike. 


'searching algorithms' -> 'search algorithms'

stockholmux · 2025-06-19T16:33:06Z

content/blog/2025-06-13-introducing-valkey-search/index.md

+
+## Semantic Search
+
+The ability of AI models to extract semantic meaning enables new classes of searching algorithms, collectively known as semantic search. An AI model can process an input and convert it into a single high-dimension numeric vector – known as an embedding. Inputs with similar meaning will have similar embeddings. Semantic search is the process of converting a query into its embedding and searching a database of embeddings to find the embeddings that are most alike. 


The term 'embedding' is used a ton here and it reduces the clarity of the sentences. We should try to minimize 'embedding' where possible to make it less repetitious.

Example:

searching a database of embeddings to find the embeddings that are most alike

to

searching a database of embeddings to find the ones that are most alike
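For anyone following this thread, the paragraph under review (query converted to an embedding, then matched against stored embeddings) can be illustrated with a small sketch. This is only an illustration under stated assumptions: the vectors are made-up 3-dimensional values, the key names are hypothetical, and the search is a brute-force cosine-similarity scan rather than the HNSW index the post says valkey-search uses.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def nearest(query, database, k=2):
    """Return the k keys whose embeddings are most similar to the query."""
    ranked = sorted(
        database,
        key=lambda key: cosine_similarity(query, database[key]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical embeddings; real models emit far more dimensions.
database = {
    "doc:cat": [0.9, 0.1, 0.0],
    "doc:kitten": [0.8, 0.2, 0.1],
    "doc:invoice": [0.0, 0.1, 0.9],
}

# Stand-in for the embedding of a query with a meaning close to "cat".
query = [0.85, 0.15, 0.05]
top = nearest(query, database, k=2)
```

Inputs with similar meaning sit close together, so the two animal documents rank ahead of the invoice. A linear scan like this costs one comparison per stored vector, which is why an approximate index is used at scale.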

stockholmux · 2025-06-19T21:08:43Z

content/blog/2025-06-13-introducing-valkey-search/index.md

+
+## Performance & Low Latency
+
+Valkey-search is designed as an in-memory secondary index, achieving exceptional performance. A multi-threaded architecture optimizes query and mutation processing with minimal thread contention, enabling near-linear vertical scalability.


As mentioned earlier, I think I would avoid directly describing performance since we haven't proven this in any way.

If you alter it slightly, the reader will fill in the blanks as far as performance without needing a proof point.

'Valkey-search was designed from the ground up as an in-memory secondary index.'

stockholmux · 2025-06-19T21:11:26Z

content/blog/2025-06-13-introducing-valkey-search/index.md

+
+Valkey-search is designed as an in-memory secondary index, achieving exceptional performance. A multi-threaded architecture optimizes query and mutation processing with minimal thread contention, enabling near-linear vertical scalability.
+
+At its core, valkey-search’s threading architecture follows a common design pattern: a worker thread pool combined with task queues. It employs advanced synchronization mechanisms to maintain index consistency while minimizing contention among worker threads. By time-slicing CPU access between read and write operations, the system enables an almost lock-free read path, delivering high performance and consistently low search latency. 


My skeptical brain reads: 'the system enables an almost lock-free read path' as 'the read path isn't lock free'.

I might change it to 'the system minimizes locks on the read path'.

It's harder to twist this around.
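For context on the "worker thread pool combined with task queues" pattern the quoted paragraph names, here is a minimal generic sketch. It is not valkey-search's code: the function name `run_pool` and the sentinel-based shutdown are assumptions made for illustration, and it does not attempt the read/write time-slicing the post describes.

```python
import queue
import threading


def run_pool(tasks, num_workers=4):
    """Run callables on a fixed pool of worker threads fed by one shared queue."""
    task_queue = queue.Queue()
    results = []
    results_lock = threading.Lock()

    def worker():
        while True:
            task = task_queue.get()
            if task is None:  # sentinel: no more work for this worker
                task_queue.task_done()
                return
            outcome = task()
            with results_lock:  # only the shared results list is locked
                results.append(outcome)
            task_queue.task_done()

    threads = [threading.Thread(target=worker) for _ in range(num_workers)]
    for thread in threads:
        thread.start()
    for task in tasks:
        task_queue.put(task)
    for _ in threads:
        task_queue.put(None)  # one sentinel per worker
    task_queue.join()
    for thread in threads:
        thread.join()
    return results


# Results arrive in completion order, not submission order.
squares = run_pool([lambda n=n: n * n for n in range(8)])
```

The point relevant to the wording debate is that the tasks themselves run without holding a lock; contention is confined to the queue and the small critical section around the shared result list.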

stockholmux · 2025-06-19T21:12:55Z

content/blog/2025-06-13-introducing-valkey-search/index.md

+
+At its core, valkey-search’s threading architecture follows a common design pattern: a worker thread pool combined with task queues. It employs advanced synchronization mechanisms to maintain index consistency while minimizing contention among worker threads. By time-slicing CPU access between read and write operations, the system enables an almost lock-free read path, delivering high performance and consistently low search latency. 
+
+Valley-search’s HNSW implementation is based on the OSS project [HNSWLib](https://github.com/nmslib/hnswlib). While HNSWLib is well-regarded for its speed, we have enhanced its performance and efficiency for our use case. These improvements include better `SIMD` utilization, promotion of CPU cache efficiency, memory utilization and more.


Valley-search -> Valkey-search :)

I also wouldn't code block SIMD (it's just an initialism)

stockholmux · 2025-06-19T21:14:58Z

content/blog/2025-06-13-introducing-valkey-search/index.md

+
+We welcome contributions of all kinds - code, documentation, testing, and feedback. Join the community, file issues, open pull requests, or suggest improvements. Your involvement helps make valkey-search better for everyone.
+
+Ready to dive in? Clone the repo, fire up the [dev container](https://hub.docker.com/r/valkey/valkey-extensions), and start building high-performance vector search with valkey-search.


it's valkey-bundle now

stockholmux · 2025-06-19T21:15:46Z

content/blog/2025-06-13-introducing-valkey-search/index.md

+
+Clients must send data mutation (write) commands to the primary node which are executed and then automatically asynchronously transmitted to each replica. Clients can send data read operations to any node in the cluster, recognizing that reading from a replica delivers a result reflecting a historical point in time.
+
+When Valkey-search is used, each node, whether a primary or a replica, builds and maintains its own indexes. No additional traffic on the replication channel is generated for index maintenance. Search query operations sent to a replica will be executed against its indexes, reflecting the historical point in time of the data within that node.


lower case Valkey-search
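The replication behaviour described in the quoted paragraphs (each node builds its own index, and a replica answers from a historical point in time) can be modelled with a toy sketch. Everything here is hypothetical: the `Node` class, the `write` helper, and the use of a plain set as a stand-in for a search index are illustration only and do not reflect valkey-search internals.

```python
from collections import deque


class Node:
    """A node that holds its own data and maintains its own local index."""

    def __init__(self):
        self.data = {}
        self.index = set()      # stand-in for a real search index
        self.pending = deque()  # replicated writes not yet applied

    def apply(self, key, value):
        self.data[key] = value
        self.index.add(key)     # index is built locally, never replicated

    def catch_up(self):
        """Apply every write received over the replication channel."""
        while self.pending:
            self.apply(*self.pending.popleft())

    def search(self, key):
        return key in self.index


def write(primary, replicas, key, value):
    """Execute on the primary, then queue only the data write for replicas."""
    primary.apply(key, value)
    for replica in replicas:
        replica.pending.append((key, value))


primary, replica = Node(), Node()
write(primary, [replica], "doc:1", [0.1, 0.2])
before = replica.search("doc:1")  # replica has not applied the write yet
replica.catch_up()
after = replica.search("doc:1")
```

Only the `(key, value)` write crosses the replication channel in this model; the replica derives its index entry when it applies the write, which is the "no additional traffic for index maintenance" property the post claims.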

initial version

f83e2e4

Signed-off-by: yairgott <[email protected]>

yairgott requested review from madolson and stockholmux as code owners June 13, 2025 21:18

yairgott marked this pull request as draft June 13, 2025 21:18

yairgott added 2 commits June 13, 2025 15:02

Update index.md

ad4b531

Signed-off-by: Yair Gottdenker <[email protected]>

adding author images

bb4d4e4

Signed-off-by: yairgott <[email protected]>

madolson reviewed Jun 13, 2025


Update index.md

771797c

Signed-off-by: Yair Gottdenker <[email protected]>

madolson reviewed Jun 13, 2025


yairgott marked this pull request as ready for review June 13, 2025 22:22

yairgott and others added 8 commits June 13, 2025 15:23

Update content/blog/2025-06-13-introducing-valkey-search/index.md

3032f51

Co-authored-by: Madelyn Olson <[email protected]> Signed-off-by: Yair Gottdenker <[email protected]>

Linking to the dev container

a8a377b

Signed-off-by: Yair Gottdenker <[email protected]>

removing authors section

bb883ac

Signed-off-by: Yair Gottdenker <[email protected]>

Create yairgott.md

a45e594

Signed-off-by: Yair Gottdenker <[email protected]>

Create allenss.md

27beddf

Signed-off-by: Yair Gottdenker <[email protected]>

moving author images

c395890

Signed-off-by: yairgott <[email protected]>

Linking to HNSW wiki

8fcef93

Signed-off-by: Yair Gottdenker <[email protected]>

fixing authors

96b676f

Signed-off-by: Yair Gottdenker <[email protected]>

madolson reviewed Jun 13, 2025


madolson reviewed Jun 14, 2025


yairgott and others added 7 commits June 16, 2025 17:49

Update content/blog/2025-06-13-introducing-valkey-search/index.md

82a38bc

Co-authored-by: Madelyn Olson <[email protected]> Signed-off-by: Yair Gottdenker <[email protected]>

Update content/blog/2025-06-13-introducing-valkey-search/index.md

f71b5c7

Co-authored-by: Madelyn Olson <[email protected]> Signed-off-by: Yair Gottdenker <[email protected]>

Update content/blog/2025-06-13-introducing-valkey-search/index.md

2851e22

Co-authored-by: Madelyn Olson <[email protected]> Signed-off-by: Yair Gottdenker <[email protected]>

Update content/blog/2025-06-13-introducing-valkey-search/index.md

a78abc7

Co-authored-by: Madelyn Olson <[email protected]> Signed-off-by: Yair Gottdenker <[email protected]>

Update content/blog/2025-06-13-introducing-valkey-search/index.md

060d348

Co-authored-by: Madelyn Olson <[email protected]> Signed-off-by: Yair Gottdenker <[email protected]>

Update content/blog/2025-06-13-introducing-valkey-search/index.md

dddfa79

Co-authored-by: Madelyn Olson <[email protected]> Signed-off-by: Yair Gottdenker <[email protected]>

addressing commets

a2fe745

Signed-off-by: Yair Gottdenker <[email protected]>

madolson reviewed Jun 19, 2025


stockholmux requested changes Jun 19, 2025


