[Feature]: Clustering optimization #28410
Labels
kind/feature
Issues related to feature request from users
Comments
/assign @wayblink
sre-ci-robot pushed a commit that referenced this issue on Jan 7, 2024: #28410 /kind feature Signed-off-by: wayblink <[email protected]>
sre-ci-robot pushed a commit that referenced this issue on Feb 18, 2024: #28410 Signed-off-by: wayblink <[email protected]>
/assign
sre-ci-robot pushed a commit that referenced this issue on Dec 8, 2024: …imits of the DataNode (#38210) issue: #28410 master pr: #38209 Signed-off-by: Cai Zhang <[email protected]>
sre-ci-robot pushed a commit that referenced this issue on Dec 8, 2024: … of the DataNode (#38209) issue: #28410 Signed-off-by: Cai Zhang <[email protected]>
sre-ci-robot pushed a commit that referenced this issue on Dec 13, 2024: issue: #28410 master pr: #38417 Signed-off-by: Cai Zhang <[email protected]>
sre-ci-robot pushed a commit that referenced this issue on Dec 13, 2024: issue: #28410 Signed-off-by: Cai Zhang <[email protected]>
Is there an existing issue for this?
Is your feature request related to a problem? Please describe.
Umbrella issue for clustering key optimization in Milvus.
Efficient data storage and retrieval is central to database design. A clustering key determines how data is physically arranged in storage, based on the distribution of values within a table. In conventional database systems, that distribution is usually described by the minimum and maximum values of scalar fields. In a vector database, however, vectors are the primary entities, so Milvus aims to support both scalar clustering keys and vector clustering keys.
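To make the scalar case concrete, here is a minimal Python sketch of min/max-based pruning. It is an illustration only, not Milvus code: `SegmentStats`, `prune_segments`, and the sample ranges are hypothetical names and values chosen for this example.

```python
from dataclasses import dataclass


@dataclass
class SegmentStats:
    """Hypothetical per-segment metadata: min/max of the scalar clustering key."""
    segment_id: int
    min_key: int
    max_key: int


def prune_segments(segments, lo, hi):
    """Keep only segments whose [min_key, max_key] range overlaps the query range [lo, hi]."""
    return [s for s in segments if s.max_key >= lo and s.min_key <= hi]


# Data clustered on the key, so each segment covers a narrow, disjoint range.
segments = [
    SegmentStats(segment_id=1, min_key=0, max_key=99),
    SegmentStats(segment_id=2, min_key=100, max_key=199),
    SegmentStats(segment_id=3, min_key=200, max_key=299),
]

hits = prune_segments(segments, lo=150, hi=220)
print([s.segment_id for s in hits])  # [2, 3]
```

The tighter and less overlapping the per-segment ranges are, the more segments a range filter can skip, which is the motivation for physically arranging data by the clustering key.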
Key changes:
1. Support designating a scalar or vector field as the clustering key for a collection.
2. Support bulk insert of data with specific clustering information. Milvus organizes the data based on the provided clustering information.
3. Filter out irrelevant data during searches based on clustering information.
4. Support compacting collections that have a clustering key, which rearranges their storage.
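For a vector clustering key, the compaction and search-time filtering described above might look roughly like the following Python sketch. The issue does not specify the clustering method, so this assumes each segment is represented by a precomputed centroid; `compact_by_cluster`, `segments_to_search`, and `n_probe` are hypothetical names for illustration, not Milvus APIs.

```python
import math


def nearest_centroid(vec, centroids):
    """Index of the centroid closest to vec (Euclidean distance)."""
    return min(range(len(centroids)), key=lambda i: math.dist(vec, centroids[i]))


def compact_by_cluster(rows, centroids):
    """Regroup rows so each output segment holds the vectors closest to one centroid."""
    segments = {i: [] for i in range(len(centroids))}
    for row in rows:
        segments[nearest_centroid(row, centroids)].append(row)
    return segments


def segments_to_search(query, centroids, n_probe):
    """Pick the n_probe segments whose centroids are closest to the query; skip the rest."""
    ranked = sorted(range(len(centroids)), key=lambda i: math.dist(query, centroids[i]))
    return ranked[:n_probe]


# Assumed inputs: centroids are given here; how they are computed is out of scope.
centroids = [(0.0, 0.0), (10.0, 10.0), (20.0, 0.0)]
rows = [(1.0, 1.0), (9.0, 11.0), (19.0, 1.0), (0.5, -0.5)]

segments = compact_by_cluster(rows, centroids)
probe = segments_to_search((8.0, 8.0), centroids, n_probe=1)
print(segments[0])  # [(1.0, 1.0), (0.5, -0.5)]
print(probe)        # [1]
```

Searching only the closest segments trades some recall for less data scanned; a larger `n_probe` in this sketch searches more segments.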
Phase 1: Support bulk insert and query data with clustering info
Tasks:
Phase 2: Clustering based compaction
Dependency:
Tasks:
Describe the solution you'd like.
No response
Describe an alternate solution.
No response
Anything else? (Additional Context)
No response