Skip to content

sharding and replication_factor for a collections #4340

Closed Answered by timvisee
mm3906078 asked this question in Q&A
Discussion options

You must be logged in to vote

Each single point is in all replicas of a single shard.

If you have a replication factor of 5, it means you have 5 copies of all your data across your nodes. 5 replications also mean that 4 nodes may fail without taking the whole cluster down.

Above you mention you have 5 nodes. In such case I'd recommend one of the two following configurations to minimize the amount of data on your cluster:

  • shard_number: 10, replication_factor: 2 - allows at most 1 node to fail, has 2 copies of all points across the cluster
  • shard_number: 10, replication_factor: 3 - allows at most 2 nodes to fail, has 3 copies of all points across the cluster

Replies: 1 comment 4 replies

Comment options

You must be logged in to vote
4 replies
@timvisee
Comment options

Answer selected by mm3906078
@mm3906078
Comment options

@timvisee
Comment options

@mm3906078
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants