educative.io

Does the Distributed Cache Use Nested Clustering?

Hi all,

I noticed the following pattern:

  • One Data Center
  • This data Center has a Cluster. This Cluster consists of multiple nodes, each node is a Cache Server.
  • The cache server to achieve higher availability has replicas (in a primary secondary fashion, where primary has 2 backups read replicas).
  • Therefore a cache server consist of 3 nodes, the primary, and other 2 machines (the backups). So in other words, it is a “cluster” with 3 nodes (1 primary with 2 secondary).
  • That being said, we have the outer cluster (with N number of nodes/cache servers) and each node/cache server consist in one primary with 2 replicas (so 3 nodes).

Could you guys clarify if I’m seeing it right?


Course: Grokking Modern System Design Interview for Engineers & Managers - AI-Powered Learning for Developers
Lesson: Detailed Design of a Distributed Cache - Grokking Modern System Design Interview for Engineers & Managers

Hi Rodrigo,

You’re on the right track with your understanding of a distributed cache in a data center. Your data center’s cache architecture seems to consist of an outer cluster with multiple cache servers, and each cache server has its own inner cluster of 3 nodes (1 primary and 2 replicas) to ensure high availability and fault tolerance.

Let’s clarify the structure a bit more:

  • Outer Cluster: This is the larger cluster within the data center that consists of N number of nodes. Each node in this cluster is a cache server designed to handle caching operations.
  • Inner Cluster (Cache Server): Each cache server in the outer cluster is further composed of an inner cluster of 3 nodes for high availability.

Thank you.