Viktor Somogyi-Vass created KAFKA-14281:
-------------------------------------------
Summary: Multi-level rack awareness
Key: KAFKA-14281
URL: https://issues.apache.org/jira/browse/KAFKA-14281
Project: Kafka
Issue Type: Improvement
Components: core
Affects Versions: 3.4.0
Reporter: Viktor Somogyi-Vass
Assignee: Viktor Somogyi-Vass
h1. Motivation
With replication services data can be replicated across independent Kafka
clusters in multiple data center. In addition, many customers need "stretch
clusters" - a single Kafka cluster that spans across multiple data centers.
This architecture has the following useful characteristics:
- Data is natively replicated into all data centers by Kafka topic replication.
- No data is lost when 1 DC is lost and no configuration change is required -
design is implicitly relying on native Kafka replication.
- From operational point of view, it is much easier to configure and operate
such a topology than a replication scenario via MM2.
Kafka should provide "native" support for stretch clusters, covering any
special aspects of operations of stretch cluster.
h2. Multi-level rack awareness
Additionally, stretch clusters are implemented using the rack awareness
feature, where each DC is represented as a rack. This ensures that replicas are
spread across DCs evenly. Unfortunately, there are cases where this is too
limiting - in case there are actual racks inside the DCs, we cannot specify
those. Consider having 3 DCs with 2 racks each:
/DC1/R1, /DC1/R2
/DC2/R1, /DC2/R2
/DC3/R1, /DC3/R2
If we were to use racks as DC1, DC2, DC3, we lose the rack-level information of
the setup. This means that it is possible that when we are using RF=6, that the
2 replicas assigned to DC1 will both end up in the same rack.
If we were to use racks as /DC1/R1, /DC1/R2, etc, then when using RF=3, it is
possible that 2 replicas end up in the same DC, e.g. /DC1/R1, /DC1/R2, /DC2/R1.
Because of this, Kafka should support "multi-level" racks, which means that
rack IDs should be able to describe some kind of a hierarchy. With this
feature, brokers should be able to:
# spread replicas evenly based on the top level of the hierarchy (i.e. first,
between DCs)
# then inside a top-level unit (DC), if there are multiple replicas, they
should be spread evenly among lower-level units (i.e. between racks, then
between physical hosts, and so on)
## repeat for all levels
--
This message was sent by Atlassian Jira
(v8.20.10#820010)