Hi, Martin, Thanks for the reply.
Yes, I agree that the latency for KIP-1150 may not be as good as the traditional Kafka and is potentially a con for some users. Regarding the availability for KIP-1176, the following is what I posted in the discussion thread for that KIP. "In the proposed design, we propose using 1 leader in AZ1, 1 follower in AZ2 and 1 S3E1Z bucket in AZ1." This means that if S3E1Z in AZ1 is not available, we lose the availability of that partition, right? Thanks, Jun On Wed, Aug 13, 2025 at 2:13 AM Martin Andersson <martin.anders...@kambi.com> wrote: > Hello Jun. > > It seems to me that some of the drawbacks of KIP-1150 are not included > properly as Cons in your summary. Having a produce latency of ~1-2 sec for > P99 severly limits the useability of diskless topics. I'm quite sure the > company I work at would not be able to use this solution due to the effects > it has on latency. > > Also for KIP-1176 you state that the current availability story is weak. I > don't follow, can you please elaborate? > > Regards, > ________________________________ > From: Jun Rao <j...@confluent.io.INVALID> > Sent: Wednesday, August 6, 2025 19:05 > To: dev@kafka.apache.org <dev@kafka.apache.org> > Subject: Re: [DISCUSS] The Path Forward for Saving Cross-AZ Costs KIPs > > EXTERNAL SENDER. Do not click links or open attachments unless you > recognize the sender and know the content is safe. DO NOT provide your > username or password. > > > Hi, Luke, > > Thanks for starting the discussion. I took a look at all three proposals > and the following is my assessment. > > KIP-1150 (diskless): > Pros: > * has the most benefits to the users. > -- most complete saving of cross zone network cost (enabled by leader less > design) > -- better durability (by leveraging block storage) > -- best scalability (by separating data from the metadata) > * clean architecture (no unnatural intrusive changes to existing code base) > Cons: large effort, but arguable this is what's needed to build a true > cloud native architecture > > KIP-1176 (tier active segment) > Pros: > * limited benefits to the users > --saving of cross zone network cost (limited saving on the producer side) > * small effort > Cons: > * the current availability story is weak > * it's not clear if the effort is still small once details on correctness, > cost, cleanness are figured out > > KIP-1183 (share storage) > Pros: > * moderate benefits to the users > -- saving of cross zone network cost (limited saving on the producer side > and the consumer side) > -- better durability (by leveraging block storage) > -- improved scalability > Cons: > * weaker availability (no hot standby) > * scalability not as good as KIP-1150 > * effort to build the plugin is too large > > Thanks, > > Jun > > On Wed, Aug 6, 2025 at 12:58 AM Luke Chen <show...@gmail.com> wrote: > > > Hi Josep, > > > > Thanks for the update. > > > > > Luke, thank you for being proactive and caring about this topic! > > I believe many community users are also caring about this topic! :) > > > > Look forward to seeing the updated KIP! > > > > > > Hi Stanislav, > > > > Yes, it'd be good for the community to decide which way we want to go, > > Leaderless or leader-based is absolutely one of the decisions. > > And yes, more than one KIP is also good to me. It's just that we need a > way > > to move them forward. > > Otherwise, suppose one of the KIPs is ready for voting, we can anticipate > > requests to wait for the other two related KIPs. > > Any good suggestions? > > > > Hi Xinyu, > > > > Thanks for the reply. > > Look forward to seeing the updated KIP! > > > > > If the community plans to adopt a leaderless architecture, will the > focus > > be on a complete transition to leaderless, or will both architectures > > coexist in the long term? > > > > I don't think we will abandon the leader-based design as a lot of users > are > > still relying on it. > > Besides, KIP-1150 also claims the existing leader-based protocol works as > > usual. > > So, I think they should coexist in the long term. > > > > > > Thank you. > > Luke > > > > > > On Wed, Aug 6, 2025 at 10:13 AM Xinyu Zhou <yu...@apache.org> wrote: > > > > > Hi Luke, > > > > > > Thank you for creating this dedicated thread; we definitely need a > space > > to > > > discuss future steps for these topics. I apologize for my delay on > > KIP-1183 > > > and will provide more details in the coming weeks. > > > > > > I agree with Stanislav that we should first focus on the community's > > > direction. Specifically, should we consider introducing a leaderless > > > architecture to Kafka, given that it currently relies on a partitioned, > > > leader-based model? > > > > > > From my own perspective, I’m particularly interested in how Leaderless > > and > > > Leader-based architectures differ when it comes to handling data > > > locality—which directly affects batching and fetch efficiency—and in > the > > > way core features are implemented. For instance, ordering, compaction, > > > transactions, idempotent producers, and queues all have to be realized > on > > > the Coordinator in a Leaderless design, whereas in a Leader-based > design > > > they are handled by the Leader Partition. > > > > > > If the community plans to adopt a leaderless architecture, will the > focus > > > be on a complete transition to leaderless, or will both architectures > > > coexist in the long term? > > > > > > I welcome discussions on this topic and am eager to hear diverse > > opinions. > > > > > > Regards, > > > Xinyu > > > > > > On Wed, Aug 6, 2025 at 3:05 AM Stanislav Kozlovski < > > > stanislavkozlov...@apache.org> wrote: > > > > > > > Thank you Luke for this wonderful summary and taking initiative. > > > > > > > > To me, it seems like a large differentiator from KIP-1150 and others > is > > > > the leaderless design. The other two don’t allow for it. > > > > > > > > It sounds productive to focus the discussion on whether the > leaderless > > > > design is worth it on top of the replication cost savings. > > > > > > > > I’m of the opinion that it’s worth pursuing - both for the truly zero > > > > network cost (no producer cross az) but perhaps even more importantly > > the > > > > zero state architecture that promises to significantly simplify > > > operations, > > > > including auto scaling brokers and scaling throughput per partition > > > > > > > > It would be great if the folks at Aiven could address the concerns > > > > regarding queue and transactions support. I’m not of the opinion that > > > these > > > > things need to ship with v1, but it would be wise to ensure nothing > in > > > the > > > > architecture blocks these features from being shipped in the future > > > > > > > > KIP-1176 is also very cool, addressing the acks=1 case will still be > > > > necessary. I think it’s a necessary feature to implement, but I’d be > > > > disappointed if that’s the only diskless solution the community > agrees > > > on. > > > > > > > > A good path, if possible, may be to merge KIP-1150 and KIP-1176. > > > > > > > > If instead the community decides leaderless isn’t necessary, then > > > KIP-1183 > > > > seems fit. > > > > > > > > That’s my opinion. Happy to hear if anyone disagrees. > > > > > > > > On 2025/08/05 14:30:45 Josep Prat wrote: > > > > > Hi Luke and community! > > > > > > > > > > Luke, thank you for being proactive and caring about this topic! > > > > > > > > > > In the meantime we have been keeping ourselves busy pushing our > > > > > implementation of KIP-1150 to production to validate our > assumptions > > > and > > > > > confirm its strengths while discovering its weaknesses. > > > > > Now, after gathering some experience running it, we are (as I'm > > writing > > > > > this, gathered in the same room) working on an improved proposal > for > > > > > KIP-1150 that also addresses the concerns from the community. > > > > > We expect to share the updated KIP in the next couple of weeks. > > > > > > > > > > We apologize for the recent period of silence and are committed to > > more > > > > > regular communication as we move forward. > > > > > > > > > > Best, > > > > > > > > > > > > > > > On Tue, Aug 5, 2025 at 10:31 AM Luke Chen <show...@gmail.com> > wrote: > > > > > > > > > > > Hi all, > > > > > > > > > > > > The Kafka community is currently seeing an unprecedented > situation > > > with > > > > > > three KIPs (KIP-1150, IP-1176, KIP-1183) simultaneously > addressing > > > the > > > > same > > > > > > challenge of high replication costs when running Kafka across > > > multiple > > > > > > cloud availability zones. Each KIP offers a different solution to > > > this > > > > > > issue. While diversity of innovative ideas is a key strength of > > > > open-source > > > > > > projects, it creates a burden for reviewers and users who must > > > compare > > > > and > > > > > > comment on multiple proposals simultaneously. Furthermore, > > discussion > > > > > > around the three KIPs has stalled for over two months now. This > > could > > > > be > > > > > > due to the authors being hesitant to proceed due to the existence > > of > > > > > > alternative, potentially conflicting, solutions. Addressing > > > replication > > > > > > cost is a key concern of Kafka’s userbase and we should try to > move > > > the > > > > > > conversation forward if we can. > > > > > > > > > > > > From what I understand, these three KIPs are not mutually > > exclusive. > > > > But > > > > > > adopting all three KIPs in the community might not be what we > > expect. > > > > Thus, > > > > > > I would like to *start a discussion on how we could move the > > > > conversation > > > > > > forward*. > > > > > > > > > > > > To save time for the KIP readers/reviewers, I have created this > > > > document > > > > > > < > > > > > > > > > > > > > > > > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcwiki.apache.org%2Fconfluence%2Fdisplay%2FKAFKA%2FThe%2BPath%2BForward%2Bfor%2BSaving%2BCross-AZ%2BReplication%2BCosts%2BKIPs&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226724065%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=XlcXhHw7k%2FNatVfCW%2Fe9geGTolCUGSpQspCGTiPpyEc%3D&reserved=0 > < > https://cwiki.apache.org/confluence/display/KAFKA/The+Path+Forward+for+Saving+Cross-AZ+Replication+Costs+KIPs > > > > > > > > >[1] > > > > > > to help summarize each of the KIPs and describe their current > > status. > > > > *Hope > > > > > > to get some suggestions/feedback from the community*. > > > > > > > > > > > > > > > > > > [1] > > > > > > > > > > > > > > > > > > > > > > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcwiki.apache.org%2Fconfluence%2Fdisplay%2FKAFKA%2FThe%2BPath%2BForward%2Bfor%2BSaving%2BCross-AZ%2BReplication%2BCosts%2BKIPs&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226744629%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=4uiW8wx%2FXLuQbpkOoUw1U15XzxoEp7AvUuBM%2Fi9m%2Fsg%3D&reserved=0 > < > https://cwiki.apache.org/confluence/display/KAFKA/The+Path+Forward+for+Saving+Cross-AZ+Replication+Costs+KIPs > > > > > > > > > > > > > > KIP-1150: > > > > > > > > > > > > > > > > > > > > > > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcwiki.apache.org%2Fconfluence%2Fdisplay%2FKAFKA%2FKIP-1150%253A%2BDiskless%2BTopics&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226758530%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=NzLNnkDFtqpYKNqK9lJHOhZQjdEipwpb%2Fl4nbP3Z9yE%3D&reserved=0 > < > https://cwiki.apache.org/confluence/display/KAFKA/KIP-1150%3A+Diskless+Topics > > > > > > > > KIP-1176 > > > > > > < > > > > > > > > > > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcwiki.apache.org%2Fconfluence%2Fdisplay%2FKAFKA%2FKIP-1150%253A%2BDiskless%2BTopicsKIP-1176&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226772010%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=8idg5M0Qlbsb5IPvS7T1YSWbstZAhoPyTZr%2FvtzikLA%3D&reserved=0 > < > https://cwiki.apache.org/confluence/display/KAFKA/KIP-1150%3A+Diskless+TopicsKIP-1176 > > > > > > > > > > > > > : > > > > > > > > > > > > > > > > > > > > > > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcwiki.apache.org%2Fconfluence%2Fdisplay%2FKAFKA%2FKIP-1176%253A%2BTiered%2BStorage%2Bfor%2BActive%2BLog%2BSegment&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226786006%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=g%2BNmHjUy7Y%2BMlwYsoxxXSJnVV%2B5sd6GNAro1Piib70s%3D&reserved=0 > < > https://cwiki.apache.org/confluence/display/KAFKA/KIP-1176%3A+Tiered+Storage+for+Active+Log+Segment > > > > > > > > KIP-1183 > > > > > > < > > > > > > > > > > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcwiki.apache.org%2Fconfluence%2Fdisplay%2FKAFKA%2FKIP-1176%253A%2BTiered%2BStorage%2Bfor%2BActive%2BLog%2BSegmentKIP-1183&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226799957%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=0%2FF1J89TUYaq7fF5Smzajkuzi8lB6e5v8X0O%2Bs88j1Q%3D&reserved=0 > < > https://cwiki.apache.org/confluence/display/KAFKA/KIP-1176%3A+Tiered+Storage+for+Active+Log+SegmentKIP-1183 > > > > > > > > > > > > > : > > > > > > > > > > > > > > > > > > > > > > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcwiki.apache.org%2Fconfluence%2Fdisplay%2FKAFKA%2FKIP-1183%253A%2BUnified%2BShared%2BStorage&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226813508%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=tr7CDNnP9jiRZtzVTSSuorvoE5zL5Z9wOWHSzIJjomY%3D&reserved=0 > < > https://cwiki.apache.org/confluence/display/KAFKA/KIP-1183%3A+Unified+Shared+Storage > > > > > > > > > > > > > > > > > > > > Thank you. > > > > > > Luke > > > > > > > > > > > > > > > > > > > > > -- > > > > > [image: Aiven] < > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.aiven.io%2F&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226826866%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=ZRmCXxxQIuez3qPavKDgVtYBTDGuIXIshSncmR2wDz0%3D&reserved=0 > <https://www.aiven.io/>> > > > > > > > > > > *Josep Prat* > > > > > Sr. Engineering Director, Streaming Services, *Aiven* > > > > > josep.p...@aiven.io | +491715557497 > > > > > aiven.io < > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.aiven.io%2F&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226841631%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=9SypeGgNwa7ZkMoNgaBKWbhAtkyqVS%2BM5FGOHdcJjpE%3D&reserved=0 > <https://www.aiven.io/>> | < > > > > > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.facebook.com%2Faivencloud&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226855142%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=CPZ36zACfoHpZHFBYc6QjpaJIClA6vYTxBCxAiTowdM%3D&reserved=0 > ><https://www.facebook.com/aivencloud> > > > > > < > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Fwww.linkedin.com%2Fcompany%2Faiven%2F&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226868705%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=e9gnyVdYzNfpNEmVrVvXhZimF0t9oKs%2BQTBJeIrq7Dg%3D&reserved=0 > <https://www.linkedin.com/company/aiven/>> < > > > > > https://gbr01.safelinks.protection.outlook.com/?url=https%3A%2F%2Ftwitter.com%2Faiven_io&data=05%7C02%7Cmartin.andersson%40kambi.com%7C6f8f10951991466f010e08ddd50ba60a%7Ce3ec1ec4b9944e9e82e080234621871f%7C0%7C0%7C638900968226881987%7CUnknown%7CTWFpbGZsb3d8eyJFbXB0eU1hcGkiOnRydWUsIlYiOiIwLjAuMDAwMCIsIlAiOiJXaW4zMiIsIkFOIjoiTWFpbCIsIldUIjoyfQ%3D%3D%7C0%7C%7C%7C&sdata=5BmuFb5ElLaHKHejmEf606q9yKwRgLticidEcHOetHQ%3D&reserved=0 > ><https://twitter.com/aiven_io> > > > > > *Aiven Deutschland GmbH* > > > > > Alexanderufer 3-7, 10117 Berlin > > > > > > > > > > Geschäftsführer: Oskari Saarenmaa, Hannu Valtonen, > > > > > > > > > > Kenneth Chen > > > > > Amtsgericht Charlottenburg, HRB 209739 B > > > > > > > > > > > > > > > CONFIDENTIALITY NOTICE: This email message (and any attachment) is > intended only for the individual or entity to which it is addressed. The > information in this email is confidential and may contain information that > is legally privileged or exempt from disclosure under applicable law. If > you are not the intended recipient, you are strictly prohibited from > reading, using, publishing or disseminating such information and upon > receipt, must permanently delete the original and destroy any copies. We > take steps to protect against viruses and other defects but advise you to > carry out your own checks and precautions as Kambi does not accept any > liability for any which remain. Thank you for your co-operation. >