Re: [GSoC 2026] Kafka Streams Runner — design doc + proposal for feedback

Jan Lukavský Mon, 04 May 2026 03:12:45 -0700

Thank you for the design document, Junaid.

We would like to ask anyone interested to comment on the described design so that we can possibly catch any rough edges before implementation starts.


Thanks!
 Jan

On 5/4/26 10:26, Junaid wrote:

Hi Beam devs,
I’m *Junaid Shaukat*, a GSoC 2026 contributor for Apache Beam, mentored by *Jan Lukavský* on a Portable Kafka Streams runner (#18479 <https://github.com/apache/beam/issues/18479>).
Per my mentor’s suggestion, I’m re-sharing the design document and proposal here to get broader community feedback and traction.
*Design doc:* https://docs.google.com/document/d/1BBMURhSG4SxPcvvnKMTrmnKCr_jhXL6R4TBDBW7zsy8/edit?usp=sharing
*Proposal:* https://docs.google.com/document/d/1NbFrw_-krXNM_0t4XFaa6WLIM-xK7IdCdP0PFTXIRi8/edit?usp=sharing
*What we’re aiming for (v1 skeleton)*: a Beam portable runner on Kafka Streams using the Processor API: ingestion, fused executable stages via the Fn API, internal repartitions for GBK/Combine, RocksDB-backed state, and correctness-oriented execution aligned with the prototype design.
*Feedback I’d especially appreciate*

*1. Watermark management:* per-partition progress and how to propagate a safe event-time frontier across stages (including after repartition), especially in the “vector clock / frontier” style we sketch in the doc.
*2. Partition + task metadata:* how to treat changing partition assignment / rescaling in Kafka Streams while keeping watermark semantics sound.
*3. Anything risky or missing* in translation boundaries, bundles/commits, or ValidatesRunner expectations we should address before the first upstream PRs land.

I’d be grateful for any comments, from “high-level direction” to “this paragraph is wrong because…”. I’ll incorporate feedback into the doc and post a short summary of decisions back on the GitHub issue.
Thank you for your time.

Best,
Junaid Shaukat
https://github.com/junaiddshaukat
