What is the current canonical way to join more than 2 watermarked streams (Spark 3.5.6)?

cheapsolutionarchit...@gmail.com Wed, 25 Jun 2025 00:46:50 -0700

Hi,

Given two Spark-Structured streams and using them ashttps://spark.apache.org/docs/3.5.6/structured-streaming-programming-guide.html#inner-joins-with-optional-watermarking,just works.

Now if I want to join three streams using the same technique, Sparkcomplains about multiple possible watermarks. I have a roughunderstanding of what happened, and concluded this works as designed.

But as I am certainly not the only one who tried that, what is thecanonical way of doing this? My first idea was like: I'm going to joinS1,S2 with their corresponding watermarks, then write that result todisk, possibly into a delta table, read the result with another streamand join this one with the remaining stream and third watermark.

Is there some other way? Or is this the current canonical way of joiningmore than two streams that carry a watermark?


Best Regards

M.



---------------------------------------------------------------------
To unsubscribe e-mail: user-unsubscr...@spark.apache.org

What is the current canonical way to join more than 2 watermarked streams (Spark 3.5.6)?

Reply via email to