viirya commented on a change in pull request #33691:
URL: https://github.com/apache/spark/pull/33691#discussion_r687062709
##########
File path: docs/structured-streaming-programming-guide.md
##########
@@ -1134,6 +1134,73 @@ sessionizedCounts = events \
 </div>
 </div>
+Instead of static value, we can also provide an expression to specify gap duration dynamically
+based on the input row. Note that the rows with negative or zero gap duration will be filtered
+out from the aggregation.
+
+<div class="codetabs">
+<div data-lang="scala" markdown="1">
+
+{% highlight scala %}
+import spark.implicits._
+
+val events = ... // streaming DataFrame of schema { timestamp: Timestamp, userId: String }
+
+val sessionWindow = SessionWindow($"timestamp".expr, when($"userId" === "user1", "5 seconds")
+  .when($"userId" === "user2", "20 seconds")
+  .otherwise("5 minutes")

Review comment:
   Oops!

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org