[ https://issues.apache.org/jira/browse/SPARK-22230?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Shixiong Zhu updated SPARK-22230: --------------------------------- Fix Version/s: 2.3.0 > agg(last('attr)) gives weird results for streaming > -------------------------------------------------- > > Key: SPARK-22230 > URL: https://issues.apache.org/jira/browse/SPARK-22230 > Project: Spark > Issue Type: Bug > Components: Structured Streaming > Affects Versions: 2.2.0 > Reporter: Jose Torres > Priority: Minor > Fix For: 2.3.0 > > > In stream aggregation, last('attr) yields the last value from the first > microbatch forever. I'm not sure if it's fair to call this a correctness bug, > since last doesn't have strong correctness semantics, but ignoring all rows > past the first microbatch is at least weird. > Simple repro in StreamingAggregationSuite: > val input = MemoryStream[Int] > val aggregated = input.toDF().agg(last('value)) > testStream(aggregated, OutputMode.Complete())( > AddData(input, 1, 2, 3), > CheckLastBatch(3), > AddData(input, 4, 5, 6), > CheckLastBatch(6) // actually yields 3 again -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org