gaurav created SPARK-9592:
-----------------------------

             Summary: First and Last aggregates are calculating the values for entire DataFrame partition not on GroupedData partition.
                 Key: SPARK-9592
                 URL: https://issues.apache.org/jira/browse/SPARK-9592
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 1.4.0, 1.5.0
            Reporter: gaurav
            Priority: Minor
             Fix For: 1.5.0

In the current implementation, the First and Last aggregates compute their values over the entire DataFrame partition, and the same value is then returned for every GroupedData group in that partition.

sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregates.scala

The fix makes the First and Last aggregates compute the first and last value per GroupedData group instead of over the entire DataFrame.
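
To illustrate the difference described above, here is a minimal sketch in plain Python. It is not Spark code and does not reproduce the logic in aggregates.scala; the sample rows and the function names `first_last_whole_partition` and `first_last_per_group` are hypothetical, chosen only to contrast the reported behavior with the intended one.

```python
# Hypothetical rows of a single partition, in arrival order: (group key, value).
partition = [("a", 1), ("a", 2), ("b", 3), ("b", 4)]


def first_last_whole_partition(rows):
    """Reported behavior: one (first, last) pair is taken from the whole
    partition and then returned for every group in it."""
    first, last = rows[0][1], rows[-1][1]
    keys = dict.fromkeys(key for key, _ in rows)  # distinct keys, in order
    return {key: (first, last) for key in keys}


def first_last_per_group(rows):
    """Intended behavior: first and last are tracked separately per group key."""
    result = {}
    for key, value in rows:
        if key not in result:
            result[key] = (value, value)           # first row seen for this group
        else:
            result[key] = (result[key][0], value)  # keep first, update last
    return result


buggy = first_last_whole_partition(partition)
fixed = first_last_per_group(partition)
print(buggy)  # {'a': (1, 4), 'b': (1, 4)}
print(fixed)  # {'a': (1, 2), 'b': (3, 4)}
```

With the whole-partition computation, groups "a" and "b" both report (1, 4); with per-group tracking, each group reports its own first and last values.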