AngersZhuuuu opened a new pull request #29087: URL: https://github.com/apache/spark/pull/29087
### What changes were proposed in this pull request?

Spark SQL currently cannot run a query that combines `TRANSFORM` with aggregation, such as:

```
SELECT TRANSFORM ( d2, max(d1) as maxd1, cast(sum(d3) as string))
USING 'cat' AS (a,b,c)
FROM script_trans
WHERE d1 <= 100
GROUP BY d2
HAVING maxd1 > 0
```

Hive supports this kind of SQL. The gap makes SQL migration difficult and complex: without this support we need to change too many jobs. This PR adds support for using aggregation with `TRANSFORM`, making SQL migration from Hive to Spark easier.

### Why are the changes needed?

Support `TRANSFORM` with aggregation to make SQL migration smoother.

### Does this PR introduce _any_ user-facing change?

Users can write `TRANSFORM` queries with aggregation.

### How was this patch tested?

Added UT.
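For readers unfamiliar with `TRANSFORM`, the following is a rough plain-Python sketch of what the query in the description is expected to compute. It is not Spark code and not the PR's implementation. The sample rows and the function name `transform_with_aggregation` are hypothetical, and the sketch assumes the default script I/O of tab-separated lines with string-typed output columns.

```python
import subprocess
from collections import defaultdict

# Hypothetical sample rows standing in for script_trans: (d1, d2, d3)
rows = [(1, "x", 10), (5, "x", 20), (200, "x", 99), (3, "y", 7), (-4, "z", 1)]


def transform_with_aggregation(rows, script=("cat",)):
    # WHERE d1 <= 100, then GROUP BY d2
    groups = defaultdict(list)
    for d1, d2, d3 in rows:
        if d1 <= 100:
            groups[d2].append((d1, d3))

    # Compute max(d1) and cast(sum(d3) as string), then apply HAVING maxd1 > 0
    aggregated = []
    for d2, vals in sorted(groups.items()):
        maxd1 = max(d1 for d1, _ in vals)
        sumd3 = str(sum(d3 for _, d3 in vals))
        if maxd1 > 0:
            aggregated.append((d2, maxd1, sumd3))

    # TRANSFORM ... USING 'cat': feed tab-separated lines to the external script
    # (assumed default serialization; requires `cat` on the PATH)
    payload = "".join("\t".join(map(str, r)) + "\n" for r in aggregated)
    out = subprocess.run(
        list(script), input=payload, capture_output=True, text=True, check=True
    ).stdout

    # AS (a, b, c): split each output line back into three string columns
    return [tuple(line.split("\t")) for line in out.splitlines()]


result = transform_with_aggregation(rows)
```

The point of the sketch is the ordering: filtering, grouping, aggregation and the `HAVING` check all happen before the rows reach the script, so the script only sees one line per surviving group.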
