[GitHub] [spark] AngersZhuuuu opened a new pull request #29087: [SPARK-28227][SQL] Support TRANSFORM with aggregation

GitBox Mon, 13 Jul 2020 06:20:15 -0700


AngersZhuuuu opened a new pull request #29087:
URL: https://github.com/apache/spark/pull/29087



   ### What changes were proposed in this pull request?
   Spark SQL currently does not support queries that combine TRANSFORM with aggregation, such as:
   ```
   SELECT TRANSFORM ( d2, max(d1) as maxd1, cast(sum(d3) as string))
   USING 'cat' AS (a,b,c)
   FROM script_trans
   WHERE d1 <= 100
   GROUP BY d2
   HAVING maxd1 > 0
   ```
   
   Hive, however, supports this kind of query.
   This gap makes SQL migration difficult and complex: without this support,
   too many jobs need to be changed.
   This PR adds support for using aggregation with TRANSFORM, which makes SQL
   migration from Hive to Spark easier.
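
   To illustrate the kind of change a job might otherwise need, here is a minimal sketch of one possible rewrite: aggregate in a subquery, then apply TRANSFORM to its result. This is an assumption for illustration, not something taken from this PR; the aliases `t` and `sumd3` are hypothetical, and the HAVING clause repeats the aggregate instead of relying on the `maxd1` alias.

   ```sql
   -- Hypothetical workaround: do the aggregation first, then pipe the
   -- already-aggregated rows through the script.
   SELECT TRANSFORM (d2, maxd1, sumd3)
   USING 'cat' AS (a, b, c)
   FROM (
     SELECT d2,
            max(d1) AS maxd1,
            cast(sum(d3) AS string) AS sumd3
     FROM script_trans
     WHERE d1 <= 100
     GROUP BY d2
     HAVING max(d1) > 0
   ) t
   ```

   Applying a rewrite of this shape across many existing Hive jobs is the kind of migration cost this PR aims to avoid.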
   
   ### Why are the changes needed?
   Supporting TRANSFORM with aggregation makes SQL migration from Hive smoother.
   
   
   ### Does this PR introduce _any_ user-facing change?
   Yes. Users can now write TRANSFORM queries that include aggregation.
   
   
   ### How was this patch tested?
   Added unit tests (UT).


