[ 
https://issues.apache.org/jira/browse/SPARK-41635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Reynold Xin updated SPARK-41635:
--------------------------------
    Description: 
This patch implements GROUP BY ALL, similar to the one initially implemented in 
DuckDB. When specified, the analyzer automatically infers the grouping columns 
based on the expressions specified in the select clause: all expressions that 
don't include any aggregate expressions are pulled implicitly into the grouping 
columns. This avoids users having to specify individually the list of grouping 
columns in most cases.

Examples: 
{noformat}
select key, count, sum(score) from table group by all
-- rewritten to
select key, count, sum(score) from table group by key{noformat}

  was:
This patch implements GROUP BY ALL, similar to the one initially implemented in 
DuckDB. When specified, the analyzer automatically infers the grouping columns 
based on the expressions specified in the select clause: all expressions that 
don't include any aggregate expressions are pulled implicitly into the grouping 
columns. This avoids users having to specify individually the list of grouping 
columns in most cases.

Examples: 

{{select key, count(*), sum(score) from table group by all}}
{{-- rewritten to}}
{{select key, count(*), sum(score) from table group by key}}


> GROUP BY ALL
> ------------
>
>                 Key: SPARK-41635
>                 URL: https://issues.apache.org/jira/browse/SPARK-41635
>             Project: Spark
>          Issue Type: New Feature
>          Components: SQL
>    Affects Versions: 3.3.0
>            Reporter: Reynold Xin
>            Assignee: Reynold Xin
>            Priority: Major
>
> This patch implements GROUP BY ALL, similar to the one initially implemented 
> in DuckDB. When specified, the analyzer automatically infers the grouping 
> columns based on the expressions specified in the select clause: all 
> expressions that don't include any aggregate expressions are pulled 
> implicitly into the grouping columns. This avoids users having to specify 
> individually the list of grouping columns in most cases.
> Examples: 
> {noformat}
> select key, count, sum(score) from table group by all
> -- rewritten to
> select key, count, sum(score) from table group by key{noformat}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org

Reply via email to