Re: [PR] [SPARK-57688][SQL] Add spark.sql.execution.bypassPartialAggregation to skip partial agg [spark]

via GitHub Thu, 25 Jun 2026 10:19:21 -0700


uros-b commented on code in PR #56777:
URL: https://github.com/apache/spark/pull/56777#discussion_r3476300485



##########
sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala:
##########
@@ -3869,6 +3869,17 @@ object SQLConf {
     .booleanConf
     .createWithDefault(true)
 
+  val BYPASS_PARTIAL_AGGREGATION = 
buildConf("spark.sql.execution.bypassPartialAggregation")
+    .doc("When true, skips the pre-shuffle partial aggregation and runs a 
single Complete-mode " +
+      "aggregation after the shuffle. Bypassing partial aggregation can 
improve performance " +
+      "when group cardinality is high and the pre-shuffle reduction ratio is 
low. " +
+      "When false (default), uses a two-phase Partial+Final aggregation across 
a shuffle. " +
+      "This setting has no effect on queries containing DISTINCT aggregate 
functions, where " +
+      "the partial aggregation phases are required for correctness and are 
always applied.")
+    .version("3.3.1")

Review Comment:
   ```suggestion
       .version("4.3.0")
   ```



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: [PR] [SPARK-57688][SQL] Add spark.sql.execution.bypassPartialAggregation to skip partial agg [spark]

Reply via email to