Github user kiszk commented on a diff in the pull request:
https://github.com/apache/spark/pull/21931#discussion_r208561224
--- Diff:
sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala ---
@@ -1437,6 +1437,16 @@ object SQLConf {
.intConf
.createWithDefault(20)
+ val FAST_HASH_AGGREGATE_MAX_ROWS_CAPACITY_BIT =
+ buildConf("spark.sql.codegen.aggregate.map.row.capacitybit")
+ .internal()
+ .doc("Capacity for the max number of rows to be held in memory by
the fast hash aggregate " +
+ "product operator. the bit not for actual value, but the actual
numBuckets is determined " +
+ "by loadFactor (e.g: default bit value 16 , the actual numBuckets
is ((1 >> 16) / 0.5).")
--- End diff --
nit: `1 >> 16` -> `1 << 16`
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]