Github user ilooner commented on a diff in the pull request: https://github.com/apache/drill/pull/1101#discussion_r171133616 --- Diff: exec/java-exec/src/main/resources/drill-module.conf --- @@ -427,8 +427,8 @@ drill.exec.options: { exec.enable_union_type: false, exec.errors.verbose: false, exec.hashagg.mem_limit: 0, - exec.hashagg.min_batches_per_partition: 2, - exec.hashagg.num_partitions: 32, + exec.hashagg.min_batches_per_partition: 1, --- End diff -- @Ben-Zvi This setting controls the minimum number of batches kept in memory per partition. Making this larger will cause us to consume more memory. Making it smaller makes us consume less memory. Also in general the purpose of this PR was to make the memory calculations more precise and deterministic and it passes all regression tests.
---