[jira] [Commented] (DRILL-6032) Use RecordBatchSizer to estimate size of columns in HashAgg

ASF GitHub Bot (JIRA) Tue, 27 Feb 2018 18:48:37 -0800

    [ 
https://issues.apache.org/jira/browse/DRILL-6032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16379694#comment-16379694
 ]


ASF GitHub Bot commented on DRILL-6032:
---------------------------------------

Github user Ben-Zvi commented on a diff in the pull request:

    https://github.com/apache/drill/pull/1101#discussion_r171130622
  
    --- Diff: exec/java-exec/src/main/resources/drill-module.conf ---
    @@ -427,8 +427,8 @@ drill.exec.options: {
         exec.enable_union_type: false,
         exec.errors.verbose: false,
         exec.hashagg.mem_limit: 0,
    -    exec.hashagg.min_batches_per_partition: 2,
    -    exec.hashagg.num_partitions: 32,
    +    exec.hashagg.min_batches_per_partition: 1,
    --- End diff --
    
    This option was meant to create a "slack". **1** is the lowest value - 
requiring only 1 batch per each partition, i.e., no slack; so that requires the 
memory computations to be more precise now !!



> Use RecordBatchSizer to estimate size of columns in HashAgg
> -----------------------------------------------------------
>
>                 Key: DRILL-6032
>                 URL: https://issues.apache.org/jira/browse/DRILL-6032
>             Project: Apache Drill
>          Issue Type: Improvement
>            Reporter: Timothy Farkas
>            Assignee: Timothy Farkas
>            Priority: Major
>             Fix For: 1.13.0
>
>
> We need to use the RecordBatchSize to estimate the size of columns in the 
> Partition batches created by HashAgg.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (DRILL-6032) Use RecordBatchSizer to estimate size of columns in HashAgg

Reply via email to